History of Artificial Intelligence
Dana Nejedlová, Department of Informatics, Faculty of Economics, Technical University of Liberec

What Is Intelligence?
• Common definition of artificial intelligence:
  – AI is a field which attempts to build intelligent machines and tries to understand intelligent entities.
• But what is intelligence?
  – Learning and manipulating facts, but also creativity, consciousness, emotion, and intuition.
• Can machines be intelligent?
  – It is still not certain whether it is possible to build a machine that has all aspects of intelligence.
  – This kind of research is central to the field of AI.

What Is Artificial Intelligence?
• Building machines that are capable of symbolic processing, recognition, learning, and other forms of inference
• Solving problems that require heuristic search instead of an analytic approach
• Using inexact, missing, or poorly defined information
  – Finding representational formalisms to compensate for this
• Reasoning about the significant qualitative features of a situation
• Working with syntax and semantics
• Finding answers that are neither exact nor optimal but in some sense “sufficient”
• The use of large amounts of domain-specific knowledge
• The use of meta-level knowledge (knowledge about knowledge) to effect more sophisticated control of problem-solving strategies

Before the Creation of Electronic Computers
• Ancient and medieval myths
  – Talos, Pandora, the Golem
    • artificial men, robots
• Research from antiquity to the 17th century
  – Aristotle, Gottfried Wilhelm Leibniz
    • automation of reasoning
  – Thomas Hobbes, René Descartes
    • mechanistic understanding of living beings
• 20th century, 1948 – Norbert Wiener
  – Cybernetics: Or Control and Communication in the Animal and the Machine
    • Intelligent behavior is the result of feedback mechanisms.

The Beginnings of Electronic Computers
• John Louis von Neumann (1903–1957)
  – Von Neumann architecture of a computer
    • Consultations on the EDVAC Project (1945)
  – Game theory (1944)
    • It can be applied to interacting intelligent agents.
  – Cellular automata (1966)
    • They have computational capacity.
• Alan Mathison Turing (1912–1954)
  – Turing machine (1936)
    • formalization of the algorithm, an abstraction of the computer
  – Turing test (1950)
    • a proposal for testing the ability of a machine to demonstrate thinking
  – Programming of the “Manchester Mark I” computer (1949)

The Birth of “Artificial Intelligence”
• John McCarthy used the term “artificial intelligence” for the first time as the topic of the Dartmouth conference in 1956.
  – Venue:
    • Dartmouth College, Hanover, New Hampshire, USA
  – Organizers:
    • John McCarthy, Marvin Minsky, Nathaniel Rochester, and Claude Shannon
  – Participants:
    • Ray Solomonoff, Oliver Selfridge, Trenchard More, Arthur Samuel, Herbert Simon, and Allen Newell
  – Proposal:
    • To prove that every aspect of learning or any other feature of intelligence can be so precisely described that a machine can be made to simulate it.

Approaches to Artificial Intelligence
• Good Old-Fashioned Artificial Intelligence (GOFAI) or symbolic artificial intelligence (John Haugeland, 1985)
  – A program (e.g. a classifier) in the GOFAI style is composed of parts (e.g. rules) that have a clear relation to the real world.
• New-fangled artificial intelligence
  – The most important branch was connectionism – artificial neural networks (McCulloch–Pitts, 1943); a minimal sketch of such a neuron follows this slide.
  – Genetic algorithms (Holland, 1975) and other kinds of biologically inspired information processing
• Strong AI (John Searle, 1980)
  – Artificial intelligence is real intelligence.
  – Solution of complex problems, e.g. robotics.
• Weak AI
  – Artificial intelligence is a mere imitation of real human intelligence.
  – Solution of specific problems that do not cover the whole range of human capabilities, e.g. OCR or chess.
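To make the connectionist idea on this slide concrete, here is a minimal Python sketch of a McCulloch–Pitts-style threshold neuron. It is an illustrative reconstruction rather than code from the slides; the weights and thresholds used to realize AND and OR are chosen by hand for the example.

    def mcp_neuron(inputs, weights, threshold):
        """McCulloch-Pitts-style unit: fire (1) when the weighted sum of binary inputs reaches the threshold."""
        total = sum(w * x for w, x in zip(weights, inputs))
        return 1 if total >= threshold else 0

    # Hand-chosen weights/thresholds (illustrative): AND needs both inputs active, OR needs at least one.
    logic_and = lambda x1, x2: mcp_neuron([x1, x2], weights=[1, 1], threshold=2)
    logic_or = lambda x1, x2: mcp_neuron([x1, x2], weights=[1, 1], threshold=1)

    for a in (0, 1):
        for b in (0, 1):
            print(f"{a} {b}  AND={logic_and(a, b)}  OR={logic_or(a, b)}")

A single unit of this kind can only realize linearly separable functions, a limitation that returns in the later slides on the Perceptron and the XOR problem.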
Motivations for Biologically Inspired Information Processing
• Danny Hillis: The Connection Machine (1985)
  – Machines programmed in the GOFAI style tend to slow down as they acquire more knowledge.
    • They must search their knowledge base.
  – Humans have the opposite property.
    • They have a massively parallel brain architecture.
  – Humans were not produced by an engineering process.
    • They are the result of evolution.
• Marvin Minsky: The Society of Mind (1986)
  – A model of human intelligence built from the interactions of simple parts called agents, which are themselves mindless.
    • It would be difficult to imagine how evolution could shape a single system as complex as the mind.
    • Evolution could, however, shape individual specialized cognitive units and form the mechanisms that enable the modules to interact.
• Marvin Minsky: The Emotion Machine (2006)
  – Emotions are different ways of thinking that our mind uses to increase our intelligence.

Artificial Intelligence Philosophy
• What is intelligence and thinking?
  – Turing test (1950)
  – According to GOFAI, thinking is symbol manipulation, which is why a program in the GOFAI style is thinking.
  – Chinese Room Problem (John Searle, 1980)
    • The thinking of humans and computers is different.
• Is human intelligence inseparable from mind and emotions?
• In what sense can we say that a computer can understand natural language?
• Who is responsible for the decisions made by AI?
• What should be the ethics of people dealing with the creations of artificial intelligence?

Hard Versus Soft Computing
• Good Old-Fashioned Artificial Intelligence
  – IF–THEN rules
  – Heuristics
• New-fangled artificial intelligence
  – Neural networks
  – Fuzzy logic
  – Probabilistic reasoning
    • belief networks (Bayesian networks)
    • genetic algorithms
    • chaos theory
    • parts of learning theory (machine learning)

Heuristics
• A problem-solving method that is usually successful but can fail in some situations
• Unclearly defined problems with missing or ambiguous data
  – Medical diagnosis
  – Vision, speech recognition
• Helps to decide among an infinite number of possible interpretations.
• A problem may have an exact solution, but the computational cost of finding it may be prohibitive.
  – Chess, tic-tac-toe, the 15-puzzle or 8-puzzle, scheduling, path-finding…
  – Heuristic evaluation function
    • Evaluates each stage of the solution.
      – Number of conflicts in each of a number of possible schedules
    • Helps to decide on the next step leading to the goal.
      – Selecting the schedule with the minimum number of conflicts for the next small changes, attempting to find a correct schedule (a minimal sketch of this idea follows this slide)
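The scheduling bullet above can be turned into a few lines of code. The sketch below is a hypothetical illustration of a conflict-counting evaluation function driving a simple min-conflicts improvement loop; the course names, student sets, and number of time slots are made up for the example and do not come from the slides.

    import random

    # Toy timetabling problem (made-up data): assign each course to one of SLOTS time slots
    # so that no student has two of their courses in the same slot.
    STUDENTS = {
        "math":    {"ann", "bob"},
        "physics": {"ann", "cid"},
        "history": {"bob", "cid"},
        "art":     {"dee"},
    }
    SLOTS = 3

    def conflicts(schedule):
        """Heuristic evaluation function: the number of student clashes in a schedule."""
        courses = list(schedule)
        return sum(1
                   for i, c1 in enumerate(courses)
                   for c2 in courses[i + 1:]
                   if schedule[c1] == schedule[c2] and STUDENTS[c1] & STUDENTS[c2])

    def min_conflicts(steps=100):
        """Repeatedly move a random course into the slot that minimizes the conflict count."""
        schedule = {c: random.randrange(SLOTS) for c in STUDENTS}
        for _ in range(steps):
            if conflicts(schedule) == 0:
                break
            course = random.choice(list(schedule))
            schedule[course] = min(range(SLOTS),
                                   key=lambda s: conflicts({**schedule, course: s}))
        return schedule, conflicts(schedule)

    print(min_conflicts())

The evaluation function never guarantees an optimal timetable; it only guides the search toward fewer conflicts, which is exactly the trade-off the slide describes.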
Expectations from Artificial Intelligence
• Predictions of Herbert Simon and Allen Newell (Heuristic Problem Solving, 1958) that within ten years
  – a digital computer will be the world's chess champion,
  – a digital computer will discover and prove an important new mathematical theorem,
  – a digital computer will compose critically acclaimed music,
  – most theories in psychology will take the form of computer programs.

Typical AI Problem
• 8 Queens Puzzle
• Is there a way of placing 8 queens on the chessboard so that no two queens can attack each other? (A minimal backtracking sketch follows this slide.)
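As an illustration of the kind of search the 8 queens puzzle calls for, here is a minimal backtracking sketch in Python. The column-by-column formulation is a common textbook approach and an assumption of this example, not something specified in the slides.

    def solve_queens(n=8):
        """Place one queen per column by backtracking so that no two queens attack each other."""
        placed = []  # placed[c] = row of the queen standing in column c

        def safe(row, col):
            # Safe if the new queen shares no row and no diagonal with any earlier queen.
            return all(row != r and abs(row - r) != col - c
                       for c, r in enumerate(placed))

        def place(col):
            if col == n:
                return True          # all n queens placed
            for row in range(n):
                if safe(row, col):
                    placed.append(row)
                    if place(col + 1):
                        return True
                    placed.pop()     # dead end: backtrack and try the next row
            return False

        return placed if place(0) else None

    print(solve_queens())  # one valid placement: the row index chosen for each column

Pruning attacked squares column by column keeps the search small, instead of enumerating all C(64, 8) ≈ 4.4 billion ways of putting 8 queens on 64 squares.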
Hard Problem for AI
• Truncated Chessboard Problem (a chessboard with two diagonally opposite corner squares removed)
• Is there a way of placing dominoes on the board so that each square is covered and each domino covers exactly two squares?

Limitations of Artificial Intelligence
• David Hilbert (1862–1943) and Kurt Gödel (1906–1978)
  – Gödel's Incompleteness Theorem (1931)
    • The consistency of a formal system cannot be proved within the system, because it can contain statements with self-reference – logical paradoxes of the type:
      – This statement is false.
  – Some tasks have no algorithms.
    • The halting problem
      – It is not decidable whether an algorithm will halt or not.
      – The algorithms in question again contain self-reference.
• Complexity theory (NP-completeness, 1971)
  – Some tasks have algorithms, but the computation cannot be completed in practice (on a real computer), because it would take too much time.

Gödel's Incompleteness Theorem
• There are unprovable statements in every axiomatic mathematical system expressive enough to define the set of natural numbers.
• Example theorem: 1 = 2
• Proof of the theorem:
  – If a = b, a ≠ 0, b ≠ 0, then the following two equalities are also true:
    a² – b² = (a – b) ∙ (a + b)
    a² – b² = a² – ab
  – And the following statements can be derived from them:
    a² – ab = (a – b) ∙ (a + b)
    a ∙ (a – b) = (a – b) ∙ (a + b)
    a = a + b
    a = a + a
    a = 2a
    1 = 2
  – (The hidden flaw: since a = b, the step that cancels (a – b) divides by zero.)
• Truth can be verified only when knowledge beyond the arithmetic of finite natural numbers is used.

The Logic Theorist – The First Artificial Intelligence Program
• Allen Newell, J. C. Shaw, and Herbert Simon at the Carnegie Institute of Technology, now Carnegie Mellon University, in 1955
• It produced logic proofs from the book Principia Mathematica (Bertrand Russell and Alfred North Whitehead, 1910).
• It modeled the mental processes of human experts.
  – cognitive science
• To implement the Logic Theorist on a computer, the three researchers developed a programming language, IPL, a predecessor of Lisp.

Programming Languages
• Tasks like natural language processing, knowledge representation, or theorem proving needed a special language allowing the processing of symbolic data.
• Lisp (John McCarthy, USA, 1958) – functional paradigm / list processing
  – A program consists of functions of nested functions.
  – Data and programs are represented the same way: as a list.
    • (+ 1 2 3) is both a list of 4 atoms and a function call returning the value 6.
  – A program can serve as data for another program!
    • A powerful feature allowing flexible and productive coding.
• Prolog (Alain Colmerauer, Europe, 1972) – declarative paradigm / logic programming
  – A program consists of facts and rules.
    • The programmer describes (i.e. declares) a problem.
  – The compiler deduces new facts from them.
    • The programmer does not write the algorithm for the solution.

Programs with Symbolic Artificial Intelligence
• The General Problem Solver (1957)
  – It solved formalized symbolic problems, e.g. mathematical proofs and chess.
• The Geometry Theorem Prover (1958)
  – It proved theorems with the help of explicitly represented axioms.
• SAINT (Symbolic Automatic INTegrator)
  – integral calculus (1961)
• ANALOGY (1963)
  – Picture A is to picture B as picture C is to picture D.
    • IQ tests are used for measuring the intelligence of people.
    • Computers can be programmed to excel in IQ tests.
    • But those programs would be stupid in real-world situations.

Natural Language Processing
• STUDENT (1964, 1967)
  – It solved word problems in algebra.
• SIR (Semantic Information Retrieval, 1968)
  – It read simple sentences and answered questions.
• ELIZA (1965)
  – It simulated a psychologist.
• TLC (Teachable Language Comprehender, 1969)
  – It read text and built a semantic network.
• SUR (Speech Understanding Research, 1971)
  – a 5-year research plan of the ARPA (today DARPA) agency in continuous speech recognition

Expert Systems
• They belong to symbolic AI.
• They use a set of rules and heuristics.
• MACSYMA (MIT, 1968–1982)
  – It performed symbolic mathematical calculations.
• DENDRAL (Stanford, 1965)
  – It identified chemical compounds.
• MYCIN (Stanford, Edward Shortliffe, 1974)
  – It diagnosed infectious blood diseases.
  – Successor systems: EMYCIN, PUFF, INTERNIST/CADUCEUS

Commercial Expert Systems
• PROSPECTOR (SRI, 1974–1983)
  – It analyzed geological data and searched for deposits of minerals.
• XCON – eXpert CONfigurer (CMU, 1978)
  – It configured DEC's VAX computers.
• TEIRESIAS (Stanford, Randall Davis, 1976)
  – Knowledge Acquisition System (KAS)
  – It acquired knowledge from human experts.
  – It built knowledge bases for expert systems.

Robotics
• Marvin Lee Minsky (* 1927)
• Freddy (University of Edinburgh, 1973)
• SHAKEY (SRI, 1969)
• SHRDLU (MIT, Terry Winograd, 1970)
• blocks worlds (MIT, 1970)
  – The robot has to manipulate building blocks according to instructions.
    • computer vision
    • natural language understanding
    • planning

The First Artificial Neural Networks
• Warren McCulloch and Walter Pitts
  – Model of the artificial neuron (1943)
  – A neuron represents a logical function.
• Donald Olding Hebb
  – Rule for neural network training (1949)
• Marvin Minsky and Dean Edmonds built the first computer with a neural network.
  – SNARC (1951)

Other Artificial Neural Networks
• Frank Rosenblatt
  – Perceptron (1957)
    • a single-layer network and its learning rule, capable of learning linearly separable functions
• Bernard Widrow and Marcian Ted Hoff
  – Minimization of the network's mean squared error
  – Delta rule (learning rule of a neural network)
  – ADAptive LINEar elements or neurons, ADALINEs (1960)
  – MADALINEs (1962)
    • multi-layer versions of ADALINEs

Neural Networks Critique
• Book “Perceptrons” (Marvin Minsky and Seymour Papert, 1969)
  – Since single-layer neural networks of the Perceptron type cannot learn the XOR function (it is not linearly separable), it was assumed that multi-layer networks could not learn it either.
  – Hence funding of neural network research was stopped until the beginning of the 1980s.
• But multi-layer neural networks can learn the XOR function (see the sketch after this slide).
• All that is needed for this is to find the right algorithm for their training.
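To back up the claim above that multi-layer networks can handle XOR, here is a minimal Python sketch. The particular wiring (an OR unit and a NAND unit feeding an AND unit) and the hand-set weights are illustrative assumptions; nothing is learned here, which is precisely the gap that training algorithms such as backpropagation, on the next slide, fill.

    def step_unit(inputs, weights, threshold):
        """Threshold unit: fires (1) when the weighted input sum reaches the threshold."""
        return 1 if sum(w * x for w, x in zip(weights, inputs)) >= threshold else 0

    def xor_two_layer(x1, x2):
        """XOR as a two-layer network: OR and NAND hidden units feeding an AND output unit."""
        h_or = step_unit([x1, x2], weights=[1, 1], threshold=1)         # x1 OR x2
        h_nand = step_unit([x1, x2], weights=[-1, -1], threshold=-1)    # NOT (x1 AND x2)
        return step_unit([h_or, h_nand], weights=[1, 1], threshold=2)   # h_or AND h_nand

    for a in (0, 1):
        for b in (0, 1):
            print(a, b, "->", xor_two_layer(a, b))  # prints the XOR truth table

No single threshold unit can separate the XOR inputs with one line, but one hidden layer already gives the network the capacity to represent it.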
Neural Networks Resurrection
• Hopfield net (John Hopfield, 1982)
  – It can learn a number of patterns (pictures).
• Self-Organizing Map (SOM) (Teuvo Kohonen, 1982)
  – It can do unsupervised learning.
• Backpropagation (Arthur Bryson and Yu-Chi Ho, 1969)
  – an algorithm for training a multi-layer neural network
  – It requires the network's neurons not to have a sharp threshold.
  – Because it went unnoticed, it was rediscovered several times in the 1970s and 1980s and popularized in 1986.
• NETtalk (Terry Sejnowski and Charles Rosenberg, 1986)
  – A multi-layer neural network that learned English pronunciation and could generalize.
  – It used the backpropagation algorithm.

The Most Important AI Laboratories
• MIT (Massachusetts Institute of Technology)
  – 1959 – John McCarthy and Marvin Minsky founded the Artificial Intelligence Laboratory.
• Stanford University and SRI (Stanford Research Institute)
  – 1963 – John McCarthy founded the Stanford AI Laboratory (SAIL).
• CMU (Carnegie Mellon University)
  – 1980 – Raj Reddy founded The Robotics Institute.
• IBM
• AT&T Bell Labs
• University of Edinburgh

Present
• Robotic toys, space probes
• Robotics in machinery
• Home appliances (washers, vacuum cleaners)
• Data mining, fraud detection, spam filtering
• Searching on the Internet (web agents)
• Modeling of interactive processes (agents)
• E-business – e-shop personalization
• Intelligent tutoring systems and SW interfaces
• Role-playing games, chess programs
• Speech and video recognition
• Machine translation