Download Artificial Intelligence - Academic year 2016/2017

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Affective computing wikipedia , lookup

Machine learning wikipedia , lookup

Visual Turing Test wikipedia , lookup

Artificial intelligence in video games wikipedia , lookup

Pattern recognition wikipedia , lookup

Technological singularity wikipedia , lookup

Human–computer interaction wikipedia , lookup

Computer Go wikipedia , lookup

Wizard of Oz experiment wikipedia , lookup

Computer vision wikipedia , lookup

Knowledge representation and reasoning wikipedia , lookup

Embodied cognitive science wikipedia , lookup

AI winter wikipedia , lookup

Intelligence explosion wikipedia , lookup

Existential risk from artificial general intelligence wikipedia , lookup

Ethics of artificial intelligence wikipedia , lookup

Philosophy of artificial intelligence wikipedia , lookup

History of artificial intelligence wikipedia , lookup

Transcript
Artificial Intelligence
Academic year 2016/2017
Giorgio Fumera
http://pralab.diee.unica.it
[email protected]
Pattern Recognition and Applications Lab
Department of Electrical and Electronic Engineering
University of Cagliari
Outline
Part I: Historical notes
Part II: Solving Problems by Searching
Uninformed Search
Informed search
Part III: Knowledge-based Systems
Logical Languages
Expert systems
Part IV: The Lisp Language
Part V: Machine Learning
Decision Trees
Neural Networks
Part I
Historical Notes
Artificial Intelligence
A discipline born in the 1950s.
Goal: building “intelligent” machines.
A goal envisaged since a long time before, e.g.:
I
Leonardo da Vinci’s robot
knight (about 1495)
I
Automaton chess player
(“Mechanical Turk”, late
18th century)
I
science fiction
What is “intelligence”?
Broad definition: a set of capabilities that allow humans to
I
learn
I
think
I
understand
I
communicate
I
be self-conscious
I
build abstract models of the world
I
plan
I
adapt to novel external conditions
I
...
Some of these capabilities are exhibited also by animals (e.g.,
associative memory, reacting to stimuli, communicating).
Early investigations: clues from humans
Goal: understanding human intelligence.
I
High-level manifestation: rationality
Logic: Aristotle (4th cent. bc), G.W. Leibniz (17th-18th
cent.), G. Boole (19th cent.), etc.
I
High-level manifestation: behavior and mind
Psychology and cognitive science (since 19th cent.)
I
Low-level biological support: the brain
Neuroanatomy and Neurophysiology (since 19th cent.):
McCulloch and Pitt’s model of neuron (1943), D.O. Hebb’s
theory on neurons as basic units of thought, etc.
Early investigations: clues from technology
Goal: building machines capable of autonomously performing some
task.
I
Automata (e.g., jacquard loom, 1804)
Cybernetics (feedback and control):
N. Wiener, W.R. Ashby (1940s)
I
Statistics and probability as tools to deal
with uncertainty in reasoning and
decision-making: T. Bayes (18th cent.);
K. Pearson, R.A. Fisher, A. Wald,
J. Neyman (late 19th – 20th cent.)
Early investigations: clues from technology
The computer:
I
precursors: mechanical devices: B. Pascal and G.W. Leibniz (17th
cent.), C. Babbage’s analytical engine (19th cent.)
I
from engineering: electromechanical and electronic devices
(1940s): K. Zuse, J.P. Eckert, J.W. Mauchly, J. von Neumann
I
from logic and mathematics: computational theory, the foundation
of computer science: A.M. Turing, A. Church (1930s)
Early investigations: “thinking” computers
Electronic computers: the right tool for building intelligent
machines?
Alan M. Turing’s (1912-1954) main contributions:
I
first investigations into the nature of
computing
I
the “logical computing machine” (Turing
machine): a universal computer
I
envisioning intelligent computers:
Computing Machinery and Intelligence,
Mind, Vol. LIX, No. 263, 433–460, 1950;
operational definition of intelligence: the
Turing Test
The birth of AI
1950s: many researchers from different disciplines investigate
intelligence under two main viewpoints:
I
understanding human intelligence
I
building intelligent machines
Dartmouth College Workshop (USA, summer 1956): official birth
of AI, as a discipline aimed at building intelligent machines.
The founders: J. McCarthy, M. Minsky, A. Newell, H. Simon,
C. Shannon, O. Selfridge, R. Solomonoff, . . .
Early explorations: 1950s and 1960s
Goals: identifying specific tasks that require intelligence, and
figuring out how to get machines to do them.
Great interest in mimicking high-level human thought and mental
abilities, e.g.:
I
reasoning
I
understanding natural language
I
understanding images
and some related low-level abilities:
I
recognizing speech sounds
I
distinguishing objects in images
I
reading cursive script
But: how do humans do that?
Early explorations: 1950s and 1960s
Starting point: toy problems, and some real-world ones
I
game playing 15-puzzle, checkers, chess, etc.
I
theorem proving
I
natural language processing (NLP)
I
recognizing objects in images
Early explorations: 1950s and 1960s
First approaches and tools:
I symbolic (high-level) approach: based on the assumption
that the essence of intelligence is symbol processing
–
–
–
–
I
heuristic search
syntax analysis/generation
symbolic knowledge representation (symbols, lists)
symbolic knowledge processing: new programming languages
(LISP, etc.)
non-symbolic (low-level) approach
– pattern recognition
– artificial neural networks
Heuristic search methods
Symbol processing approach, applied to:
I
game playing: 15-puzzle, checkers, chess (the “Drosophila of AI”)
I
geometric analogy problems
I
theorem proving
I
mechanizing problem solving: A. Newell and H. Simon’s General
Problem Solver (1959)
Heuristic search methods
Common approach:
I
knowledge representation: lists of symbols
(main feature of the LISP language, 1958–, J. McCarthy)
I
search methods: search tree, heuristics
Natural language processing
Symbol processing approach.
Goals: understanding, generating, translating natural language.
Linguistic levels:
I
morphology: word parts (e.g.: walking = walk + -ing)
I
syntax (grammar): rules that define well-formed sentences
(e.g.: John hit the ball: Yes; ball the hit John: No)
I
semantics: meaning of a sentence
I
pragmatics: context and background knowledge, e.g.:
John went to the bank
John threw the ball to the window and broke it
John threw the glass to the wall and broke it
Natural language processing
Main focus of early research: syntactic level.
Seminal work: N. Chomsky, Syntactic Structures, 1957.
Grammar definition: syntax rules for analyzing/generating
sentences; main tool: parse tree.
Applications:
I
question answering (original goal: computer interfaces)
I
machine translation: early optimism, but a very hard task
Pattern recognition
Goal: classifying input signals (images, sounds, electronic signals,
etc.) into one of several categories.
First problems: image classification.
Early efforts: optical character recognition (OCR).
Main approaches:
I
I
template matching
learning
1. image pre-processing (noise filtering, line thickening, edge
enhancement, ...)
2. feature extraction
3. classification “rules” learnt from examples
Artificial neural networks
Non-symbolic (low-level), connectionist approach.
F. Rosenblatt (psychologist): perceptron (1957–) as a potential
model of human learning, cognition and memory:
I
network of McCulloch-Pitts’ neural elements
I
learning algorithm for adjusting connection weights from
examples
First applications: pattern recognition
I
OCR
I
aerial image recognition
Great expansion: mid–1960s to early 1980s
From toy/lab problems to real-world and commercial applications:
I
computer vision
I
mobile robots
I
game playing
I
speech recognition, NLP
I
knowledge representation and reasoning
Research funding:
I
DARPA’s Strategic Computing Program (USA)
I
Fifth Generation Computer Systems (Japan)
I
ESPRIT (Europe)
Great expansion: mid–1960s to early 1980s
Computer vision:
I
Summer Vision project (MIT, 1966)
I
low-level, hierarchical image processing (hints from biology); image
filters; line, corner, surface detection; 3D reconstruction
I
early application: guiding a robot arm to manipulate blocks
I
high-level vision: finding objects in scenes (templates, parts)
I
two main approaches emerge:
– whole scene reconstruction: difficult
– perceiving to guide robot action (purposive vision): easier
Great expansion: mid–1960s to early 1980s
Mobile robots:
I
sensors, actuators, computer vision, environment modeling,
planning
I
route finding: heuristic search, A* algorithm
I
first autonomous vehicles
Great expansion: mid–1960s to early 1980s
Game playing: progress in chess
I
programs that attain human-level (not master) capability
I
investigations into human ability: accumulated knowledge vs
massive search
Great expansion: mid–1960s to early 1980s
NLP: greater success on less ambitious goals
I
improvements in grammars
I
machine translation with humans in the loop
I
declarative (logical languages, inference algorithms) vs procedural
(“hard-wired”) knowledge
I
dialog systems (1971, T. Winograd’s SHRDLU: blocks’ world,
procedural knowledge)
I
speech recognition (easy), and understanding (difficult)
Great expansion: mid–1960s to early 1980s
Knowledge representation and reasoning:
I
consulting / decision support / expert systems
main idea: solving domain-specific problems by embedding
expert knowledge in the form of IF-THEN rules
I
applications: chemistry, medical diagnosis, geology, military;
since 1990s: business
Great expansion: mid–1960s to early 1980s
Summing up, until the 70s AI research is mainly based on the
symbol processing conception of human intelligence
I
main approach to AI: mimicking high-level human abilities
through heuristic search and symbolic processing (“good,
old-fashioned AI”, GOFAI)
I
many successful applications through a pragmatic approach in
specific tasks . . .
I
. . . but very limited achievements with respect to early
expectations for a “general” AI
Mid 1980s: the “AI winter”
Real-world tasks require much more “intelligence” than that
achievable by heuristic search and symbolic processing (GOFAI).
Two main issues emerge:
I
computational complexity: combinatorial explosion
I
need of a large body of knowledge (including common sense)
Also the non-symbolic, connectionist approach (artificial neural
networks) exhibits limitations.
Main consequences:
I
drop of interest in AI
I
scaling back AI’s goals
I
cut of research funds
Mid–1980s to 1990s: technical and theoretical advances
New results in several fields, solid theoretical foundations from:
I
mathematics
I
statistics and probability theory
I
control engineering
Concrete progress in real-world tasks (albeit still far from initial
expectations).
Mid–1980s to 1990s: technical and theoretical advances
Knowledge representation and reasoning:
I
I
new paradigms, e.g.; fuzzy logic, soft computing (inspired by
human mind)
semantic networks, ontologies; e.g.:
– WordNet, http://wordnet.princeton.edu
– BabelNet, http://babelnet.org
I
probabilistic reasoning to overcome the limits of logic
(probabilistic graphical models, Bayesian networks, learning)
Search algorithms: evolutionary approach, genetic algorithms
(inspired by evolution).
Mid–1980s to 1990s: technical and theoretical advances
The rise of machine learning:
I
huge amount of data in digital form become available
I
main idea: automatically inferring knowledge (patterns, rules,
etc.) from data instead of eliciting it from domain experts
I
data analysis methods: data mining, etc.
I
theoretical foundations: statistics
I
novel techniques: inductive logic programming, decision trees,
resurgence of ANNs (1986: back-propagation algorithm),
support vector machines, ensemble methods, etc.
I
applications to many fields: computer vision, natural language
processing, etc.
Mid–1980s to 1990s: technical and theoretical advances
Computer vision:
I
two main approaches persist: scene analysis, purposive vision
I
main achievements: surface, depth; tracking, object
recognition
I
fruitful exchanges with research on animal/human vision
I
novel techniques: hierarchical models, ANNs, deep neural
networks
I
extensive application of machine learning techniques
Mid–1980s to 1990s: technical and theoretical advances
Intelligent Agent architectures:
I
sensor networks
I
autonomous, cooperating robots; emergent behavior
I
the intelligent agent paradigm
A toy (?) example: soccer-playing robots
Mid 1990s – today: main achievements
Original goal (building intelligent machines): still far-reaching.
But: successes in real-world problems, many commercial
applications, many startup companies.
Examples:
I
games: master-level programs in checkers, chess and (very
recently) Go
I
robots: driverless automobiles, space vehicles
I
pervasive applications: home, cars, route finding in maps
(search algorithms), recommender systems (machine learning,
social/collaborative filtering), characters in video games, ...
I
automated (high-frequency) trading
I
medicine
I
business rule management systems
I
translating systems
I
computer vision (face recognition, ...)
Summary of the main approaches to AI
Four distinct approaches to build intelligent machines:
Mind/
thinking
Behavior
Human performance
Systems that think
like humans
(cognitive modeling approach)
Systems that act
like humans
(Turing test approach)
Rationality
Systems that think
rationally
(“law of thought” approach)
Systems that act
rationally
(rational agent approach)
Rational agents (acting rationally): the most general approach,
amenable to scientific/technological development (though, maybe,
not useful enough for understanding human intelligence)
A snapshot of current AI research
Associations:
I
Association for the Advancement of Artificial Intelligence (AAAI)
www.aaai.org
I
European Coordinating Committee for Artificial Intelligence
(ECCAI), www.eccai.org
I
Italian Association for Artificial Intelligence (AI*IA)
www.aixia.it
Conferences:
I
Int. Joint Conf. on Artificial Intelligence, ijcai.org
Scientific journals:
I
Artificial Intelligence
www.journals.elsevier.com/artificial-intelligence
I
J. of Artificial Intelligence Research, www.jair.org
A snapshot of current AI research
Research topics:
I
Machine learning
I
Knowledge representation and reasoning
I
Reasoning under uncertainty or imprecision
I
Natural language
I
Vision
I
Pattern recognition
I
Heuristic search
I
Intelligent robotics
I
Multi-agent systems
I
Planning
I
...
Philosophical issues
A long-standing question: Can machines be intelligent?
Two main hypotheses:
I
Weak AI: machines can emulate intelligence (act intelligently)
I
Strong AI: machines can be intelligent
(if they act intelligently, they are intelligent, e.g.: Turing test)
Another long-standing question: Is (human) mind a machine?
Philosophical issues
Some arguments raised against Weak AI:
I
machines can never do . . . (make mistakes, learn from
experience, have a sense of humor, enjoy ice creams, . . . )
I
machines are formal systems, and formal systems cannot
establish the truth of every mathematical sentence (Gödel
incompleteness theorem), whereas humans (in principle) can
I
human behavior can not be captured by a set of rules
A. Turing’s viewpoint (Mind, 1950):
Can a machine think? is an ill-posed question.
Consider this one: Can a machine fly/swim?
I
airplanes “fly”, but not as birds “fly”
I
ships “swim” in Russian, not in English or in Italian...
Philosophical issues
Some issues about Strong AI:
I
even if machines can emulate intelligence, they cannot be
self-conscious
I
relationship between mental states and body (brain) states
(free will, consciousness, intentions): dualism (R. Descartes,
17th cent.) vs materialism (“brains cause mind”)
I
a machine running the “right” program (e.g., for natural
language understanding) does not necessarily have a mind
(the Chinese room thought experiment, J. Searle, 1980)
I
intelligence is an emerging behavior that can be only
supported by biological brains
Ethical issues
I
Even if we could build intelligent machines, should we?
I
Consequences on humans: loss of jobs, loss of the sense of
being unique, end of human race, etc.
I
Accountability (e.g., driverless cars)
I
...
Ethical concerns are now re-flourishing, as many believe that
human-level AI is now in reach (e.g., current focus of the Future of
Life Institute, http://futureoflife.org).
Some recent projects
Human Brain Project (EU)
https://www.humanbrainproject.eu
Overall goal: understanding the human brain and its
diseases, and emulating its computational capabilities
RoboLaw (EU)
http://www.robolaw.eu
Regulating Emerging Robotic Technologies in Europe:
Robotics facing Law and Ethics