Download Allen Institute for Artificial Intelligence: Vision, Projects, Results

Document related concepts

History of artificial intelligence wikipedia , lookup

Transcript
Allen Institute for Artificial Intelligence:
Vision, Projects, Results
Peter Turney
September 2015
Epigraph
“What if a cure for an intractable cancer is hidden within the
tedious reports on thousands of clinical studies? In 20 years'
time, AI will be able to read — and more importantly,
understand — scientific text. These AI readers will be able to
connect the dots between disparate studies to identify novel
hypotheses and suggest experiments which would otherwise
be missed. AI-based discovery engines will help to find the
answers to science's thorniest problems and ultimately
revolutionise science.”
Oren Etzioni
CEO, Allen Institute for Artificial Intelligence
Wired Magazine, September, 2015
2
Outline
▪ The Vision of AI2
▪
AI for the Common Good
▪ Project Aristo
▪
Answering science questions
▪ Semantic Scholar Project
▪
Semantic literature search
▪ Project Plato
▪
Extracting knowledge from images, diagrams, and videos
▪ Project Euclid
▪
Solving math and geometry problems
▪ The Future of AI
▪
3
human intelligence + human compassion + AI assistance
Background
▪ Paul Allen
▪
▪
philanthropist
Microsoft co-founder
▪ Allen Institute for Brain Science
▪
▪
founded 2003
dedicated to understanding how the human brain works
▪ Allen Institute for Artificial Intelligence
▪
founded 2013
▪
dedicated to constructing AI systems with reasoning, learning, and
reading capabilities
▪ Allen Institute for Cell Science
4
▪
founded 2014
▪
dedicated to understanding the role of genes in living cells, in health
and in disease
Allen Institute for Artificial Intelligence
5
Vision of AI2
▪ AI for the Common Good
▪
mission is to contribute to humanity through high-impact AI research
and engineering
▪ focus is on mastering math and science
▪
▪
to help students learn math and science
to assist scientists in their daily research tasks
▪ evaluate progress of research against human
benchmarks
▪
▪
pass human science exams, solve school math and geometry problems
acquire knowledge from textbooks, understand diagrams, tables
▪ help solve the big problems
▪
▪
▪
▪
6
climate change
loss of species diversity
human health and disease
...
Project Aristo
7
Project Aristo
▪ answering science questions
▪
using human exams as a benchmark of progress
▪ currently working with grade 4 exams
▪
then grade 8, grade 12, university, …
▪ various types of question
▪
▪
▪
8
purely textual, multiple-choice questions
mix of text and diagrams, multiple-choice questions
interpretation of diagrams, graphs, tables is particularly challenging
Project Aristo
▪ questions that combine text and diagrams are difficult for
computers
▪
45% of the grade 4 exam questions involve diagrams
▪ which letter shows runoff?
9
Project Aristo
▪ which letter points to the plant structure that takes in
water and nutrients?
10
Project Aristo
▪ try Aristo at http://allenai.org/aristo.html
11
Project Aristo
▪ sample questions from the online demo
12
Project Aristo
▪ What does Aristo need to know to pass the exam?
13
Project Aristo
▪ Analysis of 2500 key terms for grade 4 science
14
Project Aristo
▪ Types of questions
15
Project Aristo
▪ A grade 4 science question
In New York State, the longest period of daylight occurs
during which month?
(A) June
(B) March
(C) December
(D) September
▪ How to get a computer to select the right answer?
16
Project Aristo
▪ The high road — continuous model with space and time
▪
▪
17
powerful model, enables complex reasoning
difficult to implement, difficult to learn automatically
Project Aristo
▪ The medium road — reasoning with logic
▪
challenges: linguistic variability, polysemy, paraphrase, synonymy,
complex reasoning, multiple steps, common sense knowledge
In New York State, the longest period of daylight
occurs during which month?
New York State
→ Northern Hemisphere
Longest Period of Daylight
→ Summer Solstice
Summer Solstice & Northern Hemisphere
→ June 20 ∨ June 21 ∨ June 22
June 20 ∨ June 21 ∨ June 22
→ June
18
Project Aristo
▪ The low road — table lookup
▪
19
semi-structured, easy for humans, easy for machines
Project Aristo
▪ The sidewalk — statistical association
▪
simple, robust, easy, effective
In New York State, the longest period of daylight occurs during which
month?
(A) June
(B) March
(C) December
(D) September
association(“june”, “longest ... daylight”)
association(“march”, “longest ... daylight”)
association(“december”, “longest ... daylight”)
association(“september”, “longest ... daylight”)
20
= 6.448
= 0.000
= 4.683
= 0.000
Project Aristo
▪ results so far
▪
▪
purely textual (no diagrams), multiple-choice questions
passing grades 4 and 8, but failing grade 12
9-10 years old
21
13-14 years old
17-18 years old
Project Aristo
▪ Project Aristo: Publications
▪
22
http://allenai.org/papers.html
Semantic Scholar Project
23
Semantic Scholar Project
▪ semantic literature search
▪
▪
24
114 million scholarly documents online
making search better for researchers
Semantic Scholar Project
▪ launching in November
▪
25
sign up at http://allenai.org/semantic-scholar.html
Semantic Scholar Project
▪ … definitions of terms (NLP), most cited papers in area, ...
26
Semantic Scholar Project
▪ … title, authors (with links), PDF, abstract, ...
27
Semantic Scholar Project
▪ … thumbnails of figures and tables, with zoom, ...
28
Semantic Scholar Project
▪ … papers that this paper cites ...
29
Semantic Scholar Project
▪ … papers that cite this paper
30
Semantic Scholar Project
▪ Semantic Scholar Project: Publications
▪
31
http://allenai.org/papers.html
Project Plato
32
Project Plato
▪ extracting knowledge from
images, diagrams, and videos
33
▪
focused on extracting visual knowledge
from images and videos
▪
aim is to enrich knowledge bases that are
conventionally derived from textual resources
Project Plato
▪ LEVAN: Learning EVerything about ANything
34
Project Plato
35
Project Plato
▪ LEVAN
▪ similar images with different captions
▪
maybe the captions mean similar things?
▪ similar captions with different images
▪
maybe the images contain similar things?
▪ learn the relations between captions and images
▪
36
analyze huge collection of images and text from the web
Project Plato
▪ VisKE: Visual Knowledge Extraction
▪ learn facts about the world by combining visual and
textual information
37
Project Plato
▪ VisKE
38
Project Plato
▪ Visual Entailment
Horse Racing
Horse Running
39
Project Plato
▪ Visual Entailment
Horse Racing
Horse Running
40
Project Plato
▪ DEVAN: understanding diagrams
41
Project Plato
▪ Past work on image retrieval from sketches
▪
42
problem is that professional diagrams are not like amateur sketches
Project Plato
▪ amateur frog sketches
43
Project Plato
▪ Where can we get enough good diagrams to train a
diagram understanding system?
▪ Automatically generate diagrams from photographs!
Photograph
44
Automatically Generated Diagram
Project Plato
▪ now we can recognize diagrams of frogs
45
Project Plato
▪ Project Plato: Publications
▪
46
http://allenai.org/papers.html
Project Euclid
47
Project Euclid
▪ Solving math and geometry problems
48
▪
multiple-choice math and geometry problems
in standardized tests
▪
geometry problems require us to combine text
and diagram understanding
Project Euclid
SAT Geometry Problems
In the diagram at the right, secant
AB intersects circle O at D, secant
AC intersects circle O at E, AE = 4,
AC = 24, and AB = 16. Find AD.
(A) 5
(B) 6
(C) 4
(D) 10
49
Project Euclid
50
Project Euclid
▪
51
3 Year Challenge: To produce an end-to-end pipeline that outperforms the
average test-taker on the entire math portion of the SAT
Project Euclid
▪ Project Euclid: Publications
▪
52
http://allenai.org/papers.html
The Future of AI
53
The Future of AI
▪ AI has been getting some bad press lately
▪
▪
▪
Elon Musk
Stephen Hawking
Bill Gates
▪ Future of Life Institute
▪
▪
$10 million donation from Elon Musk
aimed at keeping AI beneficial to humanity
▪ AI2 position
54
▪
▪
▪
worth thinking about risks
Oren Etzioni (CEO of AI2) wrote a paper about Asimov’s Laws in 1994
I wrote a paper about Asimov’s Laws in 1991
▪
the benefits of AI2’s vision of an assistant for scientists are far greater
than the risks
The Future of AI
▪ Andrew Ng (Stanford, Google, Baidu)
“I don’t work on preventing AI from turning evil for the same reason
that I don’t work on combating overpopulation on the planet Mars ...
Hundreds of years from now when hopefully we’ve colonized Mars,
overpopulation might be a serious problem and we’ll have to deal
with it. ... it’s just not productive to work on that right now.”
55
The Future of AI
▪ Eric Horvitz (Microsoft Research)
“It's the absence of AI technologies that is already killing people
through errors.”
▪ medical errors and ineffective disease treatment that could be
prevented and improved using AI
“In the end we'll be able to get incredible benefits from machine
intelligence in all realms of life, from science to education to
economics to daily life.”
56
The Future of AI
▪ autonomy versus intelligence
high
autonomy
low
autonomy
autonomous drones
software viruses
malware
spyware
word processor
spreadsheet
low
intelligence
57
caution
Project Aristo
Semantic Scholar Project
Project Plato
Project Euclid
high
intelligence
The Future of AI2
▪ The future of AI2
▪
▪
▪
human intelligence + human compassion + AI assistance
human in charge; very limited autonomy for AI
AI helps scientist analyze vast quantities of data
▪ Project Aristo
▪
reading, understanding, filtering scientific text
▪ Semantic Scholar Project
▪
searching, summarizing, navigating, citation analysis of scientific papers
▪ Project Plato
▪
understanding images, learning from images, reasoning with diagrams
▪ Project Euclid
▪
58
understanding, reasoning with mathematics and geometry
The Future
▪ some of the big problems where AI can help scientists
▪
▪
▪
▪
▪
▪
▪
▪
59
mitigate climate change
reduce species extinction
reduce environmental degradation
develop new ways to harness wind and solar energy
develop new ways to store energy
better maintenance of health
better treatment of disease
help with increasing elderly population
The Future
60