Download Abstract Neuron  { y

Document related concepts

Convolutional neural network wikipedia , lookup

Neuropsychopharmacology wikipedia , lookup

Catastrophic interference wikipedia , lookup

Biological neuron model wikipedia , lookup

Central pattern generator wikipedia , lookup

Neuroesthetics wikipedia , lookup

Artificial neural network wikipedia , lookup

Neurophilosophy wikipedia , lookup

Neural modeling fields wikipedia , lookup

Neural engineering wikipedia , lookup

Development of the nervous system wikipedia , lookup

Channelrhodopsin wikipedia , lookup

Metastability in the brain wikipedia , lookup

Nervous system network models wikipedia , lookup

Types of artificial neural networks wikipedia , lookup

Recurrent neural network wikipedia , lookup

Embodied language processing wikipedia , lookup

Transcript
Abstract Neuron
output y
y {1 if net > 0
0 otherwise
n
net   wiii
i 0
w0
i0=1
w1
i1
w2
i2
wn
...
input i
in
Link to Vision: The Necker Cube
Constrained Best Fit in Nature
inanimate
physics
chemistry
biology
vision
language
animate
lowest energy
state
molecular
minima
fitness, MEU
Neuroeconomics
threats,
friends
errors,
NTL
Triangle Nodes:
Encoding relational information with abstract neurons
• The triangle node (aka 2/3 node) is a useful
function that activates its outputs (3) if any (2)
of its 3 inputs are active
• Such a node will be useful for lots of
representations.
Triangle nodes and McCulloughPitts Neurons
Relation (A)
Object (B) Value (C)
A
B
C
Basic Ideas
• Parallel activation streams.
• Top down and bottom up activation combine to
determine the best matching structure.
• Triangle nodes bind features of objects to values
• Mutual inhibition and competition between
structures
• Mental connections are active neural connections
5 levels of Neural Theory of
Language
Pyscholinguistic
experiments
Spatial
Relation
Motor
Control
Metaphor Grammar
Cognition and Language
abstraction
Computation
Structured Connectionism
Neural Net
Triangle Nodes
SHRUTI
Computational Neurobiology
Biology
Neural
Development
Quiz
Midterm
Finals
Behavioral Experiments
•Identity – Mental activity is Structured Neural
Activity
•Spreading Activation — Psychological model/theory
behind priming and interference experiments
•Simulation — Necessary for meaningfulness and
contextual inference
•Parameters — Govern simulation, strict inference, link to
language
Bottom-up vs. Top-down Processes
• Bottom-up: When processing is driven by
the stimulus
• Top-down: When knowledge and context
are used to assist and drive processing
• Interaction: The stimulus is the basis of
processing but almost immediately topdown processes are initiated
Stroop Effect
• Interference between form and meaning
Name the words
Book Car Table Box Trash Man Bed
Corn Sit Paper Coin Glass House Jar
Key Rug Cat Doll Letter Baby Tomato
Check Phone Soda Dish Lamp Woman
Name the print color of the words
Blue Green Red Yellow Orange Black Red
Purple Green Red Blue Yellow Black Red
Green White Blue Yellow Red Black Blue
White Red Yellow Green Black Purple
Body-Specificity Hypothesis
If concepts and word meanings are constituted, in part, by
mental simulations our own perceptions and actions…
…then their neurocognitive representations should differ
for people with different kinds of bodies, who perceive and
act upon the environment in systematically different ways.
(Casasanto, in review, link on course page)
Testing body-specificity
pinch
Manual Action
chew
Non-Manual Action
Testing body-specificity
Left handed movements
for one block of trials
Right handed movements
for the other block of trials
Design
• 96 Words (48 manual verbs, 48 non-manual verbs)
• 2 blocks (LH movements, RH movements)
• 2 groups of Ss (Left handers, Right handers)
• Dependent measures: RT & Surprise recognition (Old/New)
Move marbles by word color
caress
jab
fling
grip
pound
tap
yank
erase
dial
Manual Action
sigh
cough
growl
watch
crawl
peek
tumble
dance
say
Non-Manual Action
Predictions
If action word meanings are constituted, in part, by mental
simulation of perceptuo-motor experiences, then:
(a) Both online and offline effects of congruity should be
found between manual motor actions and the meanings of
manual action verbs (but not non-manual action verbs).
(b) Right and left handed participants should show opposite
effects of using their right and left hands to move marbles
during incidental encoding of manual action verbs.
(Casasanto, in review)
Marble movement time in ms
Reaction Time Results
700
650
600
***
Left Hand
Right Hand
Left Hand
Right Hand
700
***
550
650
ns
ns
Lefties
Righties
600
550
500
500
450
450
400
400
350
350
300
300
Lefties
Righties
Manual Action Verb
Non-Manual Action Verb
(Casasanto, in review)
Handedness-ness predicts congruity effect
effectininmsms
Congruityeffect
Congruity
300
250
200
150
100
y = 2.01x + 31.39
50
R2 = 0.36, p=.0001
0
50
60
70
80
90
Laterality quotient
ABS
Laterality Quotient
100
Proportion correct recognition
Recognition Memory Results
LeftHand
RightHand
0.80
0.75
0.70
***
***
0.80
0.75
0.70
0.65
0.65
0.60
0.60
0.55
0.55
0.50
0.50
LEFTIES
LeftHand
RightHand
RIGHTIES
Manual Action Verb
ns
ns
LEFTIES
RIGHTIES
Non-Manual Action Verb
(Casasanto, in review)
Procedure for experiment that demonstrates the word-superiority effect. First the word is
presented, then the mask XXXX’s, then the letters.
Word-Superiority Effect
Reicher (1969)
• Which condition resulted in faster & more
accurate recognition of the letter?
– The word condition
– Letters are recognized faster when they are part
of a word then when they are alone
– This rejects the completely bottom-up feature
model
– Also a challenge for serial processing
Connectionist Model
McClelland & Rumelhart (1981)
• Knowledge is distributed and processing
occurs in parallel, with both bottom-up and
top-down influences
• This model can explain the WordSuperiority Effect because it can account for
context effects
Connectionist Model of
Word Recognition
Basic Ideas
• Parallel activation streams.
• Top down and bottom up activation combine to
determine the best matching structure.
• Triangle nodes bind features of objects to values
• Mutual inhibition and competition between
structures
• Mental connections are active neural connections
Interaction in language processing:
Pragmatic constraints on lexical
access
Jim Magnuson
Columbia University
Information integration
• A central issue in psycholinguistics and cognitive
science:
– When/how are such sources integrated?
• Two views
– Interaction
• Use information as soon as it is available
• Free flow between levels of representation
– Modularity
• Protect and optimize levels by encapsulation
• Staged serial processing
• Reanalyze / appeal to top-down information only when needed
Reaction Times in Milliseconds
after: “They all rose”
0 delay
200ms. delay
flower
685
659
stood
677
623
desk
711
652
Example: Modularity and word recognition
• Tanenhaus et al. (1979) [also Swinney, 1979]
– Given a homophone like rose, and a context biased towards one
sense, when is context integrated?
• Spoken sentence primes ending in homophones:
– They all rose
vs. They bought a rose
• Secondary task: name a displayed orthographic word
– Probe at offset of ambiguous word: priming for both
“stood” and “flower”
– 200 ms later: only priming for appropriate sense
• Suggests encapsulation followed by rapid integration
• But the constraint here is weak -- overestimates modularity?
• How could we examine strong constraints in natural contexts?
“They all rose”
triangle nodes:
when two of the
abstract neurons
fire, the third
also fires
model of
spreading
activation
Allopenna, Magnuson & Tanenhaus (1998)
Eye camera
Scene camera
‘Pick up the beaker’
Eye
tracking
computer
Do rhymes compete?
• Cohort (Marlsen-Wilson): onset similarity is
primary because of the incremental nature
of speech
(serial/staged; Shortlist/Merge)
– Cat activates cap, cast, cattle, camera, etc.
– Rhymes won’t compete
• NAM (Neighborhood Activation Model; Luce):
global similarity is primary
– Cat activates bat, rat, cot, cast, etc.
– Rhymes among set of strong competitors
• TRACE (McClelland & Elman): global
similarity constrained by incremental nature
of speech
TRACE predictions
– Cohorts and rhymes compete, but with
different time course
Allopenna et al. Results
Study 1 Conclusions
• As predicted by interactive models, cohorts and
rhymes are activated, with different time courses
• Eye movement paradigm
–
–
–
–
More sensitive than conventional paradigms
More naturalistic
Simultaneous measures of multiple items
Transparently linkable to computational model
• Time locked to speech at a fine grain
Theoretical conclusions
• Natural contexts provide strong constraints that are
used
• When those constraints are extremely predictive,
they are integrated as quickly as we can measure
• Suggests rapid, continuous interaction among
– Linguistic levels
– Nonlinguistic context
• Even for processes assumed to be low-level and
automatic
• Constrains processing theories, also has
implications for, e.g., learnability
Producing words from pictures or from other
words:
A comparison of aphasic lexical access from two
different input modalities
Gary Dell
with
Myrna Schwartz, Dan Foygel, Nadine Martin, Eleanor
Saffran, Deborah Gagnon, Rick Hanley, Janice Kay,
Susanne Gahl, Rachel Baron, Stefanie Abel, Walter
Huber
Boxes and arrows in the linguistic
system
Semantics
Syntax
Lexicon
Input
Phonology
Output
Phonology
Picture Naming Task
Semantics
Say: “cat”
Syntax
Lexicon
Input
Phonology
Output
Phonology
A 2-step Interactive Model of Lexical
Access in Production
Semantic Features
FOG
f
r
d
Onsets
k
DOG
m
CAT
ae
RAT
o
Vowels
MAT
t
g
Codas
Step 1 – Lemma Access
Activate semantic features of CAT
FOG
f
r
d
Onsets
k
DOG
m
CAT
ae
RAT
o
Vowels
MAT
t
g
Codas
Step 1 – Lemma Access
Activation spreads through network
FOG
f
r
d
Onsets
k
DOG
m
CAT
ae
RAT
o
Vowels
MAT
t
g
Codas
Step 1 – Lemma Access
Most active word from proper category is
selected and linked to syntactic frame
NP
FOG
f
r
d
Onsets
k
DOG
m
CAT
ae
RAT
o
Vowels
N
MAT
t
g
Codas
Step 2 – Phonological Access
Jolt of activation is sent to selected word
NP
FOG
f
r
d
Onsets
k
DOG
m
CAT
ae
RAT
o
Vowels
N
MAT
t
g
Codas
Step 2 – Phonological Access
Activation spreads through network
NP
FOG
f
r
d
Onsets
k
DOG
m
CAT
ae
RAT
o
Vowels
N
MAT
t
g
Codas
Step 2 – Phonological Access
Most activated phonemes are selected
FOG
DOG
CAT
RAT
MAT
Syl
On Vo Co
f
r
d
Onsets
k
m
ae
o
Vowels
t
g
Codas
Semantic Error – “dog”
Shared features activate semantic neighbors
NP
FOG
f
r
d
Onsets
k
DOG
m
CAT
ae
RAT
o
Vowels
N
MAT
t
g
Codas
Formal Error – “mat”
Phoneme-word feedback activates formal neighbors
NP
FOG
f
r
d
Onsets
k
DOG
m
CAT
ae
RAT
o
Vowels
N
MAT
t
g
Codas
Mixed Error – “rat”
Mixed semantic-formal neighbors gain activation
from both top-down and bottom-up sources
NP
FOG
f
r
d
Onsets
k
DOG
m
CAT
ae
RAT
o
Vowels
N
MAT
t
g
Codas
Errors of Phonological Access- “dat”
“mat”
Selection of incorrect phonemes
FOG
DOG
CAT
RAT
MAT
Syl
On Vo Co
f
r
d
Onsets
k
m
ae
o
Vowels
t
g
Codas
A Test of the Model:
Picture-naming Errors in Aphasia
“cat”
175 pictures of concrete nouns–Philadelphia Naming Test
94 patients (Broca,Wernicke, anomic, conduction)
60 normal controls
Response Categories
Correct
CAT
Semantic Formal Mixed
DOG
MAT
RAT
Unrelated Nonword
LOG
DAT
Continuity Thesis:
Normal Error Pattern:
97% Correct
cat dog mat rat log dat
Random Error Pattern:
80% Nonwords
cat dog mat rat log dat
Implementing the Continuity Thesis
Random Pattern
Model Random Pattern
cat dog mat rat log dat
1.Set up the model lexicon so
that when noise is very large, it
creates an error pattern similar
to the random pattern.
2. Set processing
parameters of the model so
that its error pattern matches
the normal controls.
Normal Controls
Model Normal Pattern
cat dog mat rat log dat
Lesioning the model: The semanticphonological weight hypothesis
Semantic Features
Semantic-word
weight: S
FOG
f
r
d
Onsets
k
DOG
m
CAT
ae
RAT
o
Vowels
MAT
t
Phonological
word weight:
P
g
Codas
Patient CAT
DOG
MAT
Correct Semantic Formal
LH
RAT
LOG
DAT
Mixed Unrelated Nonword
.71
.03
.07
.01
.02
.15
s=.024 p=.018 .69
.06
.06
.01
.02
.17
IG
.77
s=.019 p=.032 .77
.10
.09
.06
.06
.03
.01
.01
.04
.03
.03
GL
.29
s=.010 p=.016 .31
.04
.10
.22
.15
.03
.01
.10
.13
.32
.30
Representing Model-Patient Deviations
Root Mean Square
Deviation (RMSD)
LH .016
IG .016
GL .043
94 new patients—no exclusions
94.5 % of variance accounted for
Conclusions
The logic underlying box-and-arrow- models
is perfectly compatible with connectionist models.
Connectionist principles augment the boxes and arrows
with
-- a mechanism for quantifying degree of damage
-- mechanisms for error types and hence an
explanation of the error patterns
Implications for recovery and rehabilitation
Behavioral and Imaging
Experiments
Ben Bergen and Shweta Narayan
Do Words and Images Match?
• Behavioral – Image First
Does shared effector slow negative response?
• Imaging – Simple sentence using verb first
Does verb evoke activity in motor effector area?
Structured Neural Computation in NTL
The theory we are outlining uses the computational
modeling mechanisms of the Neural Theory of Language
(NTL).
NTL makes use of structured connectionism (Not PDP
connectionism!).
NTL is ‘localist,’ with functional clusters as units.
Localism allows NTL to characterize precise
computations, as needed in actions and in inferences.
Simulation
To understand the meaning of the concept grasp,
one must at least be able to imagine oneself or
someone else grasping an object.
Imagination is mental simulation, carried out by
the same functional clusters used in acting and
perceiving.
The conceptualization of grasping via simulation
therefore requires the use of the same functional
clusters used in the action and perception of
grasping.
The Simulation Hypothesis
How do mirror neurons work?
By simulation.
When the subject observes another individual doing an action,
the subject is simulating the same action.
Since action and simulation use some of the same neural
substrate, that would explain why the same neurons are firing
during action-observation as during action-execution.
Conclusion 1
The Sensory-Motor System Is Sufficient
For at least one concept, grasp, functional clusters, as
characterized in the sensory-motor system and as modeled
using structured connectionist binding and inference
mechanisms, have all the necessary conceptual properties.
Conclusion 2
The Neural Version of Ockham’s Razor
Under the traditional theory, action concepts have to be
disembodied, that is, to be characterized neurally entirely
outside the sensory motor system.
If true, that would duplicate all the apparatus for
characterizing conceptual properties that we have
discussed. Unnecessary duplication of this sort is highly
unlikely in a brain that works by neural optimization.
Behavioral Experiments
Ben Bergen and Shweta Narayan
Do Words and Images Match?
Does shared effector slow negative response?
• Behavioral – Image First
Does verb evoke activity in motor effector area?
WALK
GRASP
WALK
Preliminary Behavior Results
Same Action
40 Native
Speakers
Eliminate
RT > 2 sec.
Other Effector
Same Effector
788
804
871
767
785
825
5 levels of Neural Theory of
Language
Spatial
Relation
Motor
Control
Metaphor Grammar
Cognition and Language
abstraction
Computation
Structured Connectionism
Neural Net
Triangle Nodes
SHRUTI
Computational Neurobiology
Biology
Neural
Development
Quiz
Midterm
Finals