Download YBurak-Sloan-Swartz-Annual-2009

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Pattern recognition wikipedia , lookup

Optogenetics wikipedia , lookup

Transcript
Image Stabilization by
Bayesian Dynamics
Yoram Burak
Sloan-Swartz annual meeting, July 2009
What does neural activity represent?
In Bayesian models: probabilities
Accumulated evidence in area LIP
Shadlen and Newsome (2001)
Direction of motion: single, static variable
What does neural activity represent?
In Bayesian models: probabilities
Accumulated evidence in area LIP
Shadlen and Newsome (2001)
Direction of motion: single, static variable
What about multi-dimensional, dynamic quantities?
Foveal vision and fixational drift
Foveal vision and fixational drift
Fixational drift is large in the fovea:
- between micro-saccades ~20 receptive fields
- between spikes (100 Hz) ~2-4 receptive fields !
By Xaq from: X. Pitkow
Image
Pitkow
cone separation:
0.5 arcmin
Foveal vision and fixational drift
Fixational drift is large in the fovea:
- between micro-saccades ~20 receptive fields
- between spikes (100 Hz) ~2-4 receptive fields !
By Xaq from: X. Pitkow
Image
Pitkow
cone separation:
0.5 arcmin
Downstream areas require knowledge
of trajectory to interpret spikes
Joint decoding of image and position
Bayesian:
Discrimination task:
vs.
N x 2 probabilities
X. Pitkow et al, Plos Biology (2007)
# positions
Joint decoding of image and position
Bayesian:
Discrimination task:
vs.
N x 2 probabilities
X. Pitkow et al, Plos Biology (2007)
# positions
Unconstrained image
30 x 30
binary
pixels
N x 2900 probabilities
Joint decoding of image and position
Bayesian:
Discrimination task:
vs.
N x 2 probabilities
X. Pitkow et al, Plos Biology (2007)
# positions
Unconstrained image
30 x 30
binary
pixels
N x 2900 probabilities
Can the brain apply a Bayesian approach to
this problem?
Can the brain apply a Bayesian approach to
this problem?
Decoding strategy
Performance in parameter space
What are the biological implications?
Can the brain apply a Bayesian approach to
this problem?
Decoding strategy
Performance in parameter space
What are the biological implications?
Decoding strategy
Factorized representation:
Discards information about correlations
Decoding strategy
Factorized representation:
Discards information about correlations
Update dynamics:
evidence,
diffusion
Exact if trajectory is known.
minimize
DKL
Decoding strategy
Factorized representation:
Discards information about correlations
Update dynamics:
evidence,
diffusion
minimize
DKL
Exact if trajectory is known.
Retinal encoding model:
evidence - Poisson spiking (rate λ1 for on pixels, λ0 for off)
diffusion - Random walk (diffusion coefficient D)
Decoding strategy
Factorized representation:
Discards information about correlations
Neural Implementation -
Two populations: where
, what
For 30 x 30 pixels: N × 2900 → N + 900 quantities.
Update rules
Update of what neurons:
Ganglion cells
multiplicative gating
nonlinearity
What
Where
Update rules
Update of what neurons:
Ganglion cells
multiplicative gating
What
nonlinearity
Where
Update of where neurons:
Ganglion cells
multiplicative gating
Where
What
+ diffusion
Demo
m x m binary pixels
image
retina
2d diffusion (D)
Poisson spikes:
100 Hz (on), 10 Hz (off)
Decoder
Demo
Can the brain apply a Bayesian approach to
this problem?
Decoding strategy
Performance in parameter space
What are the biological implications?
Performance
Performance degrades with larger D
accuracy
Convergence time [s]
(and smaller λ)
D
D
Performance
Faster and more accurate for larger images
accuracy
Convergence time [s]
m = 5, 10, 30, 50, 100
D
D
Demo
Performance
Faster and more accurate for larger images
accuracy
Convergence time [s]
m = 5, 10, 30, 50, 100
D
D
Performance
Faster and more accurate for larger images
accuracy
Convergence time [s]
m = 5, 10, 30, 50, 100
D
D
Performance
Faster and more accurate for larger images
accuracy
Convergence time [s]
m = 5, 10, 30, 50, 100
D
D
Performance
accuracy
Convergence time [s]
scales with linear image size m
m x m pixels
D/m
D/m
Performance
m x m pixels
D*
D/m
D/m
Analytical scaling:
Convergence time [s]
accuracy
scales with linear image size m
Performance
Performance improves with image size.
Success for images 10 x 10 or larger
Prediction for psychophysics:
Degradation in high acuity tasks when visual scene contains little
background detail.
Temporal response of Ganglion cells
f(t)
Common view: fixational motion important to activate cells, due to biphasic
response
50 ms
t
Temporal response makes decoding much more difficult.
Non-Markovian:
Need history
Temporal response of Ganglion cells
What
Where
“filtered
trajectory”
accuracy
Ganglion
Convergence time [s]
Approach: Choose decoder that is Bayes optimal if the
trajectory is known.
D
D
history dependent decoder /
naive decoder
Temporal response of Ganglion cells
Is fixational motion beneficial?
Convergence time [s]
Known trajectory , perfect inhibitory balance
D
Optimal D - order of magnitude smaller than biological value
Can the brain apply a Bayesian approach to
this problem?
Decoding strategy
Performance in parameter space
What are the biological implications?
Network architecture
Each ganglion cell innervates multiple what & where cells
(spread: ~10 arcmin)
Reciprocal, multiplicative gating
Ganglion
What
Where
Activity:
What neurons
Slow dynamics, evidence accumulation
Where neurons Fewer. Highly dynamic activity
Tonic, sparse in retinal stabilization conditions.
Activity:
What neurons
Slow dynamics, evidence accumulation
Where neurons Fewer. Highly dynamic activity
Tonic, sparse in retinal stabilization conditions.
Where in the brain?
Monocular
If so, suggests LGN or V1
LGN?
Modulatory inputs to relay cells (gating?)
V1?
Lateral connectivity in where network,
Increase in number of neurons.
Summary
Strategy for stabilization of foveal vision
Factorized Bayesian approach to multi-dimensional inference
Summary
Strategy for stabilization of foveal vision
Factorized Bayesian approach to multi-dimensional inference
Explicit representation of stabilized image
“What” and “where” populations
Summary
Strategy for stabilization of foveal vision
Factorized Bayesian approach to multi-dimensional inference
Explicit representation of stabilized image
“What” and “where” populations
Good performance at 1 arcmin resolution
Problem is easier for large images, for coarser reconstruction
Summary
Strategy for stabilization of foveal vision
Factorized Bayesian approach to multi-dimensional inference
Explicit representation of stabilized image
“What” and “where” populations
Good performance at 1 arcmin resolution
Problem is easier for large images, for coarser reconstruction
Network architecture:
Many-to-one inputs from retina, multiplicative gating (what/where)
Acknowledgments
Uri Rokni
Haim Sompolinsky
Markus Meister
Special thanks - the Swartz foundation