Plasticity and learning
Dayan and Abbott
Chapter 8
Introduction
• Learning occurs through synaptic plasticity
• Hebb (1949): If neuron A often contributes to the firing of
neuron B, then the synapse from A to B should be
strengthened
– Stimulus response (Pavlov)
– Converse: If neuron A does not contribute to the firing of B, the
synapse is weakened
– Hippocampus, neocortex, cerebellum
LTP and LTD at Schaffer collateral inputs to the CA1
region of a rat hippocampal slice. High-frequency stimulation yields
LTP; low-frequency stimulation yields LTD.
NB: no stimulation yields no LTD
Function of learning
• Unsupervised learning (ch 10)
– Feature selection, receptive fields, density estimation
• Supervised learning (ch 7)
– Input-output mapping, feedback as teacher signal
• Reinforcement learning (ch 9)
– Feedback in terms of reward, similar to control theory
• Hebbian learning (ch 8)
– Biologically plausible + normalization
– Covariance rule for (un)supervised learning
– Ocular dominance, maps
Rate model with fast time scale
• Neural activity as continuous rate, not spike train
v is the output neuron's firing rate, u is the vector of input rates, w is the
vector of synaptic weights
If tau_r is small with respect to the learning time scale:
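The underlying rate equation (the standard form, up to notation) is
tau_r dv/dt = -v + w . u
When tau_r is much smaller than the learning time scale, the rate relaxes
essentially instantaneously and we can simply set v = w . u.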
Basic Hebb rule
• v and u are functions of time, which makes the dynamics hard to
solve. The alternative is to assume v, u are drawn from a distribution p(v, u),
to assume p is time independent, and to average the rule over it.
Using v = w . u we get
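In formulas (standard forms, up to notation): the basic Hebb rule is
tau_w dw/dt = v u
and averaging it with v = w . u gives
tau_w dw/dt = <v u> = Q w, where Q = <u u^T> is the input correlation matrix.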
Basic Hebb rule
• The Hebb rule is unstable, because the norm of w always increases
• The continuous differential equation can be simulated using the
Euler scheme, as in the sketch below
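A minimal sketch of Euler integration of the basic Hebb rule (illustrative only;
the parameter values and variable names are my own choices, not from the slides):

import numpy as np

rng = np.random.default_rng(0)
tau_w, dt = 1.0, 0.01                      # learning time constant and Euler step
U = rng.standard_normal((1000, 5)) + 1.0   # presynaptic activity samples u (mostly positive)
w = 0.1 * rng.standard_normal(5)           # initial weights

for u in U:
    v = w @ u                              # rate model with fast time scale: v = w . u
    w += (dt / tau_w) * v * u              # basic Hebb rule: tau_w dw/dt = v u

print(np.linalg.norm(w))                   # the norm keeps growing: the rule is unstable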
Covariance rule
• The basic Hebb rule describes only LTP, since u and v are positive
• LTD occurs when pre-synaptic activity co-occurs with low
post-synaptic activity
• Alternatively, subtract a threshold from the pre- or post-synaptic activity:
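The two standard forms (up to notation) are
tau_w dw/dt = (v - theta_v) u    (threshold on the post-synaptic activity)
tau_w dw/dt = v (u - theta_u)    (threshold on the pre-synaptic activity)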
Covariance rule
• When the thresholds are set to the mean activities (theta_v = <v> or
theta_u = <u>), either rule produces, on average, tau_w dw/dt = C w,
where C = <u u^T> - <u><u>^T is the input covariance matrix
• The covariance rule is still unstable: the norm of w grows without bound
BCM rule
• Bienenstock, Cooper, Munro (1982):
Requires both pre- and post-synaptic activity for learning
For a fixed threshold, the BCM rule is also unstable
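The rule (standard form, up to notation) is
tau_w dw/dt = v u (v - theta_v)
so the synapse is potentiated when the post-synaptic rate exceeds the
threshold theta_v and depressed when it falls below it.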
BCM rule
• The BCM rule can be made stable by threshold dynamics (see the
sketch after this list)
• Tau_theta is smaller than tau_w
• The BCM rule implements competition between synapses
– Strengthening one synapse increases the threshold, which makes
strengthening of other synapses more difficult
• Such competition can also be implemented by
normalization
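A minimal sketch of the BCM rule with a sliding threshold, using the standard
threshold dynamics tau_theta dtheta/dt = v^2 - theta (parameter values and the
non-negativity clip are illustrative choices of mine):

import numpy as np

rng = np.random.default_rng(1)
tau_w, tau_theta, dt = 10.0, 1.0, 0.01          # threshold adapts faster than the weights
w = np.full(2, 0.5)                              # two input synapses
theta = 0.0                                      # sliding threshold

for _ in range(20000):
    u = np.abs(rng.standard_normal(2))           # non-negative presynaptic rates
    v = max(w @ u, 0.0)                          # post-synaptic rate
    w += (dt / tau_w) * v * u * (v - theta)      # BCM: LTP above theta, LTD below
    w = np.clip(w, 0.0, None)                    # keep the weights non-negative
    theta += (dt / tau_theta) * (v**2 - theta)   # sliding threshold stabilizes growth

print(w, theta)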
Synaptic normalization
• Limit the sum of weights or sum of squared weights
• Impose this constraint rigidly, or dynamically
• Two examples:
– Rigid scheme for the sum-of-weights constraint (subtractive norm.)
– Dynamic scheme for the sum of squared weights (multiplicative norm.)
Subtractive normalization
• Subtractive normalization ensures that sum_b w_b does not
change (written out below)
It is not clear how to implement this rule biophysically (it is non-local).
We must also add the constraint that weights are non-negative.
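The rule (standard form, up to notation) is
tau_w dw/dt = v u - v (n . u / N_u) n, with n = (1, 1, ..., 1),
whose right-hand side sums to zero over components, so sum_b w_b is conserved.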
Multiplicative normalization
• Oja rule (1982)
• The rule implements the constraint dynamically:
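The Oja rule (standard form, up to notation) is
tau_w dw/dt = v u - alpha v^2 w, with alpha > 0,
and at its stable fixed point the squared norm |w|^2 relaxes to 1/alpha.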
Unsupervised learning
• Adapting the network for a set of tasks
– Neural selectivity, receptive field
– Cortical map
• Process depends partly on neural activity and partly not
(axon growth)
• Ocular dominance
– Adult neurons favor one eye over the other (layer 4 input from
LGN)
– Neurons are clustered in bands or stripes
Single post-synaptic neuron
• We analyze Eq. 8.5
Single post-synaptic neuron
• Solution in terms of the eigenvalues of Q
• The eigenvalues are positive, so the solution explodes
• Asymptotically
– e1 is the principal eigendirection
– The neuron projects its input onto this direction:
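Explicitly (a standard result for the linear equation tau_w dw/dt = Q w):
w(t) = sum_b c_b exp(lambda_b t / tau_w) e_b,
so for large t the term with the largest eigenvalue lambda_1 dominates, w points
along e1, and the output v = w . u becomes proportional to the projection e1 . u.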
Single post-synaptic neuron
• Example with two weights
• The weights grow indefinitely, one positive and one negative; the choice
depends on the initial conditions
• Limiting the weights to [0, 1] yields different solutions depending on the
initial values
Single post-synaptic neuron
• Subtractive normalization. Averaging over the inputs:
• Analysis in terms of eigenvectors:
– In the ocular dominance case e1 is proportional to n, i.e. e1 = n/sqrt(N_u).
A w in this direction has a right-hand side equal to zero, i.e. this
component of w is unaltered
– In the other eigendirections the normalizing term is zero
– w is asymptotically dominated by the second eigenvector
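Explicitly, the averaged rule (standard form, up to notation) is
tau_w dw/dt = Q w - (n . Q w / N_u) n.
The component of w along n does not grow, so growth is driven by the
eigenvector of Q with the largest eigenvalue orthogonal to n.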
Hebbian development of ocular
dominance
Subtractive normalization may solve this: since e1 is proportional to n, growth
along e1 is suppressed and the weight vector grows along e2 = (1, -1)
Single post-synaptic neuron
• Using the Oja rule
• Show that each eigenvector of Q is a stationary solution
• One can show that only the principal eigenvector is stable
Single post-synaptic neuron
• A: Behavior of
– unnormalized Hebbian learning
– multiplicative normalization (the Oja rule), which gives w propto e1.
This is similar to PCA (see the sketch below)
• B: Shifting the mean of u may yield a different solution
• C: Covariance-based learning corrects for the mean
• Saturation constraints may alter this conclusion
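A minimal sketch illustrating that the Oja rule aligns w with the principal
eigenvector of the input covariance (zero-mean inputs assumed; the data and
parameters are illustrative choices of mine):

import numpy as np

rng = np.random.default_rng(2)
C = np.array([[1.0, 0.8], [0.8, 1.0]])                 # correlated, zero-mean inputs
U = rng.multivariate_normal(np.zeros(2), C, size=20000)

alpha, eta = 1.0, 0.005
w = 0.1 * rng.standard_normal(2)
for u in U:
    v = w @ u
    w += eta * (v * u - alpha * v**2 * w)              # Oja rule

eigvals, eigvecs = np.linalg.eigh(np.cov(U.T))
e1 = eigvecs[:, np.argmax(eigvals)]
print(w / np.linalg.norm(w), e1)                       # same direction up to sign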
Hebbian development of ocular
dominance
• Model a layer 4 cell with input from two LGN cells, each
associated with a different eye
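In the simplest version (sketching the standard analysis, up to notation) the
input correlation matrix is Q = [[q_s, q_d], [q_d, q_s]], with same-eye
correlation q_s and between-eye correlation q_d. Its eigenvectors are
proportional to (1, 1) (the sum mode, eigenvalue q_s + q_d) and (1, -1) (the
difference mode, eigenvalue q_s - q_d). Subtractive normalization freezes the
sum mode, so the difference mode grows and one eye comes to dominate.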
Hebbian development of
orientation selectivity
Spectral analysis is also applicable to non-linear systems.
The dominant eigenvector is uniform;
non-uniform receptive fields result from sub-dominant
eigenvectors.
Cortical receptive fields built from LGN input: ON-center (white) and
OFF-center (black) cells excite the cortical neuron.
Multiple postsynaptic neurons
Hebbian development of ocular
dominance stripes
• A: model in which right- and left-eye inputs drive an array of
cortical neurons
• B: ocular dominance maps. Top: light and dark areas in the top
and bottom cortical layers show ocular dominance in cat
primary visual cortex. Bottom: model of 512 neurons with
Hebbian learning
Hebbian development of ocular
dominance stripes
• Use Eq. 8.31 with W = (w+, w-), the N x 2 matrix. See the book.
• With subtractive normalization, dw+/dt = 0
• The ocular dominance pattern is given by the eigenvector of K with the
largest eigenvalue
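Roughly (following the book's analysis): subtractive normalization leaves the
summed weights unchanged, while the left-right weight difference grows
according to tau_w dw-/dt proportional to K w-, so the fastest-growing ocular
dominance pattern is the eigenvector of the cortical interaction matrix K with
the largest eigenvalue.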
Hebbian development of ocular
dominance stripes
• Suppose K is translation invariant
– Periodic boundary conditions simulate a patch of cortex, ignoring
boundary effects
– The eigenvectors are sine and cosine waves across the cortex
– The eigenvalues are the Fourier components of K
– The fastest-growing solution of the learning rule is spatially periodic (cf. fig 8.7)
Feature based models
• Multi-dimensional input (retinal location, ocular
dominance, orientation preference, ...)
• Replace input neurons by input features, W_ab is
selectivity of neuron a to feature b
– Feature u1 is location on retina in coordinates
– Feature u2 is ocularity (how much the stimulus prefers the left
eye over the right eye), a single number
• The couplings W_a to neuron a describe its preferred
stimulus
• Activity of output a is
Feature based models
• Output is soft-max
• Combined with lateral averaging
– Self-organizing map (SOM)
– Elastic net
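One concrete reading of these two slides (a sketch of the standard soft-max
form, not necessarily the chapter's exact equations): the activity of output a is
v_a = exp(-|u - W_a|^2 / (2 sigma^2)) / sum_a' exp(-|u - W_a'|^2 / (2 sigma^2)),
i.e. a soft-max over how well the stimulus matches each neuron's preferred
feature vector W_a. Lateral averaging then smooths the weight updates over
neighboring cortical units, which is what the self-organizing map and the
elastic net implement.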
Feature based models
Optical imaging shows ocularity and orientation selectivity in
macaque primary visual cortex. Dark lines are ocular
dominance boundaries, light lines are iso-orientation
contours. Pinwheel singularities, linear zones.
Feature based models
• Elastic net output
• SOM and competitive Hebbian rules can produce similar output
Anti-Hebbian modification
• Another way to make different outputs specialize is by
adaptive anti-Hebbian modification
• Consider the Oja rule:
• Each output a will become identical
• Anti-Hebbian modification has been observed at synapses from
parallel fibers to Purkinje cells in the cerebellum
• The combination yields different eigenvectors as outputs
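One concrete possibility (a sketch, not necessarily the chapter's exact
equations): the feedforward weights follow the Oja rule while lateral weights
M between outputs are updated anti-Hebbianly, e.g. delta M_aa' proportional to
-v_a v_a' for a not equal to a'. This decorrelates the outputs, so different
output neurons converge to different eigenvectors of Q.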
Timing based rules
• Left: in vitro cortical slice. Right: in vivo Xenopus tadpoles
• LTP when pre-synaptic spike precedes post-synaptic spike
• LTD when pre-synaptic spike follows post-synaptic spike
Timing based rules
• Simulating spike-timing plasticity requires spiking neurons
• An approximate description uses firing rates
• H(t) is positive/negative for t positive/negative
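In rate terms the rule is roughly (up to notation)
tau_w dw/dt = integral over tau >= 0 of [ H(tau) v(t) u(t - tau) + H(-tau) v(t - tau) u(t) ] dtau,
so pre-before-post pairings (where H > 0) strengthen the synapse and
post-before-pre pairings (where H < 0) weaken it.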
Timing based plasticity and
prediction
• Consider an array of neurons labeled by a, with receptive
fields f_a(s) (dashed and solid curves)
• Timing-based learning rule. The stimulus s moves
from left to right.
Timing based plasticity and
prediction
• If a is to the left of b, then the link from a to b is strengthened and the
link from b to a is weakened. The receptive field of neuron a is
asymmetrically deformed (A, solid bold line)
• Prediction: the next presentation of s(t) will activate a earlier
• This agrees with the shift of the place-field mean when rats run
around a track (B)
Supervised Hebbian learning
• Weight decay:
• Asymptotic solution is
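With a decay term the averaged rule is (up to notation)
tau_w dw/dt = <v u> - alpha w,
where v is the supervised (target) output, and the asymptotic solution is
w = <v u> / alpha, the input-output cross-correlation scaled by 1/alpha.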
Classification and the Perceptron
• If output values are +/- 1 the model implements a
classifier, called the Perceptron:
– The weight vector defines a separating hyperplane: w . u = gamma
(see the sketch below)
– The perceptron can solve problems that are ‘linearly separable’
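A minimal perceptron sketch for a linearly separable problem (the update shown
is the classical perceptron learning rule; the data and parameters are
illustrative choices of mine):

import numpy as np

rng = np.random.default_rng(3)
U = rng.standard_normal((200, 2))              # inputs
labels = np.where(U[:, 0] > 0, 1, -1)          # linearly separable +/-1 targets

w, gamma = np.zeros(2), 0.0
for _ in range(50):                            # epochs
    for u, t in zip(U, labels):
        v = 1 if w @ u >= gamma else -1        # which side of the hyperplane w . u = gamma
        if v != t:                             # update only on mistakes
            w += t * u
            gamma -= t                         # the threshold learns like a bias term

print(np.mean(np.where(U @ w >= gamma, 1, -1) == labels))   # training accuracy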