Download 2015 International Joint Conference on Neural Networks

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Artificial neural network wikipedia , lookup

Rheobase wikipedia , lookup

Neuroethology wikipedia , lookup

Embodied language processing wikipedia , lookup

Neural modeling fields wikipedia , lookup

Central pattern generator wikipedia , lookup

Premovement neuronal activity wikipedia , lookup

Time perception wikipedia , lookup

Eyeblink conditioning wikipedia , lookup

Development of the nervous system wikipedia , lookup

Neural coding wikipedia , lookup

Stimulus (physiology) wikipedia , lookup

Recurrent neural network wikipedia , lookup

Holonomic brain theory wikipedia , lookup

Neuroeconomics wikipedia , lookup

Embodied cognitive science wikipedia , lookup

Types of artificial neural networks wikipedia , lookup

Neuropsychopharmacology wikipedia , lookup

Agent-based model in biology wikipedia , lookup

Neural correlates of consciousness wikipedia , lookup

Synaptic gating wikipedia , lookup

Channelrhodopsin wikipedia , lookup

Cerebral cortex wikipedia , lookup

Metastability in the brain wikipedia , lookup

Feature detection (nervous system) wikipedia , lookup

Biological neuron model wikipedia , lookup

Nervous system network models wikipedia , lookup

Transcript
2015 International Joint Conference on Neural Networks (IJCNN)
July 12 – July 17, 2015, Killarney, Ireland
A Neurocomputational Model Implemented on
Humanoid Robot for Learning Action Selection
Emeç Erçelik, N. Serap Şengör - Istanbul Technical University
[email protected],[email protected]
Sensory Inputs
SNN Network
R
Abstract
Based on a computational model of Basal ganglia-Thalamus-Cortex
(BTC) circuit proposed for action selection, the task of associating a
sensory stimulus with a desired action is realized on a humonoid
robot. The computational model of BTC circuit, incorporates two
different levels of modeling: point neuorns and mass models. With
the point neuron it is aimed to obtain a more realistic method to
investigate the model in real time, while mass model provides
realizability of the task on humanoid robot platform, Darwin-Op.
Point neurons are used in modeling cortex which consists of
channels for each action to be elected. The plastic all-to-all
connections from the sensory stimuli to the basal ganglia structures
are modulated with reward.
In the task, the sensory inputs, namely colors, are presented to the
humanoid robot and it is expected that these sensory inputs would
be associated with the predefined actions by modulating the
connections.
Despite the model lack the reality and detail computational models
of basal ganglia, thalamus and cortex loop for action selection, its
simple approach is sufficient from modeling aspect to show the
action selection behavior on cortex in real time applications.
Y
B
80x3 Regular
Spiking Neurons
The Model
Excitatory
Inhibitory
1
[Wc]3x3 (Orbitofrontal Cortex ? )
20x3 Fast Spiking
Neurons
Cortex
SNN Model
Camera
[Wr]3x1
Str
Dynamical System
Mass Model
GPe
GPi
Thl
Stn
20 servo motors
Wc and Wr updated with
Temporal Difference Learning (TDL)
The Task
When a color is presented to
robot, it tries to decide on
an action, at first randomly,
and gets reward according
to its decision. The reward
given, causes an expectation
The model consists of two
parts, one is Cortex that is
modeled
using
İzhikevich
neurons[4] and the other is BTC
which is modeled using mass
model given in [1,2]. The BTC
model is coded in Python and the
Cortex is simulated in NEST[6].
There are three channels each for
a different action and the
winning channel in Cortex is
considered as the selected
action.
The camera information is get
from the C++ coded part and sent
to the model, and the BTC model
determines the action to be
selected. The output of cortex is
determined by firing rates of
three
channels
and
this
information is sent to the Basal
ganglia-Thalamus loop in a
normalized form to manage the
calculations.
Basal
ganglia
Thalamus loop employs the
action selected to cortex and this
information
with
striatum
information
is
used
in
reinforcement learning to update
the model parameters using
temporal difference learning in
C++ coded part.
error, as the robot was expecting nothing as a result of its choice.
This urges robot to reinforce the desired stimulus-action pair.
Once robot is rewarded due to its right choice, expectation error
updates the parameters of BTC model in charge of action selection.
When update is completed, robot manages to select the desired
action related with the given stimulus. In case, when an expected
reward is not given to the selected action associated with a
stimulus, this also causes an expectation error and urges robot to
rearrange the previously built association between a stimulus and
action.
1st experiment: Stimulus are given in sequence
Raster Plot of Neurons in Cortex
2nd experiment: Stimulus are given in random order
Channel 1
Channel 3
Between
550 ms
750 ms
Between
3550 ms
3750 ms
Referances
[1] N. S. Sengor, O. Karabacak and U. Steinmetz, A Computational Model of CorticoStriato-Thalamic Circuits in Goal-Directed Behaviour, Proceedings of International
Conference of Artificial Neural Networks (ICANN’08), Springer-Verlag, pp.328-337, 2008.
[2] B. Denizdurduran and N. S. Sengor, A Realization of Goal-Directed BehaviorImplementing a Robot Model Based on Cortico-StriatoThalamic Circuits. ICAART 2012,
Algarve, Portugal, 2012.
[3] G. E. Alexander, M. D. Crutcher and M. R. DeLong, Basal gangliathalamacortical
circuits: Parallel substrates for motor, oculomotor, ”prefrontal” and ”limbic” functions,
Progress in Brain Research, 85, pp.119-146, 1990.
[4] E. M. Izhikevich, Simple Model of Spiking Neurons, IEEE Transactions on Neural
Networks, Vol. 14, No. 6, 2003.
[5] W. Schultz, P. Dayan, P. Montague, A Neural Substrate of Prediction and Reward,
Science, 275, pp.1593-1599, 1997.
[6] M-O. Gewaltig and M. Diesmann, NEST (Neural Simulation Tool), Scholarpedia
2(4):1430, 2007.
Acknowledgements
The Humanoid Robot Platform is purchased by TUBITAK Project 111E264. We thank to Berat Denizdurduran for his valuable discussions and guidance on expanding and implementing the action
selection model
2015 International Joint Conference on Neural
Networks