Download Bootstrapping shared combinatorial speech

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Koinophilia wikipedia , lookup

Transcript
BOOTSRAPPING SHARED COMBINATORIAL SPEECH
CODES FROM BASIC IMITATION: THE ROLE OF
SELF-ORGANIZATION
PIERRE-YVES OUDEYER
Sony CSL Paris,
75005 Paris, France
Human vocalizations have a complex organization. They are discrete and
combinatorial: vocalizations are built through the combination of units, and these
units are systematically re-used from one vocalization to the other. These units
appear at multiple levels (e.g.the gestures, the coordination of gestures, the
phonemes, the morphemes). While for example the articulatory space that
defines the physically possible gestures is continuous, each language only uses a
discrete set of gestures. While there is a wide diversity of the repertoires of
these units in the world languages, there are also very strong regularities (for
example, the high frequency of the 5 vowel system /e,i,o,a,u/). Moreover, in each
language there are “rules” which determine what combinations of phonemes can
or cannot be produced: this is what is called phonotactics.
It is then obvious to ask where this organization comes from. There are two
complementary kinds of answers that must be given (Oudeyer, 2006). The first
kind is a functional answer stating what is the function of systems of speech
sounds, and then showing that systems having the organization that we described
are efficient for achieving this function. This has for example been proposed by
(Lindblom, 1992) who showed that discreteness and statistical regularities can be
predicted by searching for the most efficient vocalization systems. This kind of
answer is necessary, but not sufficient : it does not say how evolution (genetic or
cultural) might have found this optimal structure. In particular, naive darwinian
search with random mutations (i.e. plain natural selection) might not be
sufficient to explain the formation of this kind of complex structure : the search
space is just too large (Ball, 2003). This is why there needs a second kind of
answer stating how evolution might have found these structures. In particular,
this amounts to show how self-organization might have constrained the search
space and helped natural selection. This can be done by showing that a much
simpler system spontaneously self-organizes into the more complex structure that
we want to explain.
In this talk, I will present a computational model which is a generalization of
the model developped in (Oudeyer, 2005a,b, 2006), in which only one type of
neuron is used. This model involves a population of agents endowed with
operational models of the ear, of the vocal tract, and of the neural structures that
connect them. It shows how the generic coupling of evolutionarily simple neural
structures can produce spontaneously, thanks to self-organization, a primitive
combinatorial vocalization system with phonotactics shared by a population of
agents whose vocalizations were initially holistic and unorganized. What is
original is that: 1) there is no explicit pressure for building a system of
distinctive sounds (and there are no repulsive forces whatsoever in the system);
2) agents do not possess capabilities of coordinated interactions, in particular
they do not play language games; 3) agents possess no specific linguistic
capacities; 4) initially there exist no convention that agents can use.
I will also propose a new interpretation of this model. The neural structures
which are used look very much like what is needed for basic vocal imitation,
defined as the capacity to reproduce a sound which has been perceived. As a
consequence, they might have biologically evolved under a pressure for
imitation. What is interesting, is that thanks to self-organization, a combinatorial
speech code with phonotactics is formed as a side effect, even if such a speech
code is not necessary for imitation (indeed, basic vocal imitation does not even
need a system of distinctive sound categories nor a repertoire of discrete
vocalizations). This shows that the evolutionary step from vocal imitation to
shared combinatorial human-like speech codes might have been rather small. I
will also discuss how this is confirmed by the observation that many species of
birds and whales capable of vocal imitation do indeed possess such a shared
primitive combinatorial "vocal" code.
References
Ball P. (2001) The self-made tapestry, Pattern formation in nature, Oxford University
Press.
Lindblom, B.
(1992) Phonological Units as Adaptive Emergents of Lexical
Development, in Ferguson, Menn, Stoel-Gammon (eds.) Phonological development:
Models, Research, Implications, York Press, Timonnium, MD, pp. 565-604.
Oudeyer, P-Y. (2005a) The Self-Organization of Speech Sounds, Journal of Theoretical
Biology, Volume 233, Issue 3, pp.435—449
Oudeyer, P-Y. (2005b) The self-organisation of combinatoriality and phonotactics in
vocalization systems, Connection Science, vol. 17, No. 3, pp. 1-17
Oudeyer, P-Y. (2006) Self-Organization in the Evolution of Speech, Studies in the
Evolution of Language, Oxford University Press.