A Neural Network Model for the Representation of Natural Language
Eleni Koutsomitopoulou
Georgetown University (Washington, DC) and LexisNexis (London, Great Britain)
[email protected]
Current research in natural language processing demonstrates the importance of analyzing
syntactic relationships, such as word order, topicalization, passivization, dative movement,
particle movement, and pronominalization, as dynamic resonant patterns of neuronal activation
(Loritz, 1999). Following this line of research, this study demonstrates the importance of also
analyzing conceptual relationships, such as polysemy, homonymy, ambiguity, metaphor,
neologism, and coreference, as dynamic resonant patterns represented in terms of neuronal
activation. This view has implications for the representation of natural language (NL). By
contrast, formal representation methods abstract away from the actual properties of real-time
natural language input, and rule-based systems have limited representational power.
Since NL is a human neurocognitive phenomenon, we presume that it can best be
represented in a neural network model. This study focuses on a neural network simulation, the
Cognitive Linguistic Adaptive Resonant Network (CLAR-NET) model, of online and real-time
associations among concepts. The CLAR-NET model is a simulated Adaptive
Resonance Theory (ART, Grossberg 1972 et seq.) model. Through a series of experiments, I
address particular linguistic problems such as homonymy, neologism, polysemy, metaphor,
constructional polysemy, contextual coreference, subject-object control, event-structure
metaphor, and negation. The aim of this study is to infer NL-specific mappings of concepts in
the human neurocognitive system on the basis of known facts and observations provided
within the realms of conceptual metaphor theory (CMT) and adaptive grammar (AG, Loritz
1999), both theories of linguistic analysis, together with known variables drawn from the brain
and cognitive sciences as well as previous neural network systems built for similar purposes.
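
The abstract does not spell out CLAR-NET's internal equations, but the flavor of an ART search cycle can be conveyed in a short sketch. Below is a minimal binary ART-1-style network in Python; the class name, parameter values, and the restriction to binary patterns are simplifying assumptions made for illustration, not details of the CLAR-NET implementation.

```python
import numpy as np

class ART1Sketch:
    """Minimal binary ART-1-style network (illustrative only).

    Each stored template is a top-down LTM trace; an input either
    resonates with an existing category or recruits a new node.
    """

    def __init__(self, vigilance=0.8, beta=1.0):
        self.rho = vigilance    # vigilance: how close a match must be to resonate
        self.beta = beta        # choice parameter for category selection
        self.templates = []     # learned binary prototypes (LTM traces)

    def present(self, pattern):
        """Run one search cycle; return the index of the resonating category."""
        i = np.asarray(pattern, dtype=bool)
        # Rank categories by the ART-1 choice function T_j = |I & t_j| / (beta + |t_j|).
        ranked = sorted(
            range(len(self.templates)),
            key=lambda j: np.sum(i & self.templates[j])
                          / (self.beta + np.sum(self.templates[j])),
            reverse=True,
        )
        for j in ranked:
            match = np.sum(i & self.templates[j]) / np.sum(i)
            if match >= self.rho:
                # Resonance: reinforce the stored trace (fast learning).
                self.templates[j] &= i
                return j
        # Mismatch reset at every node: recruit a new category.
        self.templates.append(i.copy())
        return len(self.templates) - 1
```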
My basic hypothesis is that the association among concepts is primarily an expression of
domain-general cognitive mechanisms that depend on continuous learning of both previously
presented linguistic input and everyday, direct experiential (i.e., sensory-physical) behaviors
represented in natural language as "common knowledge" (or "common sense"). According to
this hypothesis, complex conceptual representations are not actually associated with
pre-postulated feature structures, but with time-sensitive dynamic patterns of activation. These
patterns can reinforce previous learning and/or create new "place-holders" in the conceptual
system for future value binding, as the example below illustrates. This line of investigation has
implications for language learning, neurolinguistics, metaphor theory, information retrieval,
knowledge engineering, case-based reasoning, knowledge-based machine translation systems,
and related ontologies.
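
Continuing the sketch above, the reinforce-or-recruit behavior described here (the input vectors are invented for illustration) would look like:

```python
net = ART1Sketch(vigilance=0.8)
a = net.present([1, 1, 0, 0, 0, 0])   # nothing stored yet: new "place-holder" node 0
b = net.present([1, 1, 0, 0, 0, 1])   # match 2/3 < 0.8: mismatch reset, recruits node 1
c = net.present([1, 1, 0, 0, 0, 0])   # match 2/2 = 1.0: resonates, reinforces node 0
```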
This study finds that although short-term memory (STM) effects in ART-like networks are
significant, in most cases long-term memory (LTM) computation yields better semantic
discrimination. It is suggested that the internal structure of lexical frames that correspond to
clusters of congenial associations (in fact, neuronal subnetworks) is maintained as long as it
resonates with new input patterns or with traces learned in long-term memory. Different degrees
of similarity to (or deviation from) previously acquired knowledge clusters are computed as
activation levels of the corresponding neuronal nodes and may be measured via differential
equations of neuronal activity. The overall conclusion is that ART-like networks can model
interesting linguistic phenomena in a neurocognitively plausible way.
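
For readers unfamiliar with the kind of differential equations alluded to here, one standard formulation from the ART literature, Grossberg's shunting STM equation together with a gated (instar-style) LTM learning law, is shown below; these are generic textbook forms, not necessarily the exact equations used in CLAR-NET. Here A is a passive decay rate, B and C are upper and lower activity bounds, I_i is external input, and f is a nonlinear signal function.

```latex
% Shunting on-center off-surround STM equation for node activity x_i:
% passive decay, bounded excitation by input I_i and self-feedback f(x_i),
% and shunting inhibition from competing nodes.
\frac{dx_i}{dt} = -A x_i + (B - x_i)\bigl(I_i + f(x_i)\bigr)
                - (x_i + C)\sum_{k \neq i} f(x_k)

% Gated LTM learning law for the trace z_{ij} from node i to node j:
% the trace samples the activity x_i only while node j is active.
\frac{dz_{ij}}{dt} = f(x_j)\,\bigl(-z_{ij} + x_i\bigr)
```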