* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Attention -
Survey
Document related concepts
Transcript
AGI & Attention Helgi Páll Helgason [email protected] AI researcher, Ph.D. candidate Center for Analysis and Design of Intelligent Agents, Reykjavik University Intellifest 2012 Today: Attention Friday November 2nd: Attention Tuesday November 6th: Autonomy Intellifest 2012 Importance of attention for AI Example of inspiration from human attention (cognitive psychology) Attention beyond the human level (for meta-cognitive purposes) Design of an attention mechanism Intellifest 2012 To work on AI research, each researcher or team needs to have • A clear and explicit working definition of intelligence • A clear and explicit motivation Intellifest 2012 Displaying human-like behavior? Solving computationally complex problems (in some unspecified amount of time)? Performing isolated tasks that have conventionally required humans? Adapting to a complex, dynamic environment with insufficient knowledge and resources? Intellifest 2012 Development and validation of psychological models? Development models? and validation of neurological Development and implementation of practical, flexible and versatile autonomous system? How accurately do we want to replicate existing biological mechanisms? • To what degree are we biologically inspired? Intellifest 2012 Nature’s way of dealing with complexity under resource and time constraints No real-world intelligence exists that does not address the passage of time head on Intellifest 2012 “Intelligence is the capacity of a system to adapt to its environment while operating with insufficient knowledge and resources.” - Pei Wang (Rigid flexibility: The Logic of Intelligence. Springer 2006) Intellifest 2012 Intelligence is a capability of information processing systems Intelligence is adaptation: The system’s solution of one problem is not only determined by the problem itself, but also prior experience Intellifest 2012 Insufficient knowledge -> system will usually not have the best solution at hand Insufficient resources -> system can not consider (process) every possibility nor store all information Intellifest 2012 “If either time or computational resources are infinite, intelligence is irrelevant .” - Dr. Kristinn R. Thórisson Intellifest 2012 In the domain of intelligent systems, the management of system resources is typically called “attention” Biological (Human) Attention: • Selective concentration on one aspect environment while ignoring others of the Artificial Attention: • Resource management and control mechanism to assign limited system resources to processing of most relevant or important information Intellifest 2012 Time constraints ATTENTION Abundant information Limited resources Intellifest 2012 Time constraints INTELLIGENCE Abundant information Limited resources Intellifest 2012 “Narrow“ (classical) AI: • Systems explicitly designed to solve specific, reasonably well-defined problems E.g. Deep Blue, Watson, etc. Artificial General Intelligence (AGI) • Systems designed to autonomously learn novel tasks and adapt to changing environments Intellifest 2012 When tasks and environments are prespecified, we know in advance… • what information is relevant to system operation • how frequently the system has to sample information • how frequently the system has to act • the resource requirements of the system Intellifest 2012 Substantial dynamic adaption to task not required Information filtering can be pre-programmed if characteristics of relevant information known in advance Resource management and processing handtuned for specific tasks and environments Major reduction in complexity (compared to realworld tasks and environments) • End up with limited and closed models of the real-world Intellifest 2012 When tasks and environments are unknown, we do not know in advance… • what information is relevant to system operation • how frequently the system has to sample information • how frequently the system has to act • the resource requirements of the system Intellifest 2012 Must assume up-front: • Real world environmental complexity • All information is potentially important • Not just limited, but insufficient resources at all times • Dynamic tasks, environments and time constraints Intellifest 2012 “Narrow” AI • Substantial dynamic adaptation to task not required • Data filtering can be pre-programmed if characteristics of useful data known in advance • Lower than real world task complexity Resource management and processing hand-tuned for specific scenarios → Attention not required (?) AGI • Real world environmental complexity assumed up-front • Computational resources for the AI assumed to be insufficient at all times Complexity calls for data filtering and intelligent resource allocation • Environments and tasks unknown at implementation time Resource management must be adaptive → Demands strong focus on resource management and realtime processing Intellifest 2012 Real world is highly dynamic and complex, provides abundance of information. System resources not only limited, but insufficient in light of amount of available information. Range of time constraints (many of which are dictated by the environment) must be satisfied. Unexpected events requiring response may occur at any time. Intellifest 2012 immediate AGI SUMMER SCHOOL 2012 Intellifest 2012 “Everyone knows what attention is. It is the taking possession by the mind, in clear and vivid form, of one out of what seem several simultaneously possible objects or trains of thought. Focalization, concentration, of consciousness are of its essence. It implies withdrawal from some things in order to deal effectively with others, and is a condition which has a real opposite in the confused, dazed, scatterbrained state which in French is called distraction, and Zerstreutheit in German.” - William James, 1890 Intellifest 2012 Modern attention research started with the “cocktail party effect” (Colin Cherry, 1953) Number of attention models have been proposed, most belonging to two classes: • Early selection: Selection of information occurs early in sensory pipeline based on shallow, primitive processing with no or limited analysis of meaning • Late selection: Selection of information based on deep analysis of meaning, occurs late in sensory pipeline Many early selection models contradicted by observed human behavior (e.g. cocktail party scenario) Intellifest 2012 Intellifest 2012 Intellifest 2012 Intellifest 2012 Intellifest 2012 Knudsen EI: Fundamental components of attention. Annu Rev Neurosci 2007, 30:57-78. Intellifest 2012 Two types of attention: • Top-down Deliberate, goal-driven, targets information related to the tasks being performed • Bottom-up Reactive, targets unexpected but potentially important information Intellifest 2012 Information competes for limited system resources • Allows processing decisions to be made as late as possible, when resource availability is known Early selection may be a problematic paradigm • Ignoring information without analysis of meaning introduces operational risk Two simultaneously active functions of attention can allow systems to perform tasks while remaining reactive to unexpected events. Top-down attention may be controlled by active goals and predictions of the system to catch information related to current tasks. Bottom-up attention can be controlled by novelty and unexpectedness of incoming information Intellifest 2012 General Attention Mechanisms for Cognitive Architectures • My Ph.D. project Intellifest 2012 Design a general attention mechanism intended for implementation in AGI systems (cognitive architectures) In progress: Implementation and evaluation of resulting attention mechanism in state-ofthe-art cognitive architectures While work is biologically inspired at highlevel, replication of any existing attention mechanism is not a goal Intellifest 2012 Complete • Targets all operational information (internal + external) • Top-down + Bottom-up General • No limiting assumptions about tasks, environments or modalities Uniform • Data from all modalities treated identically (at some level of processing) Adaptive • Learns from experience Intellifest 2012 Modality neutral All modalities treated identically, at some level of processing Including proprioception (internal modalities, self-sensing) Architecture-independent Intellifest 2012 Attention functionality implemented in handful of AGI systems • E.g. NARS, LIDA, CLARION Limitations: • Data-filtering only (control issues ignored) • External information only (internal states ignored) • Realtime processing not addressed Intellifest 2012 Amount of available information constantly assumed to exceed system processing capacity Limited system resources must be focused on most relevant or important information Requires capability to determine degree of information importance, based on: • Current operating context • Time constraints • Resource availability Intellifest 2012 Is it sufficient to only evaluate relevance for data? • What about the relevance of system processes? Just another “box” in the sensory • Or something more pervasive? pipeline? Can we retrofit existing AGI architectures with attention? Can we extend attention capabilities in useful ways for AGI systems? Intellifest 2012 Quantify current relevance of data Data relevance: • Goal-related • Novelty / Unexpectedness Quantify current relevance of processes Process relevance: • Operational experience • Available data Intellifest 2012 Constructivist AI • “From Constructionist to Constructivist AI”, Thórisson 2009, BICA proceedings Systems manage own growth • From manually constructed initial state (bootstrap/seed) Methodology for building flexible AGI systems capable of autonomous selfreconfiguration at the architecture level Intellifest 2012 Internally, the system and its operation can be viewed as a dynamic and complex environment • Similar to external task environment Meta-cognitive functions responsible for system growth must also process information selectively • Resources remain limited Applying the same attention mechanism to external and internal environments may produce AI systems capable of performing tasks and improving own performance while being subject to realtime constraints and resources limitations. Intellifest 2012 Introspection, self-growth, self-improvement As the sum of internal system activity is a vast stream of information… applying attention to the internal environment can lend significant support for meta-cognitive operation by… helping determine important information and processes for meta-cognitive functions… in the same way it supports task performance in the external environment Intellifest 2012 While design is architectureindependent, some requirements are necessary Intellifest 2012 Data-driven • All processing is triggered by the occurrence of data • Eliminates the need for fixed control loops, allowing for operation at multiple time scales and greater flexibility Fine-grained • Data and processing units are small but numerous • Reasoning about small, simple components and their effects is significantly more tractable than for larger, more complex components Intellifest 2012 Predictive capabilities • Capacity to generate predictions and expectations • Necessary control data for top-down attention in addition to goals Unified sensory pipeline • Data given identical treatment regardless of origin (external, internal) Intellifest 2012 Unified sensory pipeline: External (environmental) and internal data handled identically at architecturelevel Environment (Real world) Sampled data Sensory devices Data items New data Processes Data-driven: Processes are activated only when paired with compatible data Fine-grained: Data and process objects are small and numerous Actuation devices Commands Goals / Predictions Derived Attentional patterns Predictive capabilities: Predictions are necessary control information for top-down attention Environment (Real world) Matching Sampled data Top-down Sensory devices Data biasing Data items Processes Data and processess have priority values that are assigned by biasing. Actuation devices Commands Goals / Predictions Environment (Real world) Derived Attentional patterns Matching Sampled data Top-down Sensory devices Data biasing Bottom-up attentional processess Bottom-up Data items Evaluation Processes Actuation devices Commands Goals / Predictions Environment (Real world) Derived Attentional patterns Matching Sampled data Top-down Sensory devices Data biasing Bottom-up attentional processess Bottom-up Data items Commands Evaluation Data -> Process mapping Processes Process biasing Actuation devices Goals / Predictions Environment (Real world) Derived Attentional patterns Matching Sampled data Top-down Sensory devices Data biasing Bottom-up attentional processess Bottom-up Data items Commands Evaluation Data -> Process mapping Contextual process evaluation Processes Contextualized process performance history Experience-based process activation Process biasing Actuation devices Cognitive architectures • NARS • LIDA • http://goertzel.org/agiri06/%5B4%5D%20StanFranklin.pdf CLARION https://sites.google.com/site/narswang/ https://sites.google.com/site/clarioncognitivearchitecture/ Publications: • • • Cognitive Architectures and Autonomy: A Comparative Review Kristinn R. Thórisson, Helgi Páll Helgason http://versita.metapress.com/content/052t1h656614848h/?p=4e1d01ba40e04d5d9f51da3977a8be04&pi=0 Attention Capabilities for AI Systems Helgi Páll Helgason, Kristinn R. Thórisson http://www.perseptio.com/publications/Helgason-ICINCO-2012.pdf On Attention Mechanisms for AGI Architectures: A Design Proposal (to be published) Helgi Páll Helgason, Kristinn R. Thórisson, Eric Nivel http://www.perseptio.com/publications/Helgason-AGI-2012.pdf Intellifest 2012 Wiki reading material for attention Additional paper: • On Attention Mechanisms for AGI Architectures: A Design Proposal (to be published) Helgi Páll Helgason, Kristinn R. Thórisson, Eric Nivel http://www.perseptio.com/publications/Helgason-AGI2012.pdf Post questions to Proboard Intellifest 2012