* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Attention -
		                    
		                    
								Survey							
                            
		                
		                
                            
                            
								Document related concepts							
                        
                        
                    
						
						
							Transcript						
					
					AGI & Attention Helgi Páll Helgason [email protected] AI researcher, Ph.D. candidate Center for Analysis and Design of Intelligent Agents, Reykjavik University Intellifest 2012  Today: Attention  Friday November 2nd: Attention  Tuesday November 6th: Autonomy Intellifest 2012  Importance of attention for AI  Example of inspiration from human attention (cognitive psychology)  Attention beyond the human level (for meta-cognitive purposes)  Design of an attention mechanism Intellifest 2012  To work on AI research, each researcher or team needs to have • A clear and explicit working definition of intelligence • A clear and explicit motivation Intellifest 2012  Displaying human-like behavior?  Solving computationally complex problems (in some unspecified amount of time)?  Performing isolated tasks that have conventionally required humans?  Adapting to a complex, dynamic environment with insufficient knowledge and resources? Intellifest 2012  Development and validation of psychological models?  Development models? and validation of neurological  Development and implementation of practical, flexible and versatile autonomous system?  How accurately do we want to replicate existing biological mechanisms? • To what degree are we biologically inspired? Intellifest 2012  Nature’s way of dealing with complexity under resource and time constraints  No real-world intelligence exists that does not address the passage of time head on Intellifest 2012 “Intelligence is the capacity of a system to adapt to its environment while operating with insufficient knowledge and resources.” - Pei Wang (Rigid flexibility: The Logic of Intelligence. Springer 2006) Intellifest 2012 Intelligence is a capability of information processing systems Intelligence is adaptation: The system’s solution of one problem is not only determined by the problem itself, but also prior experience Intellifest 2012 Insufficient knowledge -> system will usually not have the best solution at hand Insufficient resources -> system can not consider (process) every possibility nor store all information Intellifest 2012 “If either time or computational resources are infinite, intelligence is irrelevant .” - Dr. Kristinn R. Thórisson Intellifest 2012  In the domain of intelligent systems, the management of system resources is typically called “attention”  Biological (Human) Attention: • Selective concentration on one aspect environment while ignoring others of the  Artificial Attention: • Resource management and control mechanism to assign limited system resources to processing of most relevant or important information Intellifest 2012 Time constraints ATTENTION Abundant information Limited resources Intellifest 2012 Time constraints INTELLIGENCE Abundant information Limited resources Intellifest 2012  “Narrow“ (classical) AI: • Systems explicitly designed to solve specific, reasonably well-defined problems  E.g. Deep Blue, Watson, etc.  Artificial General Intelligence (AGI) • Systems designed to autonomously learn novel tasks and adapt to changing environments Intellifest 2012  When tasks and environments are prespecified, we know in advance… • what information is relevant to system operation • how frequently the system has to sample information • how frequently the system has to act • the resource requirements of the system Intellifest 2012     Substantial dynamic adaption to task not required Information filtering can be pre-programmed if characteristics of relevant information known in advance Resource management and processing handtuned for specific tasks and environments Major reduction in complexity (compared to realworld tasks and environments) • End up with limited and closed models of the real-world Intellifest 2012  When tasks and environments are unknown, we do not know in advance… • what information is relevant to system operation • how frequently the system has to sample information • how frequently the system has to act • the resource requirements of the system Intellifest 2012  Must assume up-front: • Real world environmental complexity • All information is potentially important • Not just limited, but insufficient resources at all times • Dynamic tasks, environments and time constraints Intellifest 2012  “Narrow” AI • Substantial dynamic adaptation to task not required • Data filtering can be pre-programmed if characteristics of useful data known in advance • Lower than real world task complexity  Resource management and processing hand-tuned for specific scenarios → Attention not required (?)  AGI • Real world environmental complexity assumed up-front • Computational resources for the AI assumed to be insufficient at all times  Complexity calls for data filtering and intelligent resource allocation • Environments and tasks unknown at implementation time  Resource management must be adaptive → Demands strong focus on resource management and realtime processing Intellifest 2012   Real world is highly dynamic and complex, provides abundance of information. System resources not only limited, but insufficient in light of amount of available information.  Range of time constraints (many of which are dictated by the environment) must be satisfied.  Unexpected events requiring response may occur at any time. Intellifest 2012 immediate AGI SUMMER SCHOOL 2012 Intellifest 2012 “Everyone knows what attention is. It is the taking possession by the mind, in clear and vivid form, of one out of what seem several simultaneously possible objects or trains of thought. Focalization, concentration, of consciousness are of its essence. It implies withdrawal from some things in order to deal effectively with others, and is a condition which has a real opposite in the confused, dazed, scatterbrained state which in French is called distraction, and Zerstreutheit in German.” - William James, 1890 Intellifest 2012  Modern attention research started with the “cocktail party effect” (Colin Cherry, 1953)  Number of attention models have been proposed, most belonging to two classes: • Early selection: Selection of information occurs early in sensory pipeline based on shallow, primitive processing with no or limited analysis of meaning • Late selection: Selection of information based on deep analysis of meaning, occurs late in sensory pipeline  Many early selection models contradicted by observed human behavior (e.g. cocktail party scenario) Intellifest 2012 Intellifest 2012 Intellifest 2012 Intellifest 2012 Intellifest 2012 Knudsen EI: Fundamental components of attention. Annu Rev Neurosci 2007, 30:57-78. Intellifest 2012  Two types of attention: • Top-down  Deliberate, goal-driven, targets information related to the tasks being performed • Bottom-up  Reactive, targets unexpected but potentially important information Intellifest 2012  Information competes for limited system resources • Allows processing decisions to be made as late as possible, when resource availability is known  Early selection may be a problematic paradigm • Ignoring information without analysis of meaning introduces operational risk    Two simultaneously active functions of attention can allow systems to perform tasks while remaining reactive to unexpected events. Top-down attention may be controlled by active goals and predictions of the system to catch information related to current tasks. Bottom-up attention can be controlled by novelty and unexpectedness of incoming information Intellifest 2012  General Attention Mechanisms for Cognitive Architectures • My Ph.D. project  Intellifest 2012  Design a general attention mechanism intended for implementation in AGI systems (cognitive architectures)  In progress: Implementation and evaluation of resulting attention mechanism in state-ofthe-art cognitive architectures  While work is biologically inspired at highlevel, replication of any existing attention mechanism is not a goal Intellifest 2012  Complete • Targets all operational information (internal + external) • Top-down + Bottom-up  General • No limiting assumptions about tasks, environments or modalities  Uniform • Data from all modalities treated identically (at some level of processing)  Adaptive • Learns from experience Intellifest 2012  Modality neutral  All modalities treated identically, at some level of processing  Including proprioception (internal modalities, self-sensing)  Architecture-independent Intellifest 2012  Attention functionality implemented in handful of AGI systems • E.g. NARS, LIDA, CLARION  Limitations: • Data-filtering only (control issues ignored) • External information only (internal states ignored) • Realtime processing not addressed Intellifest 2012  Amount of available information constantly assumed to exceed system processing capacity  Limited system resources must be focused on most relevant or important information  Requires capability to determine degree of information importance, based on: • Current operating context • Time constraints • Resource availability Intellifest 2012  Is it sufficient to only evaluate relevance for data? • What about the relevance of system processes?  Just another “box” in the sensory • Or something more pervasive? pipeline?  Can we retrofit existing AGI architectures with attention?  Can we extend attention capabilities in useful ways for AGI systems? Intellifest 2012  Quantify current relevance of data  Data relevance: • Goal-related • Novelty / Unexpectedness  Quantify current relevance of processes  Process relevance: • Operational experience • Available data Intellifest 2012  Constructivist AI • “From Constructionist to Constructivist AI”, Thórisson 2009, BICA proceedings  Systems manage own growth • From manually constructed initial state (bootstrap/seed)  Methodology for building flexible AGI systems capable of autonomous selfreconfiguration at the architecture level Intellifest 2012  Internally, the system and its operation can be viewed as a dynamic and complex environment • Similar to external task environment  Meta-cognitive functions responsible for system growth must also process information selectively • Resources remain limited  Applying the same attention mechanism to external and internal environments may produce AI systems capable of performing tasks and improving own performance while being subject to realtime constraints and resources limitations. Intellifest 2012  Introspection, self-growth, self-improvement  As the sum of internal system activity is a vast stream of information…  applying attention to the internal environment can lend significant support for meta-cognitive operation by…  helping determine important information and processes for meta-cognitive functions…  in the same way it supports task performance in the external environment Intellifest 2012  While design is architectureindependent, some requirements are necessary Intellifest 2012  Data-driven • All processing is triggered by the occurrence of data • Eliminates the need for fixed control loops, allowing for operation at multiple time scales and greater flexibility  Fine-grained • Data and processing units are small but numerous • Reasoning about small, simple components and their effects is significantly more tractable than for larger, more complex components Intellifest 2012  Predictive capabilities • Capacity to generate predictions and expectations • Necessary control data for top-down attention in addition to goals  Unified sensory pipeline • Data given identical treatment regardless of origin (external, internal) Intellifest 2012 Unified sensory pipeline: External (environmental) and internal data handled identically at architecturelevel Environment (Real world) Sampled data Sensory devices Data items New data Processes Data-driven: Processes are activated only when paired with compatible data Fine-grained: Data and process objects are small and numerous Actuation devices Commands Goals / Predictions Derived Attentional patterns Predictive capabilities: Predictions are necessary control information for top-down attention Environment (Real world) Matching Sampled data Top-down Sensory devices Data biasing Data items Processes Data and processess have priority values that are assigned by biasing. Actuation devices Commands Goals / Predictions Environment (Real world) Derived Attentional patterns Matching Sampled data Top-down Sensory devices Data biasing Bottom-up attentional processess Bottom-up Data items Evaluation Processes Actuation devices Commands Goals / Predictions Environment (Real world) Derived Attentional patterns Matching Sampled data Top-down Sensory devices Data biasing Bottom-up attentional processess Bottom-up Data items Commands Evaluation Data -> Process mapping Processes Process biasing Actuation devices Goals / Predictions Environment (Real world) Derived Attentional patterns Matching Sampled data Top-down Sensory devices Data biasing Bottom-up attentional processess Bottom-up Data items Commands Evaluation Data -> Process mapping Contextual process evaluation Processes Contextualized process performance history Experience-based process activation Process biasing Actuation devices  Cognitive architectures • NARS  • LIDA  • http://goertzel.org/agiri06/%5B4%5D%20StanFranklin.pdf CLARION   https://sites.google.com/site/narswang/ https://sites.google.com/site/clarioncognitivearchitecture/ Publications: • • • Cognitive Architectures and Autonomy: A Comparative Review  Kristinn R. Thórisson, Helgi Páll Helgason  http://versita.metapress.com/content/052t1h656614848h/?p=4e1d01ba40e04d5d9f51da3977a8be04&pi=0 Attention Capabilities for AI Systems  Helgi Páll Helgason, Kristinn R. Thórisson  http://www.perseptio.com/publications/Helgason-ICINCO-2012.pdf On Attention Mechanisms for AGI Architectures: A Design Proposal (to be published)  Helgi Páll Helgason, Kristinn R. Thórisson, Eric Nivel  http://www.perseptio.com/publications/Helgason-AGI-2012.pdf Intellifest 2012  Wiki reading material for attention  Additional paper: • On Attention Mechanisms for AGI Architectures: A Design Proposal (to be published)  Helgi Páll Helgason, Kristinn R. Thórisson, Eric Nivel  http://www.perseptio.com/publications/Helgason-AGI2012.pdf  Post questions to Proboard Intellifest 2012
 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
									 
                                             
                                            ![[SENSORY LANGUAGE WRITING TOOL]](http://s1.studyres.com/store/data/014348242_1-6458abd974b03da267bcaa1c7b2177cc-150x150.png) 
                                             
                                             
                                             
                                             
                                             
                                             
                                             
                                             
                                            