* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download CHS-Soar - AGI conferences
History of artificial intelligence wikipedia , lookup
Machine learning wikipedia , lookup
Pattern recognition wikipedia , lookup
Unification (computer science) wikipedia , lookup
Knowledge representation and reasoning wikipedia , lookup
Ecological interface design wikipedia , lookup
Hard problem of consciousness wikipedia , lookup
Multi-armed bandit wikipedia , lookup
Complexity of constraint satisfaction wikipedia , lookup
Introducing Constrained Heuristic Search to the Soar Cognitive Architecture (demonstrating domain independent learning in Soar) The Second Conference on Artificial General Intelligence, AGI-09 Sean A. Bittle Mark S. Fox March 7th, 2009 1 /11 The Problem General problem solving and learning are central goals of AI research on cognitive architectures However, there are few examples of domain independent learning in cognitive architectures The Goal Demonstrate Soar can learn and apply domain independent knowledge But to achieve this goal we need to augment the Soar’s problem-solving paradigm (CHS-Soar) 2 /11 Soar Cognitive Architecture Developed by Newell, Laird and Rosenbloom at CMU, 1983 Symbolic Cognitive Architecture where all long term knowledge is encoded as productions rules. Based on the hypothesis that all goal-oriented behavior can be cast as the selection and application of operators to a state in a problem space 3 /11 Constrained Heuristic Search (CHS) Developed by Fox, Sadeh and Bayken, 1989 CHS is a problem solving approach that combination of constraint satisfaction and heuristic search where the definition of the problem space is refined to include: Problem Topology Problem Textures Problem Objective Constraint Graph Va Ci Vc Cii Vb CP/CHS allows us to employ a generalized problem representation (CSP) and utilize generic, yet powerful problem solving techniques 4 /11 CHS-Soar “What are we trying to learn?” 5 /11 CHS-Soar What are “Texture Measures?” Textures are measures of the problem topology •Minimum Remaining Values (MRV) – variable selection •Degree (DEG) – variable selection •Least Constraining Value (LCV) – value selection Constraint Graph External Agent Problem Data Var Dom Actual MRV Soar Agent Normalized DEG MRV DEG Pruned Num MRV DEG WA R,G,B 3 2 0.50 0.40 1 0.50 0.00 NT R,G,B 3 3 0.50 0.60 2 - 0.40 SA R,G,B 3 5 0.50 1.00 3 - 0.60 Q R,G,B 3 3 0.50 0.60 4 - 1.00 NSW R,G,B 3 3 0.50 0.60 V R,G,B 3 2 0.50 0.40 T R,G,B 3 0 0.50 0.00 6 /11 CHS-Soar “How Do We Select a “Good” Texture Measure?” MRV DEG DEG DEG DEG 0.50 0.00 0.40 0.60 1.00 7 /11 CHS-Soar “What Do We Learn...Again?” “Chunk” decoupled from problem type Traditional Soar Agent Chunks tend to include domain specific knowledge Standard Soar Chunk (Water Jugs) sp {chunk-54*d150*tie*2 :chunk (state <s1> ^name water-jug ^operator <o1> + ^problem-space <p1> ^desired <d1> ^jug <j1> ^jug <j2>) (<o1> ^name fill ^jug <j1>) (<p1> ^name water-jug) (<j1> ^contents 0 ^volume 3) (<j2> ^contents 0 ^volume 5) (<d1> ^jug <j3>) (<j3> ^contents 1 ^volume 3) --> (<s1> ^operator <o1> >) } Hyper-heuristics: heuristics to choose heuristic measures CHS-Soar Chunk sp {chunk-128*d351*tie*2 :chunk (state <s1> ^phase |SelectVariableTexture| ^top-state <s1> ^name |CHS-Soar| ^operator <o1> + ^operator { <o2> <> <o1> } + ^problem-space <p1> ^desired <d1>) (<o1> ^texture |DEG| ^value 0.66 ^name |VariableTexture|) (<o2> ^texture |MRV| ^value 1. ^name |VariableTexture|) (<p1> ^name |CHS-Soar|) --> (<s1> ^operator <o1> > <o2>) } 8 /11 Experiments Three experiments conducted to investigate: 1. 2. 3. Integration of rule and constraint based reasoning Domain Independent Learning Scalability of externally learned chunks Problem types being considered: • • • • • • • Job Shop Scheduling (JSS) Map Colouring Radio Frequency Assignment Problem (RFAP) N-Queens, Sudoku, Latin Square Towers of Hanoi, Water Jugs Configuration Problems Random CSPs 9 /11 Results: Domain Independent Learning Decisions 100 Map Colouring (n = 11) Benchmark Internal External 90 80 70 Benchmark (HardCoded) MapCol (n = 11) JSS (n = 15) RFAP (n = 200) NQueens (n = 16) 75 Decisions Job Shop Scheduling (n = 15) 65 55 45 Benchmark (HardCoded) JSS (n = 15) MapCol (n = 11) RFAP (n = 200) NQueens (n = 16) Benchmark (HardCoded) RFAP (n = 200) MapCol (n = 11) JSS (n = 15) NQueens (n = 16) RFAP (n = 200) Decisions 3700 3600 3500 3400 10 /11 Conclusions Demonstrated integration of rule and constraint based reasoning Demonstrated the ability to learn rules while solving one problem type that can be successfully applied in solving another problem type Demonstrated ability to discover, learn and use multi-textured “hyper-heuristics” leading to improved solutions over traditional unary heuristics 11 /11