* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Neural Networks in Computer Science
Survey
Document related concepts
Transcript
Methodology of Simulations CS/PY 399 Lecture Presentation # 19 February 21, 2001 Mount Union College Simulating Systems (Ch. 2, Plunkett & Elman) To study complex systems of interacting components, we can: 1) Study real systems – can we do this without disturbing the system to the point that results are flawed? 2) Build a model of the system and simulate actual system operation – does model include all important aspects of the actual system? Advantages of Modeling and Simulation correlation between model components and brain structure – we can include more relevant details than in a verbal description of system operation non-linear nature of Neural Networks makes purely mathematical analysis difficult or impossible surprising behavior can be uncovered – leads to new experimental hypotheses Advantages of Modeling and Simulation It is easier to dissect a model than a brain! – model will work after dissection We can begin to discover why some behavior arises from connected neuron clusters by studying node activations Simulation as Experiment Several factors must be defined before we use the computer to simulate a network Define problem or goal – purpose of the experiment – e.g., to see if a network can learn letter sequences Simulation as Experiment Formulate a hypothesis – e.g., a feed-forward network can learn letter sequences Design a mechanism for testing the hypothesis – e.g., see if training data can train network with a global error rate < ε Evaluate results of simulation Decisions when Designing Connectionist Simulations Task – what is the desired behavior (output) for the network – defined by the training environment • input/output patterns in training set – network is taught by example • does not learn rules; learns to recognize patterns • is this what happens with humans? – e.g., multiplication tables Decisions when Designing Connectionist Simulations Stimulus Representation – our models will use a numerical representation of input stimuli – vector notation used for multiple inputs • defines an n-dimensional space, for n inputs – output also numeric Decisions when Designing Connectionist Simulations Network Architecture – number and arrangement of nodes – one large, completely connected network? • simplest to design – several sub-networks connected in some configuration? • this is more like brain structure – feed-forward vs. recurrent connections – these decisions will powerfully affect the network’s capabilities Completely-Connected Network Layered Feedforward Network Recurrent Network Analyzing the Simulation after Training Global Error – output produced vs. output desired averaged over all patterns – perfect learning: error = 0 – can be misleading, especially if there are relatively few patterns that give bad results • e.g. Quicksort average vs. worst-case behavior – avg. time for 1 million names: 20,000,000 ops. – worst-case: 1,000,000,000,000 (1 trillion) ops. Analyzing the Simulation after Training Individual Pattern Error – looking at specific interesting patterns • too many patterns, in general, to examine all of them – also, we may look at novel patterns (not part of the training set) • spontaneous generalization, etc. – what has the network learned? • rote information vs. generalized knowledge Analyzing the Simulation after Training Inspecting weights and internal network settings – to see how the network is arriving at conclusions – comparison of starting weights with final trained weight set is often illuminating – after training, it is often useful to examine specific hidden node activations for a selected input pattern • what part of the network is triggered? Methodology of Simulations CS/PY 399 Lecture Presentation # 19 February 21, 2001 Mount Union College