* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Chapter 4 Learning - Western Washington University
Verbal Behavior wikipedia , lookup
Learning theory (education) wikipedia , lookup
Behavior analysis of child development wikipedia , lookup
Behaviorism wikipedia , lookup
Psychophysics wikipedia , lookup
Psychological behaviorism wikipedia , lookup
Eyeblink conditioning wikipedia , lookup
Classical Conditioning Learning: What does it mean to learn? • Learning is the single largest area of Psychology second only to clinical psychology I. Classical Conditioning – learning through association • Philosophical roots: English Empiricists • John Locke – Primary and Secondary Qualities • David Hume – Reflection , Cause and Effect Ideas • Anything that stimulates the CNS 1. Pavlov and the conditioned reflex • The procedure is what distinguishes classical conditioning from other modes of learning Classical Conditioning in America was Stated by Watson and Rayner (1920) • Albert was set on a rug, held by Rayner and presented with a fuzzy rabbit. • Watson slammed two sticks together behind Albert as he reached for the animal. • After several trials Albert began to cry when the rabbit was presented Watson, Rayner and Little Albert The Three Phase Process of Classical Conditioning Habituation • Works at the neuron level to organismic level. • Habituation decreases the responses over time both for magnitude and frequency per unit time. • The most basic level of learning. Requires no reinforcement. Habituation (cont.) • Habituation allows the organism to ignore irrelevant stimuli. • Dishabituation is the stopping of habituation to an attended stimuli Condition Stimulus (CS) • A neutral stimulus, can be sound, light or internal craving. Uncondition Stimulus (UCS) • The natural stimulus that drives the CNS to elicit the reflex. • In the vernacular of instrumental learning this hookup of the UCS-UCR is known as the reinforcement. Uncondition response (UCR) • The basic reflex that is become controlled by procedure. Reinforcement vs Reward • One reinforces a response! • One rewards the animal! Condition Response (CR) • The response when driven by the Condition Stimulus 2. Major phenomena of classical • Two types of “effective” procedures that lead the animal to predict up coming events. Delay Conditioning: requires only recognition Critical Measures: ½ mm movement is a response, Latency to onset of movement, Latency to peak of response. 3 Characteristics of Delay Conditioning • The CS starts before the UCS • There is a time period between when the CS in On and the start of the UCS – known as the Inter-Stimulus-Interval (ISI). • Both the CS and the UCS co-terminate. The association • The explicit pairing of the CS with the UCS • (in eye-blink, if tone and then air-puff) sets up the contingency that leads the animal to predict what is to come. • This analysis of expectation was put forth by Rescorla. Trace Conditioning: requires memory 3 Characteristics of Trace Conditioning • The CS starts before the UCS • The CS turns “OFF” before the UCS comes “ON”. • The time period between the CS turns “Off” and the UCS comes “On” is the trace interval (the inter – stimulus – interval). • The animal must remember to delay its response, and respond only at the end of the trace interval. Backward Conditioning Backward Conditioning (cont) • UCS comes “ON” before the CS. • UCS goes “OFF” with “ON- set” of CS • Ineffective for producing a conditioned response. Simultaneous Conditioning: very hard to show. Formal Classical Conditioning is a Three Stage Process • Habituation – Paired Training - Extinction Pavlov’s study • Condition stimulus, CS, was the sound of a metronome (will become the condition reinforcer) • The Uncondition Stimulus (UCS) was meat powder (the reinforcer). • The uncondition response was drops of salivation. • Condition response was salivation. Trial - By - Trial Presentation • Trials are presented singularly and continues until some criterion is met or a fixed number of trials completed. Paired trials • When both the CS and the UCS are presented in a fixed order and time frame • (an explicitly paired trial). CS Alone Trials • Periodically only the CS is presented • Known as CS alone trials. • Measure the effectiveness of the CS to drive the behavior Statistics • Mean and SD change in the timing or magnitude of the paired trials. • Mean and SD of the CS alone trials Results • In the beginning trials the CS does not elicit the CR • Across trials there is a change in the ability of the CS to drive the CR. The change increase as the trials progress. • The CR looks like the UCR, but micro analysis of the two behaviors will show clear difference Results of Explicit Pairing CSUCS • By the end of the trials, both for paired and CS alone trials, the CS is able to elicit the CR. • Magnitude increases, time shortens or frequency changes for the behavior How does one know that one has stimulus control? • Use Extinction trials. Present the trials in the manner of Habituation (but is not habituation) • Repeated presentation of the CS alone will drive the changes that were learned during CS presentation to zero. EXTINCTION • Note at the beginning of day 2, the first response is larger than last response on day one. Called spontaneous recovery. Generalization vs discrimination • Generalization is the tendency to respond to like stimuli the same way. • If a rabbit is trained on a 1200Hz tones, to a critiereon, his percent correct for a 800, or a 1600Hz tone will be less than the 1000Hz tones • See next slide Generalization gradient Discrimination training • The correct stimulus is explicitly paired with a given reinforcement (see Mauk and Ruiz slides 65- 71 below). • Or two different responses are paired with two explicitly different stimuli. The vast array of stimuli and condition responses Eyeblink Studies • First done by Ernest Hilgard in humans, dogs and monkeys 1928 – 1936). • Modern Studies were started by Isidore Gormenzano • A Model Animal – The Rabbit, its blink rate is 1 or 2 time a minute. Eye lid Conditioning • In the U.S. Ernest Hilgard studied eyelid conditioning in three major papers, 1929, 1932, 1936. He used dogs, monkeys and humans. He set the procedures for all neurosciences and their use of CC Modern Studies started in 1964 Gormaenzano – modern study of eyelid conditioning Mauk studies as representational of eyelid conditioning in the rabbit. • Demonstrated, by brain removal studies in the rabbit, that classical conditioning could be mediated by the cerebellum alone. Cerebellar brain circuit for mediating classical conditioning. What Is The Behavior Measured • A very sharp distinguishable criterion of what constitutes the movement of an eyelid. Eyelid Movement Changed to an Electrical Potential • An Stainless Steel Wire (0.007 in. by 0.018 cm) Extended From the suture loop in the eyelid to a phototransistor potentiometer. Optional slide • An infrared emitter is bounced off the eyelid. • The reflected beam passes through two small Polaroid filters, one fixed and one operated by the eyelid closure. • The two Polaroid filters are set so that infrared light passes through both filters to an infrared receiver when the eyelid is up. The receiver turns the infrared light into volts, depending on the amount of infrared light received. When the rabbit blinks its eye the wire is pulled and rotates the moveable Polaroid filter. This movement reduces the amount of light transmitted, lowering the voltage. • The movement of the eyelid, closure (distance in mm), was calibrated to voltage drop by the potentiometer. • A movement of ½ mm of eyelid was defined as an eyeblink. CS • Sine Waves of Various Frequencies presented at an intensity of 80dB (SPL) UCS Airpuff • UCS was calibrated to present to the eye and airpuff of 2 N/cm2 delivered to the cornea of the rabbit through, a 1 mm tuberculin syringe, positioned approximately 1 cm from the cornea. Criterion Learning • How Many Trials to Learn to a Fixed Criterion: Eight Correct Closures Out of Ten. Fixed Trial Procedure • Percent of Closures in X Number of Trials Most Frequently Used Procedure • Total of 108 Trials Divided in 12 Blocks • First Trial of each block is a CS Alone Trial • ACROSS 108 TRIALS: 12 CS ALONE TRIALS AND 12 BLOCKS OF PAIRED CS/UCS TRIALS Critical Measures: ½ mm movement is a response, Latency to onset of movement, Latency to peak of response What IS MEASURED • NUMBER OF TRIALS IN WHICH AN EYEBLINK WAS MADE. • LATENCY to ONSET: Time it takes from CS Onset to beginning of eye-blink (previously defined in terms of mm of movement). Mauk & Ruiz • The differential learning (conditioning) with two different ISIs. Hypothesis: • The rabbit can learn to discriminate between two different ISIs. • The rabbit is learning to time its eye-blink to two different inter – stimulus – intervals. • The rabbit is learning to time its behavior of eyelid closing! Delay Conditioning: Two or more ISIs • Manipulating the ISI is one of the independent variables. The Independent Variables: CS1 and CS2 and the position of the US (UCS) Note • The US, in Mauk’s terms, in the above slides, must be manipulated to coterminate with the CS. The UCS (air-puff) is always the in duration, 100ms. It occurs with the last 100 ms of the CS to coterminate. The Dependent Variable • Getting the eye closed at the appropriate time The Dependent Variable: The wiggly lines above CS1 and CS2 Acquisition of Conditioned Eyelid Closure (two different ISIs) Latency To Onset of Eyelid Closure Mauk & Ruiz: Six Different ISI to Six Different tones Note the Difference in Behavior of the Eyelid to Different ISIs Three Conditioned Eyelid Closures Elicited by Three Different ISI Associated Tones TRACE CONDITIOING 1. Acquisition of conditioned • What is measured or counted • Number of blinks across trials for CS alone • Number of blinks across trials for paired CS/UCS • Latency to Onset. Neural Response Berger et al High-Order Conditioning • One can use the CS as the UCS CS USED AS A UCS OVER SHADOWING • The salience of one cue is greater than that of the other cue. • The more salient cue overshadows the less salient cue. The less salient cue can not be used to predict the contingency. TEST INDIVIDUAL CUE TEST INDIVIDUAL CUE, LIGHT BLOCKING The behavior in Phase III • In Blocking, when tested in Phase III, Group I does not demonstrate a CS to the sound. The light has control over the behavior in Phase II and the sound becomes blocked. • When tested in Phase III, Group II shows CSs to both sound and light. Test for Sound Group I TEST GROUP II Condition fear in a dog. Sound of the horn comes to signal the shock. The dog jumps over the divider. • Panel 1 shock occurs with no cue. Dog jumps over barrier only when the shock appears. Non-cued shock increases fear and anxiety. • Panel 2 a loud tone is presented before shock appears. A contigency is set up The Condition Emotional Response CER • The sequence described above is how emotional responses get tied to external cues. • The reduction of anxiety is a positive state of affairs for the animal, i.e., a positive reward. • In the above dog case the dog avoids the negative state off affairs, fear, leading to anxiety. • between the shock and the tone. The tone comes to predict the shock. Anxiety is high before tone is on. • Panel 2 the dog comes to jump, before the shock when it hears the tone. Anxiety starts to drop as the dog starts to jump. Note the shock has not appeared. • Panel 3 Dog is jumping, shock appears, anxiety goes down. • Drug habits in humans leads to the paraphernalia of the habit to become condition stimuli for the high response, coupled with avoidance behavior of the withdrawal symptoms. The withdrawal symptoms leading to anxiety coupled to fear. What is learned in Classical Conditioning? • The animal learns an “if – then” contingency relationship between the CS and the UCS. • The CS increases the probability that the UCS will follow. • The animal has increasing predictability that the UCS will follow the CS, thus gaining control off the situation!