GENERAL PSYCHOLOGY
Lecture 6: LEARNING

Visiting Assistant Professor YEE-SAN TEOH
Department of Psychology, National Taiwan University

Unless noted, the course materials are licensed under Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Taiwan (CC BY-NC-SA 3.0)

LEARNING
• Habituation
• Classical Conditioning
• Operant Conditioning

Habituation
• The decline in an organism’s response to a stimulus once the stimulus has become familiar.
• Important function: helps us ignore old news so that we can pay attention to unfamiliar stimuli that may signal danger or an unexpected opportunity.
• Ignore inputs that are already familiar and unimportant, and focus on novel inputs.

Dishabituation
• An increase in an organism’s response to a familiar stimulus, caused by a change in the stimulus.
• Important function: calls attention to newly arriving information, which may signal danger or an unexpected opportunity.

CLASSICAL CONDITIONING
Pavlov’s Dogs
(Image: Wikipedia, ИльяГо(грохотайло))

History
• Pavlov was studying digestive physiology by examining the secretion of saliva in dogs.
• He found that salivation could be triggered by formerly neutral stimuli, like the sight of a food dish.

2 types of responses & 2 types of stimuli
• UR: Unconditioned Response
• US: Unconditioned Stimulus
• CR: Conditioned Response
• CS: Conditioned Stimulus

Unconditioned Response (UR)
• Biologically determined reflex.
• Triggered by a certain stimulus, independent of learning.
• E.g. salivation occurs reflexively when food is placed in the mouth.
• Salivation = UR
• Food in mouth = US

Unconditioned Stimulus (US)
• Trigger for the unconditioned response.
• E.g. food in the mouth is the trigger for the dog to salivate.

Unconditioned: the linkage between the US & UR is biological, not learned.

Conditioned Response (CR)
• Response triggered by some initially neutral stimulus (CS) as a result of pairings between that CS and an unconditioned stimulus (US).
• The CR is typically not identical to the UR, although it is often similar to it.
• E.g. if food in the dog’s mouth (US) is paired with the sound of a bell (CS), the dog will salivate (UR). With repeated pairings, the dog will soon salivate (CR) to the sound of the bell (CS) alone.
• The CR will be less intense than the UR.

Conditioned Stimulus (CS)
• An initially neutral stimulus that comes to elicit a new response as a result of being associated with the unconditioned stimulus (US) through repeated pairings.
• E.g. if a neutral stimulus, like the sound of a bell (CS), is paired with a US, like food placed in the mouth, the CS will become associated with the US, and the dog will salivate at the CS.

Conditioned: the linkage between the CS & CR is learned through conditioning (associating the CS with the US).

Acquisition of Conditioned Responses
Learning is gradual: the strength of the CR slowly grows as the animal experiences more and more pairings of the CS & US.

Second-Order Conditioning
• A neutral stimulus is paired with some already established CS and, through repeated pairings, the new CS begins to elicit the CR.
• Learn to associate bell + food (1st order).
• Learn to associate light + bell (2nd order).
• The light becomes a signal for the bell, which is associated with food.

Characteristics of Classical Conditioning
• Extinction
• Generalization
• Discrimination
• Contingency

Extinction
• Weakening of the tendency of the CS to elicit the CR, due to unreinforced presentations of the CS.
• Undoes a previously learned CR, so that the response is no longer produced.
• The CR will eventually disappear if the CS is presented several times by itself, without the US.
• Extinction does not mean forgetting.
• The mere passage of time is not enough to undo conditioning.
• The animal learns that the association between the CS and the US is no longer active.
• But if we pair the US with the CS again, we can recondition the animal.
• Reconditioning occurs much more quickly, because the animal retains some memory of the CS.
• Spontaneous recovery: the animal remembers that the CS used to signal the US and produces the CR in anticipation that the CS might signal the US again.

Example of Extinction & Spontaneous Recovery
Treating Anxiety Disorders Using Exposure Therapy
• The person is exposed to a specific stimulus or situation (CS) that has been a source of anxiety (e.g. heights, elevators).
• But the person is kept safe and comfortable, so there is no fearful US associated with the CS.
• Repeated presentations (CS without US) = extinction of the CR (feelings of anxiety).
• However, if after some time the person is suddenly exposed to the CS, the CR might follow.

Stimulus Generalization
• Animals respond to a range of new stimuli, provided that these stimuli are sufficiently similar to the original CS.
• E.g. if a light (CS) has been conditioned to elicit salivation (CR), the dog may salivate even if the light is slightly brighter or slightly dimmer than the original CS.
• However, the greater the distance between the new stimulus and the original CS (e.g. very loud bell and very soft bell), the weaker the CR will be.

Discrimination
• Process of learning to respond to certain stimuli that are reinforced, and not to others that are unreinforced.
• Example: A red light (CS) is paired with the blast of a horn (US), which produces a startle response (UR). The red light alone (CS) then produces a startle response (CR). With alternating trials of red light plus horn and orange light without horn, the animal learns to discriminate the colors and will startle only to the red light.

CS as a Signal
• The rate at which conditioning develops depends on how the CS and the US are related to each other in time.

Timing of Presentation             Effectiveness of Pairing
US before CS                       Not effective
CS & US simultaneously             Less effective
US shown a few sec after CS        Effectiveness declines sharply
US presented 0.5 sec after CS      Optimum

• The CS warns (signals) the organism that the US is coming so it can prepare itself.
• If the CS and US are presented too far apart, the organism might not connect the CS with the US.
• If the CS & US occur simultaneously, or if the US comes before the CS, the CS cannot signal that the US is coming (it has already happened).

Contingency
Contingency
• A contingent relationship exists when one stimulus (CS) reliably occurs when another stimulus (US) is about to appear.
• The first stimulus can be used to accurately predict the arrival of the US.
Contiguity
• When stimuli occur close together in time.
• It appears that the CR is acquired only when the CS predicts what is going to follow.

Surprise & Expectations
• If an animal is in a situation in which the CS is followed by the US 90% of the time, the animal will end up with strong expectations about what is going to happen next whenever the CS occurs.
• Animals have a set of expectations based on experience.

Relationship between the CR & UR
• The CR and UR are not the same.
• The animal reacts differently to the CS than to the US.
• The CS tells the animal to get ready, so the CR is the adjustment the animal makes in preparation for the US.
• Example: A bell (CS) signals food (US), so the mouth is moistened (CR) in anticipation; the food (US) then produces more saliva.

EXAMPLE OF CLASSICAL CONDITIONING
Body’s Responses to Drugs

Conditioning & Compensatory Responses
• Drug tolerance
  o Decrease in the response to a drug, usually resulting from continued use.
• Drug dependence & cravings
  o Inability to function without the drug.
  o Overwhelming desire for yet another dose.
  o Cravings accompanied by increased sensitivity to pain.
  o Depression.
  o Overproduction of fluid in the person’s mouth & mucous membranes.

Homeostasis: the stable environment that exists inside our bodies.
• US = Heroin
• UR = The body’s natural response to the drug.
• CS = Stimuli that signal the drug is about to arrive (sight of needle, thought about drug).
• CR = Opposite of the UR, since the body’s mechanisms of homeostasis work to cancel out the effects of the drug.
Body’s Compensatory Response
• Internally produced response.
• The body seeks to reduce the effects of some external influence by producing a reaction opposite in its characteristics to those of the external influence.
Example:
• The CS causes the body to increase pain sensitivity, countering the decrease in pain sensitivity caused by heroin. This cancels out heroin’s effect and produces drug tolerance while preserving homeostasis.
• There is no compensatory response when a person is exposed to the drug for the first time.
• After repeated exposures to the drug, learning occurs, and the compensatory response occurs.
• The person preserves homeostasis but also gets drug tolerance effects.

Drug cravings
• Cravings arise when a compensatory response occurs because the body is anticipating the arrival of the drug, but no drug is actually available.
• The person then experiences the compensatory response on its own (depression, pain sensitivity), which is felt as drug cravings.

OPERANT CONDITIONING
Skinner

Thorndike & the Law of Effect
(Edward Thorndike’s photo courtesy of Science Photo Library)

Cats in a Puzzle Box
• Problem for the cat to solve.
• How to open the door and escape from the box to get a reward.
• Cats learnt the solution gradually, with no sudden improvement.

Law of Effect
• The tendency of a stimulus to evoke a response is
  i. Strengthened if the response is followed by reward.
  ii. Weakened if the response is not followed by reward.
• If an animal makes a response and reward follows shortly, the response is more likely to be performed again.

Skinner & Operant Behavior
• Operant = an instrumental response defined by the effect it has (the way it operates) on the environment.
  i. An operant followed by a positive consequence is more likely to be repeated in the future.
  ii. An operant followed by a negative consequence is less likely to be repeated in the future.
Behavior can be changed by providing or removing rewards and punishments.
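The Law of Effect above can be illustrated with a small numerical sketch. The lecture gives no formula, so the linear update rule, the starting strength of 0.5, and the `rate` parameter below are all assumptions chosen only to show the direction of change: response strength drifts up on rewarded trials and down on unrewarded ones, and it does so gradually, as with the cats in the puzzle box.

```python
def update_strength(strength, rewarded, rate=0.2):
    """Nudge response strength toward 1 after a reward, toward 0 otherwise.

    This linear rule is a hypothetical illustration, not a model from the lecture.
    """
    target = 1.0 if rewarded else 0.0
    return strength + rate * (target - strength)


def run_trials(outcomes, strength=0.5, rate=0.2):
    """Return the response strength before the first trial and after each trial."""
    history = [strength]
    for rewarded in outcomes:
        strength = update_strength(strength, rewarded, rate)
        history.append(strength)
    return history


# Five trials in which the response is always followed by reward ...
rewarded_history = run_trials([True] * 5)
# ... and five trials in which it never is.
unrewarded_history = run_trials([False] * 5)
```

In this sketch `rewarded_history` rises step by step and `unrewarded_history` falls step by step; neither jumps to its limit in a single trial.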
Classical Conditioning
• Builds on a response (UR) that is triggered reflexively by a stimulus (US).
• Learning the association between two stimuli (US & CS).
• Learning trials typically involve presentation of a CS followed by a US.

Operant Conditioning
• Involves behaviors that are voluntary.
• Learning the association between a response and a stimulus (the operant and a reward).
• Learning trials typically involve a response followed by a reward or reinforcer.

Reinforcers
• Presentation of something good, or the termination or prevention of something bad.
• Increases the likelihood of the response occurring again.
Examples
• Receiving something good (candy, gold stars, praise, etc.)
• Termination or prevention of something bad (avoiding being grounded, terminating loud noise)
A reinforcer can generally be determined only after a trial: if a response is repeated to gain the reward, the stimulus is a reinforcer.

Discrimination
• Discriminative stimuli = external stimuli that signal a particular relationship between a response and the reinforcer.
• Positive discriminative stimulus (S+)
• Negative discriminative stimulus (S-)
Example:
• Green light = S+ when it signals to a bird that it will get food if it hops on a lever.
• Red light = S-, which indicates that the action will not lead to reward.

Generalization (same as in classical conditioning)
• Animals respond to a range of new stimuli, provided that these stimuli are sufficiently similar to the original discriminative stimulus S+.
• However, the bigger the difference between the new stimulus and the original S+ (e.g. very bright light and very dim light), the weaker the response will be.

Contingency (same as in classical conditioning)
• There needs to be some predictive relationship.
• The response needs to be predictive of the reward, so that the probability of getting the reward after the response is greater than the probability of getting the reward without the response.
• Predictability helps give the animal control.
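The contingency condition above is a comparison of two conditional probabilities, which can be worked through with simple arithmetic. The sketch below is only an illustration: the function name `contingency` and the trial counts are hypothetical and do not come from the lecture.

```python
def contingency(trials):
    """Return (P(reward | response), P(reward | no response)).

    `trials` is a list of (responded, rewarded) boolean pairs.
    """
    with_response = [rewarded for responded, rewarded in trials if responded]
    without_response = [rewarded for responded, rewarded in trials if not responded]
    if not with_response or not without_response:
        raise ValueError("need at least one trial with and one without a response")
    p_with = sum(with_response) / len(with_response)
    p_without = sum(without_response) / len(without_response)
    return p_with, p_without


# Hypothetical data: 10 trials with a response (8 rewarded)
# and 10 trials without a response (2 rewarded).
trials = (
    [(True, True)] * 8 + [(True, False)] * 2
    + [(False, True)] * 2 + [(False, False)] * 8
)

p_with, p_without = contingency(trials)
# The response predicts the reward only if the first probability is larger.
is_contingent = p_with > p_without
```

With these made-up counts the reward is more likely after a response than without one, so the response is predictive of the reward. If the two probabilities were equal, responding would give the animal no control over the outcome.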
LEARNED HELPLESSNESS
A Contingency Breakdown

What is learned helplessness?
• Condition created by exposure to inescapable, aversive events.
• Impairs or prevents learning in subsequent situations in which escape or avoidance is possible.
• If there is no contingency between acts and outcomes, there is no point in trying.

Seligman’s Dog Study
• Two groups of dogs received the same amount of electric shocks.
• One group was given control: the dogs could turn off the shock by pressing a panel.
• One group had no control.
• Dogs with control in the 1st task learned to escape & avoid shocks in the next task.
• Dogs with no control in the 1st task did not try to escape in the next task.

BEHAVIORAL CONTRAST & MOTIVATION

Behavioral Contrast
• An organism seems to evaluate a reward relative to other rewards that are available or that have been available recently.
• Example: A teen is not likely to be willing to sweep the floors for $100 if he received $200 the last time he swept the floors.

Motivation according to type of reward
INTRINSIC
• Reward serves some internal need.
• Engage in the activity because of the pleasure of the activity.
EXTRINSIC
• Reward serves some external need.
• Under control of the experimenter.

Schedules of Reinforcement
• Rules about how often and under what conditions a response will be reinforced.
• Ratio schedule: the person/animal is rewarded for producing a certain number of responses.
• Interval schedule: the person/animal is rewarded for producing a response after a certain period of time has passed.

LATENT LEARNING
Learning that takes place without any corresponding change in behavior.

Tolman’s Rats
• Had rats exploring a maze without a reward for 10 days.
• No change in behavior.
• On the 11th day, gave food as a reward for getting to the end of the maze.
• The rats ran to the end virtually without error.
• Knowledge of the maze was acquired but not used until a reward was given to motivate the rats.
OBSERVATIONAL LEARNING

Observational Learning
• Process through which we watch how others behave and learn from their example.
• This type of learning is found in many species.

Mirror Neurons
• Located in the frontal lobe, near the motor cortex.
• Fire whenever an animal performs an action, and whenever the animal observes someone else performing the same action.

Bandura’s Classic Bobo Doll Study (1969, 1977)
• Children who had observed an adult behaving aggressively with a Bobo doll were more likely to behave aggressively with the Bobo doll,
• compared with children who viewed an adult who did not behave aggressively with the Bobo doll.
(Image: AP Psychology Learning)

Neural Basis for Learning

Where?
• Brain circuits underlying fear conditioning are centered in the amygdala.
• Brain circuits underlying eyeblink conditioning are centered in the cerebellum.
• Conditioning with a long delay between the CS and the US typically involves the hippocampus.

Neural Plasticity
• Capacity for neurons to change the way they function as a consequence of experience.
• Involves changes at the synapse: how neurons communicate with each other.

Presynaptic Facilitation
• Some neurons, after learning, end up sending a stronger signal than they did before.
• May release more neurotransmitter than they did before learning.

Postsynaptic Changes
Long-term Potentiation (LTP)
• The postsynaptic neuron becomes more sensitive (potentiated) to the signal received from the presynaptic neuron, and this change lasts for days or weeks (long-term).

Formation of New Synapses
• Learning can lead to the creation of entirely new connections among neurons: new synapses.
• Changes take place mostly on the dendrites: new dendritic spines.

Consciousness

Two Aspects of Consciousness
Level of alertness or sensitivity
• Distinction between being dimly aware of something and being highly alert.
• Corresponds to brain sites: thalamus or reticular activating system.
• Alertness is disrupted with damage to the system that controls overall arousal and the cycling of sleep & wakefulness.
Content of Consciousness
• May be thinking about the past, present, or future.
• Various contents of consciousness require different brain sites.
• In dreaming, we are conscious of a richly detailed content, but our level of sensitivity to the environment is low.

DRUG-INDUCED CHANGES IN CONSCIOUSNESS
Depressants, Stimulants, Marijuana, Hallucinogens

Depressants
• Drugs that decrease activity in the nervous system.
• Alcohol, sleeping pills, drugs used to treat anxiety.

Effects of alcohol
• At low doses: produces feelings of pleasure & well-being.
• But also depresses activity in the neural circuits that control our impulses.
• Narrowing of attention.
• Thinking is tied to the here and now, with little attention to possible consequences of one’s actions.
• Effects derive from both the actual impact on the brain and people’s expectations: when participants ‘believed’ that they had consumed alcohol, they behaved with less inhibition.

Effects of Sleeping Pills, Anti-Anxiety Drugs
• Can produce physical & psychological dependence.
  - Withdrawal symptoms after prolonged use are quite likely: enhanced anxiety, insomnia.
• Can produce drug tolerance.
  - When a person becomes less sensitive to the drug, higher & higher doses are needed to achieve the same effects.
  - Risks of high doses causing cognitive impairment or even coma.

Stimulants
• Drugs that stimulate the nervous system and broadly increase the level of bodily arousal.
• Can raise blood pressure, increase heart & breathing rate.
• Increase overall alertness, boost energy.
• Decrease the need for sleep.
• Can lead to psychological & physical dependence with withdrawal symptoms.
• Caffeine, cocaine, amphetamines (incl. MDMA/ecstasy), Ritalin.

Effects of Cocaine
• Temporary rush of excitement and euphoria.
• Following arousal, the person comes crashing down and feels fatigue and depression.
• Risk of dependence.
• Feelings of paranoia.
• Risk of cardiac arrest, respiratory problems.

Copyrights

Page   Author/Source
6      Wikipedia: unknown. http://commons.wikimedia.org/wiki/File:Ivan_Pavlov_LIFE.jpg (visited 2012/04/11)
6      Wikipedia: ИльяГо(грохотайло). http://en.wikipedia.org/wiki/File:MonumentIPAVLOV.jpg (visited 2012/04/11)
16     National Taiwan University, YEE-SAN TEOH
36     This photo is from Science Photo Library, http://www.sciencephoto.com/media/228950/enlarge, and is used in accordance with its terms of use, http://www.sciencephoto.com/terms.html, for educational purpose only.
36     Wikipedia: Jacob Sussman. http://en.wikipedia.org/wiki/File:Puzzle_box.jpg (visited 2012/04/11)
56     AP Psychology Learning. https://laurensaplearning.wikispaces.com/Bandura (visited 2012/04/11)