* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Learning - WordPress.com
Survey
Document related concepts
Behavior analysis of child development wikipedia , lookup
Learning theory (education) wikipedia , lookup
Insufficient justification wikipedia , lookup
Psychophysics wikipedia , lookup
Behaviorism wikipedia , lookup
Psychological behaviorism wikipedia , lookup
Transcript
Chapter 6 Learning Copyright © Allyn & Bacon 2007 What is learning? See handout 6-2 With your aisle partner, discuss whether you think each of the scenarios is an example of learning. Copyright © Allyn & Bacon 2007 Learning Learning – A process through which experience produces lasting change in behavior or mental processes Hilgard and Bower’s definition: the relatively permanent change in behavior in a given situation brought about be repeated experiences in that situation, provided that the behavior can’t be explained by native response tendencies, maturation, or temporary states (fatigue, drugs, etc.) Copyright © Allyn & Bacon 2007 Learning In this chapter we will not learn about: Instincts: they are innate, NOT learned Maturation: normal stages will be covered in an upcoming chapter Memory: covered in a previous chapter Copyright © Allyn & Bacon 2007 Simple Learning Mere exposure effect – Learned preference for stimuli to which we have been previously exposed (e.g. advertising) Habituation – Learning not to respond to repeated presentation of a stimulus Copyright © Allyn & Bacon 2007 Complex Learning Behavioral learning – learning achieved through manipulation of stimuli and responses (e.g. classical and operant conditioning) B. F. Skinner’s behaviorist manifesto: 1. psychology’s concern should be behavior 2. methods should be objective, not introspective 3. goal should be “prediction and control of behavior” rather than understanding of mental events Copyright © Allyn & Bacon 2007 The Essentials of Classical Conditioning Neutral stimulus – produces no conditioned response (CR) prior to learning (Pavlov) Copyright © Allyn & Bacon 2007 The Essentials of Classical Conditioning Unconditioned stimulus (UCS) Unconditioned response (UCR) Conditioned stimulus (CS) Conditioned response (CR) Copyright © Allyn & Bacon 2007 The Essentials of Classical Conditioning Unconditioned stimulus (UCS) The stimulus that elicits an UCR Unconditioned response (UCR) Conditioned stimulus (CS) Conditioned response (CR) Copyright © Allyn & Bacon 2007 The Essentials of Classical Conditioning Unconditioned stimulus (UCS) Unconditioned response (UCR) The response elicited by a UCS without prior learning Conditioned stimulus (CS) Conditioned response (CR) Copyright © Allyn & Bacon 2007 The Essentials of Classical Conditioning Unconditioned stimulus (UCS) Unconditioned response (UCR) Conditioned stimulus (CS) A previously neutral stimulus that comes to elicit the CR Conditioned response (CR) Copyright © Allyn & Bacon 2007 The Essentials of Classical Conditioning Unconditioned stimulus (UCS) Unconditioned response (UCR) Conditioned stimulus (CS) Conditioned response (CR) A response elicited by a previously neutral stimulus that has become associated with the UCS Copyright © Allyn & Bacon 2007 Classical conditioning Close your eyes and imagine you are floating in warm ocean waters. Relax In this scene, what is the UCS, UCR, CS, and CR? Let’s practice this with a handout. Copyright © Allyn & Bacon 2007 Classical conditioning Acquisition – Initial learning stage; CR (salivating) becomes elicited by the conditioned stimulus (CS) Copyright © Allyn & Bacon 2007 Classical Conditioning Extinction – Weakening of a conditioned association in the absence of a UCS -many of you no longer respond to the horn. Spontaneous recovery – Reappearance of an extinguished conditioned response after a time delay Copyright © Allyn & Bacon 2007 (1) Acquisition (CS + UCS) (2) Extinction (CS alone) Rest period Strength of the CR (Weak) (Strong) Acquisition, Extinction, and Spontaneous Recovery (3) Spontaneous Recovery (CS alone) (Time) Trials Copyright © Allyn & Bacon 2007 Classical Conditioning: Generalization and Discrimination Stimulus generalization: giving a CR to stimuli that are similar to the CS (Little Albert) Stimulus discrimination: responding to one stimulus but not to stimuli that are similar Copyright © Allyn & Bacon 2007 Applications of Classical Conditioning Taste-aversion learning – learning to avoid food after a single experience, if eating it is followed by illness (caveat: if you get sick DIRECTLY after eating a food, you often won’t get taste aversion) Copyright © Allyn & Bacon 2007 A Challenge to Pavlov Why are some associations readily learned while others are resistant to learning? Contiguity: the nearer the better The Seahawk-balloon pairing might be too far apart Contingency: the more regular the better The UCS must ALWAYS be paired with the CS during acquisition Novelty: the newer the better It would be hard to make a CS out of the school bell, because you hear it twelve times a day. It’s not new. Copyright © Allyn & Bacon 2007 Association Principle Think of your favorite soup: Now imagine the soup is served to you in an ordinary bowl, but has been stirred by a thoroughly washed, used flyswatter. How much would you like to eat that soup, on a scale of 1=not at all to 9=very much? Copyright © Allyn & Bacon 2007 If that flyswatter were brand new, how much would you like to eat the soup, on a scale of 1=not at all to 9=very much? Copyright © Allyn & Bacon 2007 If the soup were first stirred with a thoroughly washed but used comb, how much would you like to eat that soup, on a scale of 1=not at all to 9=very much? Copyright © Allyn & Bacon 2007 If the soup were served in a thoroughly washed, used dog bowl, how much would you like to eat that soup, on a scale of 1=not at all to 9=very much? Copyright © Allyn & Bacon 2007 Of 143 participants… 1. 82% gave a four or lower 2. 58% gave a four or lower 3. 76% gave a four or lower 4. 71% gave a four or lower Copyright © Allyn & Bacon 2007 Association principle Linking two events that occur close together but are unrelated Association principle applies in both CC and OC Accounts for superstition Copyright © Allyn & Bacon 2007 Before we get to operant conditioning… Can I have two volunteers for an experiment? You will need to leave the room for a few moments as I prepare the experiment. Shaping – reinforcers guide behavior toward closer and closer versions of the desired behavior. Copyright © Allyn & Bacon 2007 The Skinner Box Copyright © Allyn & Bacon 2007 How Do We Learn New Behaviors by Operant Conditioning? Trial-and-error learning – Learner gradually discovers the correct response by attempting many behaviors and noting which ones produce the desired consequences Copyright © Allyn & Bacon 2007 How Do We Learn New Behaviors by Operant Conditioning? In operant conditioning, the consequences of behavior, such as rewards and punishments, influence the chance that our behavior will occur again Copyright © Allyn & Bacon 2007 Copyright © Allyn & Bacon 2007 Skinner’s Radical Behaviorism B.F. Skinner believed that the most powerful influences on behaviors are their consequences Based ideas on Thorndike’s Law of Effect: behaviors followed by favorable consequences become more likely, and behaviors followed by unfavorable consequences become less likely. Copyright © Allyn & Bacon 2007 The Power of Reinforcement Positive reinforcers – Stimulus presented after a response that increases the probability of that response happening again Negative reinforcers – Removal of an unpleasant stimulus, contingent on a particular behavior Copyright © Allyn & Bacon 2007 Contingencies of Reinforcement Copyright © Allyn & Bacon 2007 The Power of Reinforcement Primary reinforcers – Reinforcers, such as food and sex, that are innate because of their biological value Copyright © Allyn & Bacon 2007 Copyright © Allyn & Bacon 2007 The Power of Reinforcement Secondary reinforcers – Stimuli, such as money or tokens, that acquire their reinforcing power by their learned association with primary reinforcers (also called conditioned reinforcers) Copyright © Allyn & Bacon 2007 The Problem of Punishment Punishment – A stimulus which diminishes the strength of the response it follows How does this differ from negative reinforcement? Copyright © Allyn & Bacon 2007 Punishment vs. Negative Reinforcement Negative Reinforcement Response Consequence Loud Noise Press Lever Loud Noise Removed Press Lever Loud Noise Applied Punishment No Noise Copyright © Allyn & Bacon 2007 The Problem of Punishment Positive punishment – The application of an aversive stimulus after a response Omission training (negative punishment) – The removal of an appetitive stimulus after a response Copyright © Allyn & Bacon 2007 Four Kinds of Consequences STIMULUS Positive or appetitive + Present Remove Positive Reinforcement Bonus for working hard leads to more hard work Negative or aversive Punishment Getting speeding ticket leads to less speeding Omission Training Negative Reinforcement Missing dinner leads to less staying out late Aspirin curing headache causes more aspirin use Copyright © Allyn & Bacon 2007 Copyright © Allyn & Bacon 2007 CC or OC Copyright © Allyn & Bacon 2007 Let’s practice With an aisle partner, complete the back of the handout I gave you last week. Copyright © Allyn & Bacon 2007 Yay! If you know the difference between punishment and negative reinforcement, you are smarter than Bill Murray. Copyright © Allyn & Bacon 2007 Contingencies of Reinforcement Continuous reinforcement – Reinforcement schedule in which all desired responses are reinforced Partial reinforcement – Reinforcement schedule in which some, but not all, desired responses are reinforced (also called intermittent reinforcement) Copyright © Allyn & Bacon 2007 Schedules of Partial Reinforcement Ratio schedules – Provide reward after a certain number of responses; produce higher response rates than… Interval schedules – Provide reward after a certain time interval Fixed Ratio (FR) Variable Ratio (VR) Fixed Interval (FI) Variable Interval (VI) Copyright © Allyn & Bacon 2007 Schedules of Reinforcement Fixed Ratio (FR) Variable Ratio (VR) Fixed Interval (FI) Variable Interval (VI) Rewards appear after a certain set number of responses e.g. factory workers getting paid after every 10 cases of product are completed Effect: activity slows after reinforcer and then picks up; goes extinct quickly without constant reinforcement Copyright © Allyn & Bacon 2007 Schedules of Reinforcement Fixed Ratio (FR) Variable Ratio (VR) Fixed Interval (FI) Variable Interval (VI) Rewards appear after a certain number of responses, but that number varies from trial to trial e.g. slot machine payoffs Effect: responding rate is high and stable; most resistant to extinction Copyright © Allyn & Bacon 2007 Schedules of Reinforcement Fixed Ratio (FR) Variable Ratio (VR) Fixed Interval (FI) Variable Interval (VI) Rewards appear after a certain fixed amount of time, regardless of number of responses e.g. weekly or monthly paychecks Effect: activity increases as deadline nears; can cause fast extinction Copyright © Allyn & Bacon 2007 Schedules of Reinforcement Fixed Ratio (FR) Variable Ratio (VR) Fixed Interval (FI) Variable Interval (VI) Rewards appear after a certain amount of time, but that amount varies from trial to trial e.g. random visits from the boss who delivers praise Effect: steady activity results, good resistance to extinction Copyright © Allyn & Bacon 2007 As a rule: Variable is more resistant to extinction than fixed Ratio is more resistant to extinction than interval Copyright © Allyn & Bacon 2007 Reinforcement schedules Copyright © Allyn & Bacon 2007 Let’s try some practice figuring out reinforcement schedules Handout Copyright © Allyn & Bacon 2007 The Use and Abuse of Punishment Power usually disappears when threat of punishment is removed Punishment • • • Often triggers aggression May inhibit learning new and better responses Is often applied unequally Reinforcement should be chosen over punishment whenever possible Copyright © Allyn & Bacon 2007 Alternatives to Punishment Reinforcing preferred activities Premack principle: a more preferred activity can reinforce a less preferred activity Shaping Copyright © Allyn & Bacon 2007 Operant and Classical Conditioning Compared Classical conditioning involves the association of two stimuli (UCS + CS) before the response or behavior Operant conditioning involves a reinforcing (reward) or punishing stimulus after a response or behavior A video that summarizes classical and operant conditioning. Copyright © Allyn & Bacon 2007 How Does Cognitive Psychology Explain Learning? According to cognitive psychology, some forms of learning must be explained as changes in mental processes, rather than as changes in behavior alone Copyright © Allyn & Bacon 2007 How Does Cognitive Psychology Explain Learning? Insight learning – Problem solving occurs by means of a sudden reorganization of perceptions Cognitive maps – A mental representation of physical space (contrasted with chained association [knowing the turns]) Copyright © Allyn & Bacon 2007 Cognitive Maps Copyright © Allyn & Bacon 2007 Observational Learning: Bandura’s Challenge to Behaviorism Observational learning – Form of cognitive learning in which new responses are acquired after watching others’ behavior and the consequences of their behavior—can be prosocial or antisocial Bandura’s Doll Experiment; more video Latent learning: learning you didn’t know you’d learned until required to apply it. Copyright © Allyn & Bacon 2007 Brain Mechanisms and Learning Long-term potentiation – Physical changes that strengthen the synapses in groups of nerve cells; the neural basis of learning Copyright © Allyn & Bacon 2007 Operant and Classical Conditioning In action Copyright © Allyn & Bacon 2007 End of Chapter 6 Copyright © Allyn & Bacon 2007