* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download What is Learning?
Survey
Document related concepts
Neuroeconomics wikipedia , lookup
Attribution (psychology) wikipedia , lookup
Theory of planned behavior wikipedia , lookup
Theory of reasoned action wikipedia , lookup
Learning theory (education) wikipedia , lookup
Applied behavior analysis wikipedia , lookup
Verbal Behavior wikipedia , lookup
Adherence management coaching wikipedia , lookup
Psychophysics wikipedia , lookup
Behavior analysis of child development wikipedia , lookup
Eyeblink conditioning wikipedia , lookup
Insufficient justification wikipedia , lookup
Psychological behaviorism wikipedia , lookup
Behaviorism wikipedia , lookup
Transcript
What is Learning? Psychology Ciccarelli and White -Any relatively permanent change in behavior based on experience or practice Chapter Five: Learning -Learning is not maturation. Maturation is change based biological processes. Classical Conditioning Ivan Pavlov Getting the terminology down: Classical Conditioning Stimulus Response Unconditioned Conditioned Classical Conditioning Conditioned stimulus (CS) – a stimulus that is repeatedly paired with the unconditioned stimulus so that the two become associated. Conditioned Response (CR) – is the same response as the unconditioned response, however, it results from the conditioned stimulus. The conditioned stimulus is always followed by the conditioned response. Unconditioned stimulus (UCS) – signal or trigger for the unconditioned response; this signal leads to a response that is NOT learned Unconditioned response (UCR) – the response is typically a natural response or emotion that is also NOT learned The unconditioned stimulus is always followed by the unconditioned response. Classical Conditioning Famous example: Pavlov’s dogs Neutral stimulus (NS): rhythmic clicking sound NS is paired with unconditioned stimulus (UCS): food UCS → Unconditioned response (UCR): salivation After conditioning: NS becomes a conditioned stimulus (CS) CS → Conditioned Response (CR): salivation 1 Classical Conditioning NS---UCS UCS → UCR NS → CS CS → CR ____________________________________ Ticking – Food Food → Salivation Ticking & Food become associated Ticking → Salivation Classical Conditioning Classical Conditioning Pavlov’s principles for classical conditioning What comes first, the chicken or the egg? The timing of the CS/NS and the UCS Time span Repeated exposure The CS must be distinctive Higher Order Conditioning – CS paired with NS and NS also becomes a CS General Classical Conditioning: If you salivate when you see food on T.V. If you feel fear when you hear a dentist’s drill Drug addicts/environmental stimuli Remember back to the chapter on sleep Study spots or study associations Other properties of Classical Conditioning Classical Conditioning It is everywhere! Stimulus generalization – similar stimuli to the original CS can also elicit the CR Stimulus discrimination – similar stimuli to the CS can be distinguished and do not elicit the CR Extinction – CS presented without UCS; the CS will no longer elicit the CR and will once again as a NS Spontaneous recovery – brief appearance of the CS and UCS association Classical Conditioning Exercise: Explain the case of Little Albert with the proper terminology NS1(clicking) —UCS (food)→UCR (salivation) CS1 (clicking) →CR (salivation) Higher Order Conditioning: CS1 (clicking) — NS2 (snapping) →CR (salivation) CS2 (snapping) →CR (salivation) First step: decide what each of the terms relate to in this example (NS, UCS, UCR, CS, CR) Second step: explain how it works using the terms in the proper order (hint: start with the NS) Apply classical conditioning to your life Give your own life example of classical conditioning Use the same steps as above 2 Classical Conditioning Phobia – acquired fear that is out of proportion to the actual threat Phobias – heights, dogs, snakes, the dark, public speaking, getting married, etc. Systematic desensitization – while experiencing fear the person/animal is also learning to relax (relaxation exercises) or experiencing something pleasant (ex: food). Classical Conditioning Can be displayed: CS (feared stimulus)→ CR1 (fear) CS (feared stimulus) — NS/CS (relaxation exercises) CS (feared stimulus) → CR2 (relaxation) Classical Conditioning Vicarious conditioning Watching others respond to a stimulus and then being conditioned to acquire the same response Sheep meat and coyotes Birds and the monarch butterfly Tigers and humans in India Which two of Pavlov’s principles are violated? Biological preparedness – associations are made to increase survival Pavlov: Biological Preparedness Examples: Ex: Children with shots Ex: Rappelling How does Classical Conditioning work? Conditioned Taste Aversion Classical Conditioning Stimulus substitution What is the problem with this theory? (hint: think about the four principles of classical conditioning) How does Classical Conditioning work? Expectancy: Cognition/mental activity Cognitive perspective. Rescorla (1998): CS had to predict that the UCS was coming Study with rats and tones This demonstrates that the rats responded to their expectation of the shock 3 Operant Conditioning or Instrumental Conditioning Operant conditioning B.F. Skinner Stimulus: the lever Response: pushing the lever Consequence: escape and food Law of effect Puzzle box experiments with cats Operant Conditioning Rewards Avoid punishment Edward Thorndike Operant Conditioning Voluntary – operate in the world (to have agency) Consequences Learning happens after a response Operant Conditioning Concepts in operant conditioning Reinforcement – a response that is strengthened and is likely to happen again Reward (something positive) Avoidance (something negative) In the Thorndike cat puzzle, what was reinforced? Reinforcers work well when: A behavior is reinforced immediately after a response Operant Conditioning Not all reinforcers are created equal Some are more powerful than others Determined: amount of time spent on a voluntary behavior ex: spending more time on a TV show or video game than homework (remember, learning can be associative) Only the desired behavior is reinforced (dog example) Operant Conditioning Primary & secondary reinforcers Primary – fulfills basic needs (ex: food, water, touch etc.) Secondary – reinforcers that are paired with primary reinforcers (ex: money, praise) Premack Theory Parents use an extension of this theory Secondary reinforcers work due to classical conditioning 4 Class Exercise Table of Differences In groups Discuss and write down how classical and operant conditioning are different/similar Table of Similarities Operant Conditioning Classical Conditioning Reinforcement should be immediate. Duration between stimuli (NS/CS & UCS) should be short. (< 5 sec) Expectancy develops for a reinforcement after a correct response. The CS is expected to come before the UCS, or in other words, to predict it. Extinction, generalization, and spontaneous recovery is possible in operant conditioning. Extinction, generalization, and spontaneous recovery is possible in classical conditioning. Class Exercise Operant Conditioning Classical Conditioning Increases a particular behavior (often already occurring) Creates a new association Voluntary Involuntary Consequences provide the information to make an association. Stimuli that come before a response create an association. Extinction happens when you remove the reinforcement. Extinction happens when you remove the UCS. Types of Reinforcement Schedules of Reinforcement Positive or Negative Reinforcement? 1. Simon Cowell on American Idol has said:: ―Do you really believe you can become an American Idol? Well then, you are deaf.‖ The singer began crying and never competed on American Idol again. Continuous reinforcement – reinforced after every correct response. 2. Ashton Kutcher went on David Letterman to encourage people to watch him on Two and a Half Men and the ratings for Two and a Half Men went up. Now he is looking for other talk shows he can make an appearance on to talk about Two and a Half Men. 3. Brandon Hantz, Russell Hantz’s nephew decided that he was going to be a hero on Survivor unlike his uncle who was considered a villain. He noticed that he seemed to be doing well in the game when he was kind to others. So he tried to be nice to people as a strategy to try to win the game. Positive reinforcement: addition of a positive or pleasurable consequence Negative reinforcement: removal of a negative or unpleasant consequence Key to remember: reinforcers increase the likelihood a particular response Partial Reinforcement – a correct response is only reinforced some of the time. 4. Brad Womack went on the Bachelor a second time because he wanted to clean up his image. He had been considered the worst bachelor of all time when he didn’t choose a bachelorette in his first season. A child receives a gold star after every assignment completely correctly. Factory worker gets paid for each part assembled correctly. An employee is paid weekly for a job well done everyday. A vacationer plays the slot machines in Los Vegas hoping for the big win. 5 Schedules of Reinforcement Advantage of continuous reinforcement – learning takes place quickly Advantage of partial reinforcement – responses are more resistant to extinction Disadvantage of continuous reinforcement – it is more susceptible to extinction than partial reinforcement Disadvantage of partial reinforcement – make take longer to learn. Partial Reinforcement Fixed interval schedule of reinforcement – reinforcement is: Partial Reinforcement Has a set span of time and reinforces at a predictable time (ex: paycheck every week; ex: Christmas for children) Has a span of time but it reinforces unpredictably (ex: pop quizzes; fishing; hunting) Interval refers to spans of time Ratio refers to number of responses Fixed refers a set amount of time or responses for reinforcement (predictable) Variable refers to differing amounts of time or responses for reinforcement (unpredictable) Partial Reinforcement Variable interval schedule of reinforcement – Terminology Fixed ratio schedule of reinforcement – the number of responses will always be the same number for the reinforcement (predictable) (ex: a little girl sells 10 boxes of girl scout cookies to win a prize) Variable ratio schedule of reinforcement – the number of responses changes for the reinforcement (unpredictable) (ex: slot machines, lotto) Punishment Punishment – a response that is weakened and is less likely to happen again Types of punishment Punishment by application – something is applied that is unpleasant (spanking, extra chores, writing sentences) Punishment by removal – something is removed that is considered valuable (grounding a teenager, reduction in privileges, taking away an allowance; people with DUIs have their licenses taken away) Class Exercise Respond to the following examples in the following slide. First, determine if the example is Reinforcement Punishment Then, determine if: Positive or negative reinforcer Punishment by application or punishment by removal 6 Class Exercise Reinforcement or Punishment? 1. In Extreme Makeover Home Edition, Ty Pennington and his team built a house for a family with a little boy who has a rare disease. His bones are susceptible to breaking. They put in elevators throughout the house. The boy uses the elevators without exception to move around the house to avoid strain on his bones. 2. In a few of the final seasons of Buffy, the Vampire Slayer many of her friends failed to recognize her leadership due to her affiliation with Spike, a vampire with a bad reputation. They looked to Faith, another vampire slayer for direction and leadership. 3. In the Harry Potter series, Dolores Umbridge made Harry write out that he must not tell lies on a sheet a paper with a special pen with no ink. Harry realized after writing a few sentences that the pen was not intended for the paper but instead, by magic, it was engraving the sentence on his hand! The problem with some types of punishment First of all, it may be easier to encourage behaviors than to eliminate them. Punishment may be temporary 4. When Ellen began her own talk show she decided to start the show by dancing. The audience members decided to dance as well. It seemed to be a popular way to begin the show, so Ellen continues to open her show with an impromptu dance. A child continues a behavior when someone is not looking A child stops when punished but continues the behavior later Some types of punishment can become abuse Punishment Punishment Punishment that may lead to abuse (severe spankings, removal of food, etc.) An association is made between the person who is delivering the punishment and the punishment instead of associating the undesired behavior with the punishment Punishment may not weaken the response because other negative reinforcements are used to continue with the behavior and avoid the punishment (successful lying) How can punishment be effective? Most child development specialists recommend punishment by removal There should be a short duration between the undesired behavior and the punishment Punishment should be consistent Follow through Intensity Punishment/correct behavior Fear and anxiety really don’t help anyone learn anything except fear and anxiety. Abusive behavior creates a model for aggression Cultural point: Japanese – positive/negative reinforcement Austria, Denmark, Israel, and Italy have banned physical punishment in schools and homes More concepts in Operant Conditioning Discriminate stimulus – a stimulus that cues behavior for a particular reinforcement (see a cop car/starts raining and you slow down in your car) Shaping – a type of operant conditioning in which an ultimate complex goal is reached using small steps. The purpose of shaping is to mold behavior. The small steps are referred to as successive approximation. 7 More Concepts in Operant Conditioning Shaping is used in a variety of ways/settings: Train dogs for obedience or to do tricks Train dogs for people with handicaps or to sniff out drugs During the Iraq war, dolphins were used by the U.S. Navy to search for explosives in the Persian Gulf. More Concepts in Operant Conditioning More Concepts in Operant Conditioning Extinction, generalization, and spontaneous recovery Extinction: removal of the consequence Generalization Biofeedback (involuntary responses) Neurofeedback (brain wave activity) Constraints on Operant Conditioning Instinctive drift Pigs picking up coins; rooting Obvious constraint: limits to various capabilities ex: child who calls everyone and everything as ―Dada‖ ex: dog trying to perform old tricks Cognitive Learning Theory Cognitive Learning Theory Analysis of current behavior Behavior: socially relevant Spontaneous recovery Behavior modification – operant conditioning typically to change pre-existing behavior Applied behavior analysis (ABA) Edward Tolman Wolfgang Kohler Martin Seligman Cognitive Learning Theory Kohler and his chimps Food outside of cage Two sticks that fit together Insight or problem solving Tolman and his rats Three groups of rats Latent learning 8 Cognitive Learning Theory Seligman and his dogs Shocking dogs with no escape When given an escape, they found Learned helplessness Fear interfered with the ability to learn Depression Cognitive Learning Theory Observational Learning A model demonstrates a behavior and the viewer learns (whether or not they demonstrate the behavior) Albert Bandura and the Bobo Doll Modeled/Aggression or not Reward/Punishment Cognitive Learning Theory Observational Learning Attention Memory Imitation Motivation 9