* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Operant Conditioning
Learning theory (education) wikipedia , lookup
Bullying and emotional intelligence wikipedia , lookup
Prosocial behavior wikipedia , lookup
Behavioral modernity wikipedia , lookup
Abnormal psychology wikipedia , lookup
Observational methods in psychology wikipedia , lookup
Symbolic behavior wikipedia , lookup
Neuroeconomics wikipedia , lookup
Thin-slicing wikipedia , lookup
Transtheoretical model wikipedia , lookup
Attribution (psychology) wikipedia , lookup
Parent management training wikipedia , lookup
Theory of planned behavior wikipedia , lookup
Sociobiology wikipedia , lookup
Theory of reasoned action wikipedia , lookup
Descriptive psychology wikipedia , lookup
Applied behavior analysis wikipedia , lookup
Verbal Behavior wikipedia , lookup
Psychological behaviorism wikipedia , lookup
Classical conditioning wikipedia , lookup
Behavior analysis of child development wikipedia , lookup
Insufficient justification wikipedia , lookup
III. Operant Conditioning E.L. Thorndike and B.F. Skinner Operant Conditioning A. At the same time that Pavlov (and later Watson) was experimenting with what was to be known as “Classical” conditioning, E.L. Thorndike was experimenting with “Operant” conditioning or “Instrumental” Conditioning. His research served as the basis for B.F. Skinner’s research. Edward L. Thorndike ( 1874–1949) Operant Conditioning B. Describe a puzzle box: Clip - http://www.youtube.com/watch?v=yigW-izs8oc Law of Effect Thorndike’s principle that: 1. Behaviors followed by favorable consequences become more likely 2. Behaviors followed by unfavorable consequences become less likely Early Operant Conditioning E. L. Thorndike (1898) Puzzle boxes and cats First Trial in Box Situation: stimuli inside of puzzle box Scratch at bars Push at ceiling Dig at floor Howl Etc. After Many Trials in Box Situation: stimuli inside of puzzle box Scratch at bars Push at ceiling Dig at floor Howl Etc. Etc. Etc. Press lever Press lever Operant Conditioning B.F. Skinner (1904-1990) elaborated Thorndike’s Law of Effect developed behavioral technology B. F. Skinner (1904–1990) Operant Conditioning Operant Conditioning type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment *Instrumental because you use an “instrument” (reinforcement or punishment) to shape behavior. Operant Conditioning Operant Behavior operates (acts) on environment produces consequences Respondent Behavior occurs as an automatic response to stimulus behavior learned through classical conditioning Operant Chamber Skinner Box chamber with a bar or key that an animal manipulates to obtain a food or water reinforcer contains devices to record responses What is the instrument? What behavior is strengthened? Instruments of Conditioning Types of Conditioning Instruments Positive (Stimulus is Given) Reinforcement (Increases Desired Behavior) Give Something Increase Behavior Negative (Stimulus is Removed) Remove Something Increase Behavior Punishment (Decreases Undesirable Behavior) Give Something Decrease Behavior Remove Something Decrease Behavior Our Class: Experiment 1: What is the instrument? Experiment 2: What is the instrument? Which worked best? Operant Conditioning Classroom Practice Sheldon Choc 1.mp4 Sheldon Choc 2.mp4 Sheldon Choc 3.mp4 1. What is the desired behavior? 2. What is the instrument? https://www.youtube.com/watch?v=LhI5h5 JZi-U Operant Conditioning In shaping, successively closer versions of a desired response are reinforced (as in learning to play tennis). In chaining, each part of a sequence is reinforced; the different parts are put together into a whole (as in learning the steps to a dance). Punishment Punishment aversive event that decreases the behavior that it follows powerful controller of unwanted behavior Punishment Problems with Punishment Does not teach or promote alternative, acceptable behavior May produce undesirable results such as hostility, passivity, fear Likely to be temporary May model aggression Samples Positive Reinforcement Negative Reinforcement Positive Punishment Negative Punishment Operant Conditioning Processes Primary Reinforcement is unlearned and usually necessary for survival. Food is the best example of a primary reinforcer. Examples? Secondary Reinforcement is anything that comes to represent a primary reinforcer such as praise from a friend or a gold star on a homework assignment. Also called conditioned reinforcer. Examples? Common Terminology Classical Operant Acquisition – repeatedly pair CS with UCS Acquisition – behavior repeatedly FOLLOWED BY reinforcement/punishment Extinction – CS no longer paired with UCS, CR extinguished Extinction – behavior no longer reinforced/punished so extinguished Spontaneous Recovery – After period of extinction, CR returns in presence of CS Spontaneous Recovery – behavior suddenly reappears (after extinguished) in presence of reinforcer Generalization – Will respond to stimuli similar to CS Generalization – will respond to reinforcement/punishment similar to original Discrimination – Will ONLY respond to CS Discrimination – will ONLY respond to original reinforcement/punishment Schedules of Reinforcement Immediate Reinforcers To our detriment, small but immediate reinforcements are sometimes more alluring than big, but delayed reinforcements Continuous Reinforcement reinforcing the desired response each time it occurs Partial/Delayed/Intermittent Reinforcement reinforcing a response only part of the time results in slower acquisition greater resistance to extinction Schedules of Reinforcement Fixed Ratio (FR) reinforces a response only after a specified number of responses faster you respond the more rewards you get different ratios very high rate of responding like piecework pay Schedules of Reinforcement Variable Ratio (VR) reinforces a response after an unpredictable number of responses average ratios like gambling, fly fishing very hard to extinguish because of unpredictability Schedules of Reinforcement Fixed Interval (FI) reinforces a response only after a specified time has elapsed response occurs more frequently as the anticipated time for reward draws near Schedules of Reinforcement Variable Interval (VI) reinforces a response at unpredictable time intervals produces slow steady responding like pop quiz, fishing Practice Worksheet Fixed or Variable? Ratio or Interval? Consideration of Future Consequences Scale 1 2 3 4 5 = = = = = Extremely Uncharacteristic Somewhat Uncharacteristic Uncertain Somewhat Characteristic Extremely Characteristic Number 1 - 12 Delay of Gratification Delay of Gratification Scale Marshmallow Study Walter Mischel(Columbia University) Tracked children longitudinally Kids that could wait went on to achieve more Marshmallow Test