Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Review What is definition? What is US? What is UR? What is NS? What is NS paired with? What is CS? What is CR? What is extinction? How is it achieved? What is discrimination? What is generalization? Who is associated with this type of learning? A. Introduction Definition: What is operant conditioning? Learning from the consequences of behavior; a certain action is reinforced or punished and the behavior is increased or decreased. Difference from Classical Conditioning: In CC the experimenter presents the CS independent of behavior; in OC the participant must engage in a behavior in order for the programmed outcome to occur, that is, the behavior is voluntary. Psychologist associated with OC is: B.F. Skinner To increase behavior, reinforcement is presented Reinforcement is a stimulus or event that increases the repetition of a behavior Example: Bird in Skinner Box Social Approval Money for grades Give me an example of how you would teach a dog to sit up Name the reinforcement: If it increases behavior, what is it called? Primary—one that satisfies a biological need such as hunger, thirst, or sleep Secondary—one that has been paired with a primary reinforcer and through CC has acquired value and the ability to reinforce Best example of secondary reinforcer in society is: Money Name some others: praise, status, honor Timing and frequency very important Behavior that is reinforced every time is on : a continuous schedule Behavior that is reinforced intermittently is on: a partial schedule Which do you think would make the behavior last longer? Why? 1. FIXED-RATIO Reinforcement depends on a specified quantity of responses, i.e. every third response 2. VARIABLE-RATIO Does not require that a set number of responses be made. Differs each trial. 3. FIXED-INTERVAL First correct response after a specified amount of time gets rewarded 4. VARIABLE-INTERVAL The time at which the reinforcement is given changes What are the four schedules? How do they work? Ratio schedules are based on : Interval schedules are based on: Give example of each Shaping: a process in which reinforcement is used to sculpt new responses out of old ones. Used to train animals Chaining: Responses that follow one another in a sequence. Each response signals the next. Used to learn complex skills--swimming This occurs when behavior is negatively reinforced or punished A painful or unpleasant stimulus is removed Examples: stone in shoe, fear, disapproval Note: it follows and negates, takes away a negative stimulus Uses: 1. escape conditioning—a person’s behavior causes an unpleasant event to stop— whining when served liver 2. avoidance conditioning—the behavior has the effect of preventing an unpleasant situation from happening—whining before liver served An unpleasant consequence occurs and decreases the frequency of the behavior Note: opposite of negative reinforcement: in neg reinforcement the behavior increases, in punishment, the behavior decreases What do you think? Unwanted side effects: rage, aggression, fear People learn to avoid the person delivering the aversive consequences Punishment does not teach the appropriate behavior