* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download operant conditioning
Survey
Document related concepts
Transcript
Chapter 5 5th Edition Learning Copyright © Prentice Hall 2007 5-1 What is Learning? • Learning is a relatively permanent change in behavior or the potential to make a response that occurs as a result of experience. Copyright © Prentice Hall 2007 5-2 Classical Conditioning • Classical conditioning is a form of learning that occurs when two stimuli—a neutral stimulus and an unconditioned stimulus—that are paired (presented together) become associated with each other. Copyright © Prentice Hall 2007 5-3 Classical Conditioning • A neutral stimulus (NS) is a stimulus that, before conditioning, does not elicit a particular response. • An unconditioned stimulus (UCS) is a stimulus that automatically produces a response without any previous training. Copyright © Prentice Hall 2007 5-4 Classical Conditioning • A conditioned stimulus (CS) is a neutral stimulus that acquires the ability to elicit a conditioned response after being paired with an unconditioned stimulus. • An unconditioned response (UCR) is a reaction that is automatically produced when an unconditioned stimulus is presented. Copyright © Prentice Hall 2007 5-5 Classical Conditioning • A conditioned response (CR) is a response elicited by a conditioned stimulus that has been paired with an unconditioned stimulus; it is similar to the unconditioned response. • Several pairings of the NS and UCS during an acquisition phase lead to a situation in which the CS presented by itself elicits a CR. Copyright © Prentice Hall 2007 5-6 Classical Conditioning Copyright © Prentice Hall 2007 5-7 Classical Conditioning • Acquisition is the training stage during which a particular response is learned (occurs after a CS is presented). • Several factors influence the acquisition of CRs. • Among them are the order in which the CS and UCS are presented, the intensity of the UCS, and the number of times the CS and UCS are paired. Copyright © Prentice Hall 2007 5-8 Classical Conditioning • The optimum sequence is for the CS to precede the UCS (by about .50 second). • The stronger the UCS, the stronger the conditioning. • The more times the CS and UCS are presented together, the stronger the CR becomes. Copyright © Prentice Hall 2007 5-9 Classical Conditioning Acquisition patterns in classical conditioning when the UCS is presented at different percentages on trials (a series of presentations of stimuli). Copyright © Prentice Hall 2007 5-10 Classical Conditioning • Extinction is a general term for a reduction and eventual disappearance of a behavior; in the case of classical conditioning, extinction occurs when repeated presentation of the CS alone leads to a decrease in the strength of the CR. Copyright © Prentice Hall 2007 5-11 Classical Conditioning • Spontaneous recovery is the reappearance of an extinguished CR after the passage of time. Copyright © Prentice Hall 2007 5-12 Classical Conditioning • Generalization is the occurrence of responses to stimuli that are similar to a CS. • Discrimination is the occurrence of responses only to a specific CS. Copyright © Prentice Hall 2007 5-13 Classical Conditioning • John Watson and Rosalie Rayner demonstrated that emotions can be learned by classically conditioning 9-month-old Little Albert to fear a white rat. Copyright © Prentice Hall 2007 5-14 Classical Conditioning • Conditioning Little Albert to fear a white rat. • The research conducted by Watson and Rayner would not be considered ethical by present-day standards. Copyright © Prentice Hall 2007 5-15 Classical Conditioning • Many of our fears and anxieties may have been classically conditioned. • A phobia is an irrational fear of an activity, object, or situation that is out of proportion to the actual danger it poses. Copyright © Prentice Hall 2007 5-16 Classical Conditioning • If a potential attitude object (CS) such as a particular make of car is repeatedly paired with a UCS that is either positive or negative, an attitude is likely to form. • Motives that are acquired through the process of classical conditioning are called learned motives. • Many of the goals and incentives that motivate our behavior are learned through classical conditioning; hence they are termed learned goals (incentives). Copyright © Prentice Hall 2007 5-17 Classical Conditioning • Our understanding of classical conditioning has been subject to revision since Pavlov introduced the basic processes. • One principle that has emerged from this continued research is that the better the CS predicts the occurrence of the UCS, the stronger the conditioning will be. • Blocking is a situation in which the conditionability of a CS is weakened when it is paired with a UCS that has previously been paired with another CS. Copyright © Prentice Hall 2007 5-18 Classical Conditioning • Previous trials of a CS-UCS pairing can serve to block the effectiveness of a second CS. Copyright © Prentice Hall 2007 5-19 Classical Conditioning • Taste-aversion learning refers to the development of a dislike or aversion to a flavor or food that has been paired with illness. • Preparedness is the theory that organisms are biologically ready or prepared to associate certain CSs with certain UCSs. Copyright © Prentice Hall 2007 5-20 Operant Conditioning • In operant conditioning, also known as instrumental conditioning, an organism operates on its environment to produce a change. • Edward Thorndike studied the behavior of hungry animals by placing them in a small chamber he called a puzzle box. • The law of effect is Thorndike’s view that reinforcers promote learning, whereas punishers lead to the unlearning of responses. Copyright © Prentice Hall 2007 5-21 Operant Conditioning • B. F. Skinner was strongly influenced by Edward Thorndike, as well as by John B. Watson’s behavioral view of psychology. • Watson believed that if we could understand how to predict and control behavior, we would know all there was to know about psychology. • Skinner therefore began to look for the stimuli that control behavior. Copyright © Prentice Hall 2007 5-22 Operant Conditioning • To isolate those effects, he developed a special testing environment called an operant conditioning chamber, which is usually referred to as a Skinner box. Copyright © Prentice Hall 2007 5-23 Operant Conditioning • A reinforcer is an event or stimulus that increases the frequency of the response that it follows. • Positive reinforcers are events or stimuli such as food, water, money, and praise that are presented after the target response occurs. • Negative reinforcers are events or stimuli that are removed because a response has occurred. Copyright © Prentice Hall 2007 5-24 Operant Conditioning Copyright © Prentice Hall 2007 5-25 Operant Conditioning • A primary reinforcer is an event or stimulus that has innate (that is, biological) reinforcing properties; you do not have to learn that such stimuli are reinforcers. • A secondary reinforcer is a stimulus that acquires reinforcing properties by being associated with a primary reinforcer. Copyright © Prentice Hall 2007 5-26 Operant Conditioning • Skinner coined the term operant conditioning because the behaviors we emit voluntarily operate on the environment around us in some way. • The resulting changes in the environment determine what happens to a given target behavior (response). Copyright © Prentice Hall 2007 5-27 Operant Conditioning • Positive reinforcement occurs when a target behavior (response) is followed by presentation of a positive reinforcer, which has the effect of making the behavior more likely to occur in the future. Copyright © Prentice Hall 2007 5-28 Operant Conditioning • Shaping is a form of operant conditioning in which a desired response is taught by reinforcement of successive responses that more closely resemble the target response. • The Premack Principle, refers to an operant conditioning technique in which the opportunity to participate in a preferred activity reinforces a less preferred activity. Copyright © Prentice Hall 2007 5-29 Operant Conditioning • Negative reinforcement occurs when a target behavior (response) is followed by removal or reduction of a negative reinforcer, which leads to an increase in the frequency of the target behavior. Copyright © Prentice Hall 2007 5-30 Operant Conditioning • Escape conditioning occurs when certain stimuli, whose removal immediately after the occurrence of a response, increase the likelihood of that response. • Avoidance conditioning occurs when a behavior increases in frequency if it prevents an aversive stimulus from occurring. Copyright © Prentice Hall 2007 5-31 Operant Conditioning • Extinction is a general term for the reduction and elimination of behaviors. • In classical conditioning, extinction occurs when repeated presentation of the CS alone leads to a reduction in the strength of the CR. • In operant conditioning, extinction occurs when a behavior is no longer followed by a reinforcer. Copyright © Prentice Hall 2007 5-32 Operant Conditioning • Extinction burst means that a behavior often increases once extinction begins before it is finally eliminated. • Bringing a behavior under stimulus control means that a particular stimulus or signal tells the participant that its responses will be reinforced. • A discriminative stimulus is a stimulus or signal telling the participant that responding will be reinforced. Copyright © Prentice Hall 2007 5-33 Operant Conditioning • A preset pattern or plan for delivering reinforcement, a schedule of reinforcement, is an important determinant of behavior. • Once a target response has been shaped, the experimenter can arrange to have the reinforcer delivered according to a specific schedule. • When evaluating various schedules of reinforcement, the two dimensions of interest are (1) rate of responding, and (2) resistance to extinction. Each schedule can be described in terms of these two characteristics. Copyright © Prentice Hall 2007 5-34 Operant Conditioning • The most basic schedule of reinforcement is continuous reinforcement, in which the participant is given a reinforcer after each target response occurs. • In schedules of reinforcement that do not involve the use of continuous reinforcement, some responses are not reinforced. Copyright © Prentice Hall 2007 5-35 Operant Conditioning • The term intermittent (or partial) reinforcement describes these noncontinuous patterns of delivering reinforcement. • There are two main types of intermittent schedules: ratio and interval. Copyright © Prentice Hall 2007 5-36 Operant Conditioning • When a ratio schedule is in effect, the number of responses determines whether the participant receives reinforcement. • When an interval schedule is in effect, responses are reinforced only after a certain interval of time has passed. Copyright © Prentice Hall 2007 5-37 Operant Conditioning • Fixed-ratio (FR) schedules require that a set number of responses be made before a reinforcer is delivered. • Variable-ratio (VR) schedules require that the participant perform differing numbers of responses to obtain a reinforcer. Copyright © Prentice Hall 2007 5-38 Operant Conditioning • With a fixed-interval (FI) schedule, the time interval is constant. • The time interval changes after each reinforcer is delivered when a variable-interval (VI) schedule is used. • Ratio schedules generally produce higher rates of responding than interval schedules. Copyright © Prentice Hall 2007 5-39 Operant Conditioning • Operant responses that are not reinforced each time during training take much longer to extinguish than ones that have received continuous reinforcement. • This phenomenon is known as the partial reinforcement effect. Copyright © Prentice Hall 2007 5-40 Operant Conditioning • A punisher is a stimulus that produces a decrease in responding and may take the form of presentation of a stimulus or termination of a stimulus. Copyright © Prentice Hall 2007 5-41 Operant Conditioning • Punishment is the process of using a punisher to decrease the response rate. • Punishment is not an especially effective means of altering behavior. Copyright © Prentice Hall 2007 5-42 Cognitive and Social Perspectives On Learning • Observational learning (modeling) is learning that occurs through watching and imitating the behaviors of others. • For almost five decades, researchers have been accumulating correlational and experimental evidence on the relation between exposure to media violence and real-life aggression. Copyright © Prentice Hall 2007 5-43 Bandura’s Bobo Doll Study Copyright © Prentice Hall 2007 5-44 Bobo Doll (II) • Results: Observing the aggressive model increased the frequency of aggressive acts in the experimental group • Also: When the control group was led to believe that aggressive behavior would lead to reinforcement, they were able to imitate the aggressive model • Implication: Even if we do not immediate imitate aggressive acts, we are learning to do so, and can demonstrate this knowledge when we believe it is to our benefit to do so. Copyright © Prentice Hall 2007 5-45 Cognitive and Social Perspectives On Learning • The general conclusion from the accumulated evidence is: Violent media is causally related to short-term and longterm expressions of aggression. • One of the consequences of this exposure to media violence is desensitization, which is a reduction in distress-related physiological reactivity to observations or thoughts of violence. Copyright © Prentice Hall 2007 5-46 Cognitive and Social Perspectives On Learning • Although there are major differences between songs and visual media such as television, research has established that aggressive words can prime aggressive thoughts, perceptions, and behavior, hence the concern for the potential effects of violent lyrics on songs. Copyright © Prentice Hall 2007 5-47 Cognitive and Social Perspectives On Learning • One of the keys to observational learning appears to be that the participant identifies with the person being observed. • If we put ourselves in the other person’s place for a moment, we are better able to imagine the effects of the reinforcer or punisher. • This phenomenon is called vicarious reinforcement or vicarious punishment. Copyright © Prentice Hall 2007 5-48