Download Learning

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Verbal Behavior wikipedia , lookup

Behavior analysis of child development wikipedia , lookup

Insufficient justification wikipedia , lookup

Behaviorism wikipedia , lookup

Psychological behaviorism wikipedia , lookup

Psychophysics wikipedia , lookup

Classical conditioning wikipedia , lookup

Operant conditioning wikipedia , lookup

Transcript
Chapter 6
Learning
This multimedia product and its contents are protected under copyright law. The following are prohibited by law: any public performance or display,
including transmission of any image over a network; preparation of any derivative work, including the extraction, in whole or part, of any images; any
rental, lease, or lending of the program. ISBN: 0-205-42428-7
1
Copyright © Allyn and Bacon 2006
Learning
Learning –
A process through which experience
produces lasting change in behavior or
mental processes
Behavioral learning –
Forms of learning that can be described
in terms of stimuli and responses
(e.g. classical and operant conditioning)
2
Copyright © Allyn and Bacon 2006
What Sort of Learning Does
Classical Conditioning Explain?
Classical conditioning is a
basic form of learning in which
a stimulus that produces an
innate reflex becomes
associated with a previously
neutral stimulus, which then
acquires the power to elicit
essentially the same response
3
Copyright © Allyn and Bacon 2006
The Essentials of Classical
Conditioning
Neutral stimulus –
Any stimulus that produces no
conditioned response prior to learning
Acquisition –
Initial learning stage in classical
conditioning; conditioned response
becomes elicited by the conditioned
stimulus
4
Copyright © Allyn and Bacon 2006
The Essentials of Classical
Conditioning
Unconditioned
stimulus (UCS)
Unconditioned
response (UCR)
Conditioned
stimulus (CS)
Conditioned
response (CR)
5
Copyright © Allyn and Bacon 2006
The Essentials of Classical
Conditioning
Unconditioned
stimulus (UCS)
Unconditioned
response (UCR)
The stimulus that elicits
an unconditioned
response
Conditioned
stimulus (CS)
Conditioned
response (CR)
6
Copyright © Allyn and Bacon 2006
The Essentials of Classical
Conditioning
Unconditioned
stimulus (UCS)
Unconditioned
response (UCR)
Conditioned
stimulus (CS)
The response elicited
by an unconditioned
stimulus without prior
learning
Conditioned
response (CR)
7
Copyright © Allyn and Bacon 2006
The Essentials of Classical
Conditioning
Unconditioned
stimulus (UCS)
Unconditioned
response (UCR)
Conditioned
stimulus (CS)
A previously neutral
stimulus that comes to
elicit the conditioned
response
Conditioned
response (CR)
8
Copyright © Allyn and Bacon 2006
The Essentials of Classical
Conditioning
Unconditioned
stimulus (UCS)
Unconditioned
response (UCR)
Conditioned
stimulus (CS)
Conditioned
response (CR)
A response elicited by a
previously neutral
stimulus that has
become associated with
the unconditioned
stimulus
9
Copyright © Allyn and Bacon 2006
Classical Conditioning
Prior to conditioning
Neutral stimulus
(tone)
(Orientation to sound
but no response)
Unconditioned stimulus
(food powder in mouth)
Unconditioned response
(salivation)
Conditioning
Neutral stimulus
CS (tone)
+
Unconditioned stimulus
(food powder)
Conditioned response
(salivation)
After conditioning
Conditioned stimulus
(tone)
Conditioned response
(salivation)
10
Copyright © Allyn and Bacon 2006
Classical Conditioning
Extinction –
Weakening of a conditioned association
in the absence of an unconditioned
stimulus or reinforcer
Spontaneous recovery –
Reappearance of an extinguished
conditioned response after a time delay
11
Copyright © Allyn and Bacon 2006
(1)
Acquisition
(CS + UCS)
(2)
Extinction
(CS alone)
Rest period
Strength of the CR
(Weak)
(Strong)
Acquisition, Extinction, and
Spontaneous Recovery
(3)
Spontaneous
Recovery
(CS alone)
(Time)
Trials
12
Copyright © Allyn and Bacon 2006
Classical Conditioning:
Generalization and Discrimination
Stimulus generalization involves giving a
conditioned response to stimuli that are
similar to the CS
Stimulus discrimination involves
responding to one stimulus but
not to stimuli that are similar
Confusing stimuli may cause
experimental neurosis
13
Copyright © Allyn and Bacon 2006
Applications of Classical Conditioning
Taste-aversion learning –
Biological tendency in which an organism
learns to avoid food with a certain taste
after a single experience, if eating it is
followed by illness
14
Copyright © Allyn and Bacon 2006
A Challenge to Pavlov
Why are some stimuli-consequence
combinations readily learned while other
combinations are highly resistant to
learning?
What any organism can or cannot learn in
a given setting is due in part to its
evolutionary history
15
Copyright © Allyn and Bacon 2006
How Do We Learn
New Behaviors by
Operant Conditioning?
In operant conditioning, the
consequences of behavior,
such as rewards and
punishments, influence the
chance that our behavior will
occur again
16
Copyright © Allyn and Bacon 2006
Skinner’s Radical Behaviorism
B.F. Skinner believed that the most
powerful influences on behavior are its
consequences
17
Copyright © Allyn and Bacon 2006
How Do We Learn
New Behaviors by
Operant Conditioning?
Trial-and-error learning –
Learner gradually discovers the correct
response by attempting many behaviors
and noting which ones produce the
desired consequences
18
Copyright © Allyn and Bacon 2006
The Power of Reinforcement
Positive reinforcers –
Stimulus presented after a response that
increases the probability of that response
happening again
Negative reinforcers –
Removal of an unpleasant stimulus,
contingent on a particular behavior
19
Copyright © Allyn and Bacon 2006
The Power of Reinforcement
Primary reinforcers –
Reinforcers, such as food and sex, that
have an innate basis because of their
biological value to an organism
20
Copyright © Allyn and Bacon 2006
The Power of Reinforcement
Secondary reinforcers –
Stimuli, such as money or tokens, that
acquire their reinforcing power by their
learned association with primary
reinforcers
(also called conditioned reinforcers)
21
Copyright © Allyn and Bacon 2006
Contingencies of Reinforcement
Continuous reinforcement –
Reinforcement schedule in which all
correct responses are reinforced
Partial reinforcement –
Reinforcement schedule in which some,
but not all, correct responses are
reinforced
(also called intermittent reinforcement)
22
Copyright © Allyn and Bacon 2006
Class Demonstration
Establishing a behavior using operant
conditioning
• Select a target behavior; be very specific
• Using shaping  reinforcing successive
approximations of the behavior
• Be careful to reinforce only the target behavior
23
Copyright © Allyn and Bacon 2006
Contingencies of Reinforcement
Extinction –
In operant conditioning, a process by
which a response that has been learned
is weakened by the absence or removal
of reinforcement
How does this differ from extinction in
classical conditioning?
24
Copyright © Allyn and Bacon 2006
Schedules of Reinforcement
Ratio schedules –
Provide reward after a certain number of
responses
Interval schedules –
Provide reward after a certain time interval
Fixed Ratio (FR)
Variable Ratio (VR)
Fixed Interval (FI)
Variable Interval (VI)
25
Copyright © Allyn and Bacon 2006
Schedules of Reinforcement
Fixed Ratio (FR)
Variable Ratio
(VR)
Fixed Interval
(FI)
Rewards appear after a
certain set number of
responses
e.g. factory workers
getting paid after every
10 cases of product are
completed
Variable Interval
(VI)
26
Copyright © Allyn and Bacon 2006
Schedules of Reinforcement
Fixed Ratio (FR)
Variable Ratio
(VR)
Fixed Interval
(FI)
Rewards appear after a
certain number of
responses, but that
number varies from trial
to trial
e.g. slot machine payoffs
Variable Interval
(VI)
27
Copyright © Allyn and Bacon 2006
Schedules of Reinforcement
Fixed Ratio (FR)
Variable Ratio
(VR)
Fixed Interval
(FI)
Variable Interval
(VI)
Rewards appear after a
certain fixed amount of
time, regardless of
number of responses
e.g. weekly or monthly
paychecks
28
Copyright © Allyn and Bacon 2006
Schedules of Reinforcement
Fixed Ratio (FR)
Variable Ratio
(VR)
Fixed Interval
(FI)
Variable Interval
(VI)
Rewards appear after a
certain amount of time,
but that amount varies
from trial to trial
e.g. random visits from
the boss who delivers
praise
29
Copyright © Allyn and Bacon 2006
The Problem of Punishment
Punishment –
An aversive stimulus which diminishes
the strength of the response it follows
How does this differ from negative
reinforcement?
30
Copyright © Allyn and Bacon 2006
Punishment vs. Negative
Reinforcement
Negative
Reinforcement
Response
Consequence
Loud Noise
Press Lever
Loud Noise
Removed
Press Lever
Loud Noise
Applied
Punishment
No Noise
31
Copyright © Allyn and Bacon 2006
Types of Punishment
Positive punishment –
The application of an aversive stimulus
after a response
Negative punishment –
The removal of an attractive stimulus
after a response
32
Copyright © Allyn and Bacon 2006
Four Kinds of Consequences
GOAL
STIMULUS
Increase
Behavior
Decrease
Behavior
Positive
(Add)
Negative
(Subtract)
Positive
Reinforcement
Negative
Reinforcement
Bonus for working hard Aspirin relieving headache
leads to more hard work leads to more aspirin use
Positive
Punishment
Positive
Punishment
Getting speeding ticket
leads to less speeding
Missing dinner leads to
less staying out late
33
Copyright © Allyn and Bacon 2006
Problems of Punishment
Power usually disappears when threat of
punishment is removed
Punishment
Often triggers aggression
May inhibit learning new and better responses
Is often applied unequally
When does punishment work?
34
Copyright © Allyn and Bacon 2006
Alternatives to Punishment
Extinction
Reinforcing preferred activities
Premack principle
Prompting and shaping
35
Copyright © Allyn and Bacon 2006
Operant and Classical Conditioning
Compared
Classical conditioning involves the
association of two stimuli (UCS + CS)
before the response or behavior
Operant conditioning involves a reinforcing
(reward) or punishing stimulus after a
response or behavior
36
Copyright © Allyn and Bacon 2006