Classical Conditioning

Review







What is the definition of classical conditioning?
What is US?
What is UR?
What is NS?
What is NS paired with?
What is CS?
What is CR?




What is extinction?
How is it achieved?
What is discrimination?
What is generalization?
Who is associated with this type of learning?

A. Introduction


Definition: What is operant conditioning?
Learning from the consequences of behavior; a
certain action is reinforced or punished and the
behavior is increased or decreased.
Difference from Classical Conditioning:
In CC the experimenter presents the CS independent
of behavior; in OC the participant must engage in a
behavior in order for the programmed outcome to
occur, that is, the behavior is voluntary.
Psychologist associated with OC is:
B.F. Skinner
To increase behavior, reinforcement is presented
Reinforcement is a stimulus or event that
increases the repetition of a behavior
Examples: bird in a Skinner box, social approval, money for grades



Give me an example of how you would teach a
dog to sit up
Name the reinforcement: If it increases
behavior, what is it called?



Primary reinforcer: one that satisfies a biological need such
as hunger, thirst, or sleep
Secondary reinforcer: one that has been paired with a
primary reinforcer and through CC has acquired
value and the ability to reinforce
Best example of secondary reinforcer in society is:
Money
Name some others: praise, status, honor
Timing and frequency of reinforcement are very important
Behavior that is reinforced every time is on a
continuous schedule
Behavior that is reinforced intermittently is on
a partial schedule
Which do you think would make the behavior last
longer? Why?

1. FIXED-RATIO

Reinforcement depends
on a specified number
of responses, e.g., every
third response
2. VARIABLE-RATIO
The number of responses
required for reinforcement
is not fixed; it differs
from trial to trial.
3. FIXED-INTERVAL

The first correct response
after a specified amount
of time is reinforced
4. VARIABLE-INTERVAL
The amount of time that
must pass before a response
is reinforced changes
from trial to trial
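
The four schedules above can be sketched as a small simulation. This is only an illustration, not part of the original slides: the class names, the respond(t) method, and the way the variable schedules pick their next requirement (a random draw around a chosen average) are all assumptions made for the example.

```python
import random


class FixedRatio:
    """Reinforce every n-th response (e.g., every third response)."""

    def __init__(self, n):
        self.n = n
        self.count = 0

    def respond(self, t):
        self.count += 1
        if self.count == self.n:
            self.count = 0
            return True
        return False


class VariableRatio:
    """Reinforce after a number of responses that differs each trial.

    Assumption: the requirement is drawn uniformly from 1..(2*mean_n - 1).
    """

    def __init__(self, mean_n, rng):
        self.mean_n = mean_n
        self.rng = rng
        self.count = 0
        self.target = self._draw()

    def _draw(self):
        return self.rng.randint(1, 2 * self.mean_n - 1)

    def respond(self, t):
        self.count += 1
        if self.count >= self.target:
            self.count = 0
            self.target = self._draw()
            return True
        return False


class FixedInterval:
    """Reinforce the first response after a fixed amount of time has passed."""

    def __init__(self, interval):
        self.interval = interval
        self.last = 0.0  # time of the last reinforcement

    def respond(self, t):
        if t - self.last >= self.interval:
            self.last = t
            return True
        return False


class VariableInterval:
    """Reinforce the first response after a waiting time that changes.

    Assumption: the waiting time is drawn uniformly from 0..(2*mean_interval).
    """

    def __init__(self, mean_interval, rng):
        self.mean_interval = mean_interval
        self.rng = rng
        self.last = 0.0
        self.wait = self._draw()

    def _draw(self):
        return self.rng.uniform(0, 2 * self.mean_interval)

    def respond(self, t):
        if t - self.last >= self.wait:
            self.last = t
            self.wait = self._draw()
            return True
        return False


# Six responses on a fixed-ratio schedule of 3: the 3rd and 6th are reinforced.
fr = FixedRatio(3)
print([fr.respond(t) for t in range(1, 7)])
# -> [False, False, True, False, False, True]
```

Ratio schedules count responses and ignore the clock; interval schedules watch the clock and reinforce only the first response once enough time has passed.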




What are the four schedules?
How do they work?
Ratio schedules are based on:
Interval schedules are based on:
Give example of each

Shaping: a process in which reinforcement is
used to sculpt new responses out of old ones.


Used to train animals
Chaining: Responses that follow one another
in a sequence. Each response signals the next.

Used to learn complex skills, such as swimming

Aversive control: this occurs when behavior is negatively
reinforced or punished



Negative reinforcement: a painful or unpleasant stimulus is removed
Examples: stone in shoe, fear, disapproval
Note: the removal follows the behavior and takes away a
negative stimulus
Uses: 1. escape conditioning: a person's
behavior causes an unpleasant event to stop
(whining when served liver)
2. avoidance conditioning: the behavior has
the effect of preventing an unpleasant situation
from happening (whining before liver is served)


Punishment: an unpleasant consequence occurs and
decreases the frequency of the behavior
Note: opposite of negative reinforcement: in
negative reinforcement the behavior increases; in
punishment the behavior decreases
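
The distinction noted above can be written as a small lookup. This is an illustrative sketch only: the function names and the string labels are made up for the example, and it covers just the cases described in these notes.

```python
def classify_consequence(unpleasant_stimulus, behavior):
    """Label a consequence using the two cases described in the notes.

    unpleasant_stimulus: "removed" or "presented"
    behavior: "increases" or "decreases"
    """
    if unpleasant_stimulus == "removed" and behavior == "increases":
        return "negative reinforcement"
    if unpleasant_stimulus == "presented" and behavior == "decreases":
        return "punishment"
    return "not covered by these notes"


def classify_negative_reinforcement(event_already_started):
    """Escape stops an unpleasant event; avoidance prevents it from starting."""
    if event_already_started:
        return "escape conditioning"
    return "avoidance conditioning"


# Removing the stone from a shoe makes stone-removing more likely next time.
print(classify_consequence("removed", "increases"))
# Whining before the liver is served prevents the unpleasant event.
print(classify_negative_reinforcement(event_already_started=False))
```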




What do you think?
Unwanted side effects: rage, aggression, fear
People learn to avoid the person delivering the
aversive consequences
Punishment does not teach the appropriate
behavior