Survey							
                            
		                
		                * Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Learning
 A relatively permanent change in behavior brought about by
experience
Watson’s Extreme Environmentalism
 “Give me a dozen healthy infants, well-formed, and my own
special world to bring them up in, and I’ll guarantee to take
any one at random and train him to be any type of specialist I
might select – doctor, lawyer, artist, merchant-chief , and
yes, beggar-man and thief, regardless of his talents,
penchants, tendencies, abilities, vocations, and race of his
ancestors.”
Learning Theories/Behaviorism
 Habituation
 Pavlov: Classical Conditioning
 Watson
 Skinner: Operant Conditioning
1
Habituation
 Simplest form of learning
 Repeated exposure to a stimulus results in reduced
responsiveness.
Habituation Procedures
 Used to assess cognitive competence
Declining interest indicates learning
Novelty responsiveness indicates discrimination of new versus
familiar
 Older infants habituate faster than younger infants
 Infants of same age require more time to encode complex
stimuli than simple stimuli
Habituation Procedures
 Orienting response: natural attentional response to new
stimulus.
 Habituation: decline in orienting response as initially novel
stimulus becomes familiar.
 Dishabituation: recovery of orienting response when an
habituated stimulus changes.
2
Habituation
Habituation
Classical Conditioning
 An organism comes to associate one stimulus with another.
 Learning that one event predicts another.
3
Classical Conditioning
 Pavlov
 Unconditioned stimulus
 Unconditioned response
 Conditioned stimulus
 Conditioned response
Pavlov’s Original Experiment
 http://www.youtube.com/watch?v=hhqumfpxuzI
Classical Conditioning
4
Example
You are in your dentist's office. Your dentist is looking at your xrays, when he gets that far-off look that only dentists can get when
they are looking at x-rays of The Big One!
 He turns to you, and with a half-sadistic, half-empathetic look,
says, "My, my! I don't see cavities like this very often!"
 You hunker down, experiencing the drilling .
 Take notice of any changes in the way your body is reacting. Pay
particular attention to bodily responses (remember, Pavlovian
conditioning is about involuntary responses--those we do not have
control over!).
Example
 In this demonstration of the dentist's
drill:
 Unconditional Reflex
 UCS – drilling
 UCR - pain
 Conditional Reflex
 CS - sound of drill
 CR - pain
Summary of General Principles of
Classical Conditioning
Any stimulus we can perceive has the potential to become a
conditioned stimulus.
 Any response we make naturally can come to be elicited by any
learned signal.
 These responses can be highly specific and simple (such as a
muscle twitch or part of a brain wave pattern) or general and
complex (such as fear).
 The conditioned response can be a response of our skeletal
muscles or visceral organs or even a "private" response (such as
thoughts and feelings). In other words, the response can be an
overt behavior or reaction or something internal that only you
know is happening.
5
Summary of General Principles of
Classical Conditioning
With a powerful original UCS, conditioning may take place in only
one trial in which the UCS is paired with any CS.
 Stimuli quite different from the original CS can control the
appearance of the conditioned response through second-order
conditioning {DEF: transfer of CR from one CS to another CS.
After you were salivating to Pavlov, I could have presented Pavlov
and a light at the same time. Eventually, you would then salivate to
just the light}.
 Depending on the strength of the CR and the nature of the
conditioning process some learned responses resist extinction don't fade away easily, and may endure a lifetime. This is good
news if the response is something the person wants to continue
doing, this is bad news if the behavior is destructive.
Operant Conditioning
 The learning process by which a given behavior is
changed by the consequences of that behavior.
 An organism learns to behave in ways that produce
reinforcement.
 Watson, B. F. Skinner, Thorndike
Watson – Little Albert Experiment
http://www.youtube.com/watch?v=
0FKZAYt77ZM&feature=related
6
Consequences of Behavior
Reinforcement
 When a consequence
causes an increase in the
performance of the
behavior on which it is
contingent.
7
Reinforcement
 Positive reinforcement: strengthening of a response whose
consequence is a pleasant event.
 Negative reinforcement: strengthening of a response because
it is followed by removal of an unpleasant event.
Factors affecting Positive
Reinforcement
 Selection of the behavior to be increased
 Choice of the reinforcer
 Immediacy
 Instructions
 Schedules of Reinforcement
Schedules of Reinforcement
 Behavior is not necessarily going to be reinforced
every time it occurs
 In real life, behavior is not often reinforced each
time it occurs
 Intermittent reinforcement refers to
reinforcement that is not administered to each
instance of a response
8
Advantages of Intermittent
Reinforcement
 Economizing on time and reinforcers
 Building persistent behavior which is much
more resistant to extinction
Types of Reinforcement Schedules
*Continuous reinforcement: when each instance of a
response is reinforced
* Intermittment: when reinforcement is provided only after
some, but not all occasions of a behaviour
*Fixed ratio: when a certain number of responses occur
prior to the delivery of the reinforcer (e.g., every 5th time)
*Variable ratio: varies the number of responses required
around some average.
*Fixed interval: reinforcement after some fixed length of
time (e.g., after 30 seconds)
*Variable interval: varies the length of time between
reinforced responses around an average interval
Schedules of Reinforcement
 Continuous reinforcement
refers to reinforcement
being administered to each
instance of a response
 Intermittent reinforcement
lies between continuous
reinforcement and
extinction
9
An Example of Continuous
Reinforcement
 Each instance of a smile is reinforced
Fixed Ratio Reinforcement
 A fixed number of responses is required for each reinforcement
 Example: every fourth instance of a smile is reinforced
Graph of Fixed Ratio Responding
10
Fixed Interval Reinforcement
 These schedules require the passage of a specified amount of time
before reinforcement will be delivered contingent on a response
 No response during the interval is reinforced
 The first response following the end of the interval is reinforced
 This schedule usually produces a scalloped pattern of responding
in which little behavior is produced early in the interval, but as
the interval nears an end, the rate of responding increases
 This also produces an overall low rate of responding
Graph of Fixed Interval Responding
Variable Schedules of Reinforcement
 Variable schedules differ from fixed schedules in that the
behavioral requirement for reinforcement varies randomly
from one reinforcement to the next
 This usually produces a more consistent pattern of responding
without post-reinforcement pauses
 Variable ratio schedules produce an overall high consistent rate
of responding
 Variable interval schedules produce an overall low consistent
rate of responding
11
An Example of Variable Ratio
Reinforcement
 Random instances of the behavior are reinforced
Graph of Variable Ratio
Responding
Graph of Variable Interval Responding
12
Punishment
 When a consequence causes a decrease in the
performance of a contingent behaviour
Practice
 Giving a child a time-out
Negative punishment
 Community service time – cleaning up garbage
Positive punishment
 Applause after outstanding concert performance
 Positive reinforcement
 You are freezing and put on a jacket
Negative reinforcement
13
Shaping
 Process of teaching a complex behavior by rewarding closer
and closer approximations of the desired behavior
Limitations of Punishment
 Only suppresses existing behaviors
 Potential serious social
consequences
 Learned helplessness
 Can lead to aggression/antisocial
behaviors
 Only works in presence of
punisher
Using Punishment Effectively
 Punish as soon as possible
 Punish with appropriate amount of intensity
 Punish consistently
 Be otherwise warm
 Explain yourself
 Reinforce alternative behavior
 Consider alternative responses to misbehavior
14