Download Decision Theory

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Decision Theory
Slide 1
Learning Objectives






Structuring the decision problem and decision trees
Types of decision making environments:
• Decision making under uncertainty when
probabilities are not known
• Decision making under risk when probabilities
are known
Expected Value of Perfect Information
Decision Analysis with Sample Information
Developing a Decision Strategy
Expected Value of Sample Information
Slide 2
Types Of Decision Making Environments

Type 1: Decision Making under Certainty.
Decision maker know for sure (that is, with
certainty) outcome or consequence of every decision
alternative.

Type 2: Decision Making under Uncertainty.
Decision maker has no information at all about
various outcomes or states of nature.

Type 3: Decision Making under Risk.

Decision maker has some knowledge regarding
probability of occurrence of each outcome or state of
nature.
Slide 3
Decision Trees




A decision tree is a chronological representation of
the decision problem.
Each decision tree has two types of nodes; round
nodes correspond to the states of nature while square
nodes correspond to the decision alternatives.
The branches leaving each round node represent the
different states of nature while the branches leaving
each square node represent the different decision
alternatives.
At the end of each limb of a tree are the payoffs
attained from the series of branches making up that
limb.
Slide 4
Decision Making Under Uncertainty


If the decision maker does not know with certainty
which state of nature will occur, then he/she is said
to be making decision under uncertainty.
The five commonly used criteria for decision making
under uncertainty are:
1. the optimistic approach (Maximax)
2. the conservative approach (Maximin)
3.
the minimax regret approach (Minimax regret)
4. Equally likely (Laplace criterion)
5. Criterion of realism with  (Hurwicz criterion)
Slide 5
Optimistic Approach



The optimistic approach would be used by an
optimistic decision maker.
The decision with the largest possible payoff is
chosen.
If the payoff table was in terms of costs, the decision
with the lowest cost would be chosen.
Slide 6
Conservative Approach



The conservative approach would be used by a
conservative decision maker.
For each decision the minimum payoff is listed and
then the decision corresponding to the maximum of
these minimum payoffs is selected. (Hence, the
minimum possible payoff is maximized.)
If the payoff was in terms of costs, the maximum
costs would be determined for each decision and
then the decision corresponding to the minimum of
these maximum costs is selected. (Hence, the
maximum possible cost is minimized.)
Slide 7
Minimax Regret Approach




The minimax regret approach requires the
construction of a regret table or an opportunity loss
table.
This is done by calculating for each state of nature
the difference between each payoff and the largest
payoff for that state of nature.
Then, using this regret table, the maximum regret for
each possible decision is listed.
The decision chosen is the one corresponding to the
minimum of the maximum regrets.
Slide 8
Example: Marketing Strategy
Consider the following problem with two decision
alternatives (d1 & d2) and two states of nature S1
(Market Receptive) and S2 (Market Unfavorable) with
the following payoff table representing profits ( $1000):
States of Nature
s1
s3
Decisions
d1
20
6
d2
25
3
Slide 9
Example: Optimistic Approach
An optimistic decision maker would use the optimistic
approach. All we really need to do is to choose the
decision that has the largest single value in the payoff
table. This largest value is 25, and hence the optimal
decision is d2.
Maximum
Decision
Payoff
d1
20
choose d2
d2
25
maximum
Slide 10
Example: Conservative Approach
A conservative decision maker would use the
conservative approach. List the minimum payoff for
each decision. Choose the decision with the maximum
of these minimum payoffs.
Minimum
Decision
Payoff
choose d1
d1
d2
6
3
maximum
Slide 11
Example: Minimax Regret Approach
For the minimax regret approach, first compute a
regret table by subtracting each payoff in a column
from the largest payoff in that column. The resulting
regret table is:
s1
d1
d2
5
0
s2
0
3
Maximum
5
3
minimum
Then, select the decision with minimum regret.
Slide 12
Example: Equally Likely (Laplace) Criterion
Equally likely, also called Laplace, criterion finds
decision alternative with highest average payoff.
• First calculate average payoff for every alternative.
• Then pick alternative with maximum average payoff.
Average for d1 = (20 + 6)/2 = 13
Average for d2 = (25 + 3)/2 = 14 Thus, d2 is selected
Slide 13
Example: Criterion of Realism (Hurwicz)

Often called weighted average, the criterion of realism
(or Hurwicz) decision criterion is a compromise between
optimistic and a pessimistic decision.
• First, select coefficient of realism, , with a value
between 0 and 1. When  is close to 1, decision maker
is optimistic about future, and when  is close to 0,
decision maker is pessimistic about future.
• Payoff =  x (maximum payoff) + (1-) x (minimum payoff)
In our example let  = 0.8
Payoff for d1 = 0.8*20+0.2*6=17.2
Payoff for d2 = 0.8*25+0.2*3=20.6 Thus, select d2
Slide 14
Decision Making with Probabilities

Expected Value Approach
• If probabilistic information regarding the states of
nature is available, one may use the expected
Monetary value (EMV) approach (also known as
Expected Value or EV).
• Here the expected return for each decision is
calculated by summing the products of the payoff
under each state of nature and the probability of
the respective state of nature occurring.
• The decision yielding the best expected return is
chosen.
Slide 15
Expected Value of a Decision Alternative


The expected value of a decision alternative is the sum
of weighted payoffs for the decision alternative.
The expected value (EV) of decision alternative di is
defined as:
N
EV( d i )   P( s j )Vij
j 1
where:
N = the number of states of nature
P(sj) = the probability of state of nature sj
Vij = the payoff corresponding to decision
alternative di and state of nature sj
Slide 16
Example: Marketing Strategy

Expected Value Approach
Refer to the previous problem. Assume the
probability of the market being receptive is known to
be 0.75. Use the expected monetary value criterion to
determine the optimal decision.
Slide 17
Expected Value of Perfect Information



Frequently information is available that can improve
the probability estimates for the states of nature.
The expected value of perfect information (EVPI) is
the increase in the expected profit that would result if
one knew with certainty which state of nature would
occur.
The EVPI provides an upper bound on the expected
value of any sample or survey information.
Slide 18
Expected Value of Perfect Information

EVPI Calculation
• Step 1:
Determine the optimal return corresponding to each
state of nature.
• Step 2:
Compute the expected value of these optimal
returns.
• Step 3:
Subtract the EV of the optimal decision from the
amount determined in step (2).
Slide 19
Example: Marketing Strategy

Expected Value of Perfect Information
Calculate the expected value for the best action for each
state of nature and subtract the EV of the optimal
decision.
EVPI= .75(25,000) + .25(6,000) - 19,500 = $750
Slide 20