Download Intro to Game Theory

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Into. Game theory
Background: Game theory
Background: Prison’s Dilemma
Given you’re Dave,
what’s the best choice?
Nash Equilibrium
A set of strategies is a Nash equilibrium if no player
can do better by changing his or her strategy
Nash Equilibrium (Cont’)
• Nash showed (1950), that Nash equilibria (in
mixed strategies) must exist for all finite
games with any number of players.
• Before Nash's work, this had been proven for
two-player zero-sum games (by John von
Neumann and Oskar Morgenstern in 1947).
• Today, we’re going to find such Nash equilibria
using Linear Programming for zero-sum game
Zero-Sum Game
• A strictly competitive or zero-sum game is a 2-player
strategic game such that for each action a  A, we
have u1(a) + u2(a) = 0. (u represents for utility)
–What is good for me, is bad for my
opponent and vice versa
Zero-Sum Game
• Mixed strategy
– Making choice randomly obeying some kind of
probability distribution
• Why mixed strategy? (Nash Equilibrium)
• E.g. : P(1) = P(2) = 0.5; P(A) = P(B)=P(C)=1/3
Solving Zero-Sum Games
• Let A1 = {a11, …, a1n }, A2 = {a21, …, a2m }
• Player 1 looks for a mixed strategy p
– ∑i p(a1i ) = 1
– p(a1i ) ≥ 0
– ∑i p(a1i ) · u1(a1i, a2j) ≥ r for all j {1, …, m}
– Maximize r!
• Similarly for player 2.
Solve using Linear Programming
• What are the unknowns?
– Strategy (or probability distribution): p
• p(a11 ), p(a12 ),..., p(a1n-1 ), p(a1n )~n numbers
• Denoted as p1,p2,...,pn-1,pn
– Optimum Utility or Reward: r
 r 


p
 1 
 p 
x 2 


p 
 n 1 
 p 
 n 
• Stack all unknowns into a column vector
• Goal: maximize: f T x where f  1 0
0 0
T
Solve Zero-Sum Game using LP
• What are the constraints?
 r 


 p1 
 p 
x 2 


p 
 n 1 
 p 
 n 


n
i 1
pi  1   0 1
1x  1
n
p
u

r
,
for
all
j

{1,
2,...,
m
},
i
.
e
.
i
ij
i 1
r  u11 p1  u21 p2  ...  un1 pn  0
r  u12 p1  u22 p2  ...  un 2 pn  0
...
r  u1n p1  u2 n p2  ...  unb pn  0
Solve Zero-Sum Game using LP
r  u11 p1  u21 p2  ...  un1 pn  0
r  u12 p1  u22 p2  ...  un 2 pn  0
...
r  u1n p1  u2 n p2  ...  unb pn  0
1; U  x  0  Ax  0
T
Let: Ae=(0,1,…,1)
Aie=(1;-UT)
f=(1,0,…,0)T
The LP is:
maximize fTx
subject to:
Aiex<=0
Aex=1
Summary
• Zero-Sum Game  Mixed Strategy  NE
• Connections with Linear Programming
Related documents