* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download simultaneous - move games
Survey
Document related concepts
Transcript
SIMULTANEOUS - MOVE GAMES A game is simultaneous-move if the players choose their actions without any information on the other players’ actions. This does NOT necessarily mean that the players are acting at exactly the same time (even though that may very well be the case). Example: the interaction between Beckham and Rüstü is a simultaneous-move game (even though they do not act at exactly the same time) because Rüstü does NOT observe where Beckham kicked the ball 1 A game (in strategic form or normal form) is 1. A set of players N 2. For each player i in N, a set of his strategies: Si 3. For each player i in N, his payoff function: ui Example: the Rock-Paper-Scissors game N = {1 , 2} S1 = S2 = {r , p , s } u1(r , s) = u1(p , r) = u1(s , p) = 1 u1(r , r) = u1(p , p) = u1(s , s) = 0 u1(r , p) = u1(p , s) = u1(s , r) = -1 u2 is defined similarly 2 Example 2 Guessing half of the average (10 players) 1. Explain the rules of the game 2. Write it in normal form 3 To illustrate simple simultaneous-move games we use a Game Table 4 Example: the Rock-Paper-Scissors game draw a table: Player 1 takes rows Player 2 takes columns PLAYER 2 R P S R PLAYER 1 P S Observe: each cell corresponds to a possible outcome 5 Then place in each cell the payoffs agents get from that outcome PLAYER 2 PLAYER 1 R P S R 0, 0 -1, 1 1, -1 P 1, -1 0, 0 -1, 1 1, -1 0, 0 S What about the halfof the average game? Row player -1, 1 : -1 Column player: 1 6 A tale of two prisoners: Tchaikovsky Confess (Defect) Deny (Cooperate) Confess (Defect) 10 yr, 10 yr 1 yr, 25 yr Deny (Cooperate) 25 yr, 1 yr 3 yr, 3 yr Conductor ATTENTION: THIS IS NOT A GAME TABLE 7 Prisoner’s Dilemma Tchaikovsky Confess (Defect) Deny (Cooperate) Confess (Defect) -10, -10 -1, -25 Deny (Cooperate) -25, -1 -3, -3 Conductor 8 Students doing a project together Student 2 Goof off (Defect) Work hard (Cooperate) Goof off (Defect) 1, 1 3, Work hard (Cooperate) 0, 3 2, 0 Student 1 2 9 Two firms (duopolists), choosing prices Firm 2 Low (Defect) Low (Defect) High (Cooperate) 600, 600 1200, -200 -200, 1200 1000, 1000 Firm 1 High (Cooperate) Other examples: the arms race, common property, etc. 10 Battle of the Sexes Wife Football Soap opera Football 2, 1 0, Soap opera 0, 0 1, 0 Husband 2 Other examples: two politicians determining position on an issue two merging firms choosing between PC and MAC 11 Matching Pennies Veli Head Head Tail 1, -1 -1, 1 -1, 1 1, -1 Ali Tail Example: leader and follower firms choosing product appearances 12 The Stag hunt ( Rousseau, in Discourse on the Origin and Foundations of Inequality Among Man ) Hunter 2 Stag Hare Stag 2, 2 0, 1 Hare 1, 0 1, 1 Hunter 1 Other example: arms race 13 The Chicken game DEAN Swerve (Chicken) Straight (Tough) Swerve (Chicken) 0, 0 –1, 1 Straight (Tough) 1, –1 –2, –2 JAMES 14 "The Gift of the Magi" (O. Henry) is about a young couple who are short of money but desperately want to buy each other Christmas gifts. Unbeknownst to Jim, Della sells her most valuable possession, her beautiful hair, in order to buy a platinum fob chain for Jim's watch; while unbeknownst to Della, Jim sells his own most valuable possession, his watch, to buy jeweled combs for Della's hair. 15 The Gift of the Magi (O Henry) Jim Don’t buy present Don’t buy present Buy present 0, 0 –1, 1 1, –1 –2, –2 Della Buy present 16 Zero-sum game A 2-player game is zero-sum if in each entry of the game table, the agents' payoffs add up to zero. Constant-sum game A 2-player game is constant-sum if in each entry of the game table, the agents' payoffs add up to the same constant. Competitive game A 2-player game is competitive if Player 1’s ranking of the strategy profiles is the opposite of Player 2’s ranking. Every zero-sum game is a constant-sum game and every constantsum game is competitive. However, every competitive game can be made zero-sum by changing the payoff numbers. In these games, the players have totally opposing interests. 17 Zero-sum games: it is sufficient to write one number in every box the payoff of row player (b) Payoffs in zero-sum notation PLAYER 2 PLAYER 1 PLAYER 2 R P S R 0, 0 -1, 1 1, -1 P 1, -1 0, 0 -1, 1 1, -1 0, 0 S -1, 1 Row player receives PLAYER 1 R P S R 0 -1 1 P 1 0 -1 S -1 1 0 -1 Column player receives - ( -1 ) = 1 18 Game tables with three players: TALIA chooses: Contribute Don't Contribute NINA EMILY NINA Contribute Don't Contribute 5, 5, 5 3, 6, 3 Don't 6, 3, 3 4, 4, 1 EMILY Contribute Don't Contribute 3, 3, 6 1, 4, 4 Don't 4, 1, 4 2, 2, 2 19 ANALYZING GAMES Which strategies will the agents choose? What will be the outcome? USE: the assumption of rationality 20 In-class game Guessing half of the average (10 players) Play the game 21 Discussion • dominance, • iterated elimination of dominated strategies, • culmination in a Nash equilibrium • players getting close to the Nash equilibrium with more experience • if you expect the others not to play equilibrium strategies, then your best response might be different 22 What will the players do? Tchaikovsky Defect Cooperate Defect -10, -10 -1, -25 Cooperate -25, -1 -3, -3 Conductor 23 Strict Domination • Take an agent (call him Bond, James Bond) and two of his strategies: U and D. • The strategy U strictly dominates the strategy D (for Bond) if 1. for every possible strategy profile of the other agents, playing U yields a higher (>) payoff than playing D. That is, independent of what the others do, playing U always gives a higher payoff 24 Given a strategic form game G = ( N , (S1,…,Sn) , (u1,…,un) ) for an Agent i, U strictly dominates D if 1. for every strategy profile s-i = ( s1 ,…, si-1 , si+1 ,…, sn ) of the agents other than i, ui ( U , s-i ) > ui ( D , s-i ) 25 What will the players do in this game? Tchaikovsky Defect Cooperate Defect -10, -10 -1, -25 Cooperate -25, -1 -1, -3 Conductor 26 Weak Domination • Take an agent (call him Bond, James Bond) and two of his strategies: U and D. • The strategy U weakly dominates the strategy D (for Bond) if 1. for every possible strategy profiles of the other agents, playing U yields an at least as high (≥) payoff as playing D and 2. for at least one strategy profile of the other agents, U yields a higher (>) payoff than playing D. 27 Given a strategic form game G For agent i, U weakly dominates D if 1. for every strategy profile s-i of the other agents, ui ( U , s-i ) ≥ ui ( D , s-i ) and 2. for at least one s-i* of the other agents, ui ( U , s-i* ) > ui ( D , s-i* ) 28 What are the dominance relations? Row Player’s payoffs U 5 -1 3 M 2 -5 2 D 3 2 1 29 NOTE: If U strictly dominates D then U weakly dominates D Definition: If there is some strategy U that strictly (weakly) dominates D then D is called a strictly (weakly) dominated strategy Important: Rational players do NOT PLAY their strictly dominated strategies They can sometimes play their weakly dominated strategies 30 Cooperate is weakly dominated in this game? Can they sustain (Cooperate,Cooperate) ? Tchaikovsky Defect Cooperate Defect -10, -10 -1, -25 Cooperate -25, -1 -1, -1 Conductor 31 QUESTION • If M is strictly dominated by U and M is weakly dominated by D what do I call M ? 32 If a strategy of Bond strictly dominates all of his other strategies it is called a strictly dominant strategy. If Bond has a strictly dominant strategy, he always plays it If a strategy of Bond weakly dominates all of his other strategies it is called a weakly dominant strategy. However, Bond does not always play a weakly dominant strategy. Why? The answer lies in what Bond expects others to play. 33 QUESTIONS 1. If U strictly dominates M and U weakly dominates D what do I call U? 2. How many strictly dominant strategies can a player have? How many weakly dominant strategies can a player have? 34 BEST SITUATION Strictly Dominant Strategy for each agent (it strictly dominates all of her other strategies) or Weakly Dominant Strategy for each agent (it weakly dominates all of her other strategies) THEN WE KNOW WHAT TO DO !!! We have a Dominant Strategy Equilibrium 35 Definition: a Dominant Strategy Equilibrium of the game G = ( N , (S1,…,Sn) , (u1,…,un) ) is a strategy profile (s1,…,sn) such that for every player i in N, si is a dominant strategy of player i. 36 Both players have dominant strategies in the Prisoner’s Dilemma. Tchaikovsky Confess (Defect) Deny (Cooperate) Confess (Defect) -10, -10 -1, -25 Deny (Cooperate) -25, -1 -3, -3 Conductor Thus (Confess, Confess) is a Dominant Strategy Equilibrium 37 1. What are the domination relationships between players’ strategies? 2. Which are the dominant strategies? Strict or weak? 3. Which are the dominated strategies? Strict or weak? 4. Any dominant strategy equilibrium? COLUMN ROW Left Center Right Up 1, 2 2, 5 1, 3 Level 0, 5 1, 3 1, 6 Down 1, 1 3, 1 5, 2 38 What about the Battle of the Bismarck Sea? JAPANESE NAVY U.S. AIR FORCES North South North 2 2 South 1 3 Note that this is a zero-sum game 39 What about the Chicken game? DEAN Swerve (Chicken) Straight (Tough) Swerve (Chicken) 0, 0 –1, 1 Straight (Tough) 1, –1 –2, –2 JAMES 40 What do you do when there is no dominant strategy equilibrium ????? Successively Eliminating Strictly Dominated Strategies (because players never play them) The table gets smaller and smaller and smaller and smaller and smaller 41 Successive elimination of strictly dominated strategies PIERCE’S PIZZA PIES DONNA’S DEEP DISH High Medium Low High 60, 60 36, 70 36, 35 Medium 70, 36 50, 50 30, 35 Low 35, 36 35, 30 25, 25 42 CAUTION: When dominance is weak, the result depends on the order of elimination. COLUMN L ROW C R T 1, 1 1, 1 0, 0 B 0, 0 1, 2 1, 2 HOW ABOUT GAMES LIKE THE CHICKEN GAME? 43 Nash Equilibrium (J. Nash, 1951) A strategy profile where an agent, given her opponents’ strategies, can NOT INCREASE her payoff by switching to another strategy That is, a strategy profile where each agent’s strategy is a best response to the other agents’ strategies Definition: a Nash Equilibrium of the game G = ( N , (S1,…,Sn) , (u1,…,un) ) is a strategy profile (s1*,…,sn*) such that for every player i in N, ui( si* , s-i* ) ≥ ui( si , s-i* ) for every si in Si. 44 Interpretation: Imagine a population of many decision-makers, who repeatedly interact (assume the role of a game player) In time, they will form beliefs about how opponents behave. With experience, they will find the best actions for themselves. A Nash equilibrium is designed to model a steady state of social interaction (in a sense, a social norm) that emerges out of the players gaining experience in playing repeatedly. Example: Driving (more specifically, the crossroads game where you either stop or pass) 45 The Prisoners’ Dilemma: Student 2 Goof off (Defect) Work hard (Cooperate) Goof off (Defect) 1, 1 3, Work hard (Cooperate) 0, 3 2, 0 Student 1 2 The incentive to free-ride eliminates the possibility of the mutually desirable outcome. 46 Battle of the Sexes Wife Football Soap opera Football 2, 1 0, Soap opera 0, 0 1, 0 Husband 2 Both outcomes are stable social norms. 47 Matching Pennies Veli Head Head Tail 1, -1 -1, 1 -1, 1 1, -1 Ali Tail 48 The Tennis Game HINGIS SELES DL CC DL 50 80 CC 90 20 49 The Stag hunt Hunter 2 Stag Hare Stag 2, 2 0, 1 Hare 1, 0 1, 1 Hunter 1 One outcome better than the other, but coordinated deviations are not allowed. Will talk about focal equilibria. 50 IN-CLASS game: • Two players • Each proposes a division of 1000 YTL • If proposals match, they get what is proposed • Otherwise, they get nothing Assume: each player’s objective is to maximize her monetary gain. 51 Consider the following coordination game: COLUMN BACH STRAVINSKY BACH 2, 2 0, 0 STRAVINSKY 0, 0 1, 1 ROW Is one of the equilibria more likely? It is called a focal equilibrium. 52 The Chicken Game DEAN Swerve (Chicken) Straight (Tough) Swerve (Chicken) 0, 0 –1, 1 Straight (Tough) 1, –1 –2, –2 JAMES 53 B A FIGURE 4.13 Lottery 0 1 2 0 0, 0 0, 5 0, 0 1 5, 0 0, 0 – 5, 0 2 0, 0 0, – 5 – 5, – 5 54 Copyright © 2000 by W.W. Norton & Company COLUMN ROW EXERCISE 4.7 Left Middle Right Top 3, 1 2, 3 10, 2 High 4, 5 3, 0 6, 4 Low 2, 2 5, 4 12, 3 Bottom 5, 6 4, 5 9, 7 55 Copyright © 2000 by W.W. Norton & Company B A EXERCISE 4.9 1 2 3 1 10, 10 0, 0 0, 0 2 0, 0 15, 15 0, 0 3 0, 0 0, 0 15, 15 56 Copyright © 2000 by W.W. Norton & Company