Two-Player Zero-Sum Games

Two-Player Zero-Sum Games Finding the saddle-point – Example #2

Finding Saddle Points Consider the following 2-player zero-sum game… Player 2 Player 1

Finding Saddle Points Consider the following 2-player zero-sum game… Player 2 First, determine if saddle-point is in pure strategies…. we determine minimax and maximin strategies. Player 1

Finding Saddle Points Consider the following 2-player zero-sum game… P2 maximum of minimum values -3 -1 P1 minimum of maximum values 4 2

Finding Saddle Points Consider the following 2-player zero-sum game… P2 B is the maximin strategy for player 1 -3 -1 P1 minimum of maximum values 4 2

Finding Saddle Points Consider the following 2-player zero-sum game… P2 B is the maximin strategy for player 1 -3 -1 P1 Y is the minimax strategy for player 2 4 2

Finding Saddle Points P2 Maximin strategy for player 1 -3 -1 P1 Minimax strategy for player 2 4 2 If player 1 chooses strategy A, then player 2’s best strategy results in the payoff -3 If player 1 chooses strategy B, then player 2’s best strategy results in the payoff -1 Therefore, assuming player 2 is always playing his best strategy for any choice by player 1, the best strategy for player 1 is B… this is called the maximin strategy.

Finding Saddle Points P2 Maximin strategy for player 1 -3 -1 P1 Minimax strategy for player 2 4 2 If player 2 chooses X then the best strategy for player 1 results in 4. If player 2 chooses Y then the best strategy for player 1 results in 2. Therefore, assuming player 1 is always playing his best strategy for any strategy choice of player 2, the best strategy for player 2 is Y. This is the minimax strategy for player 2.

Finding Saddle Points P2 -3 -1 P1 These values are not the same and therefore the optimal strategies – the saddle point – lies in mixed strategies for each player. 4 2 Assuming that player 2 is playing his best, player 1 could be sure that the worst that could happen would be to get a payoff of -1 if he continued with strategy B and it might turn out better, if player 2 mistakenly chose X.

Finding Saddle Points P2 Assuming that player 1 is playing his best, player 2 could choose strategy Y and the worst that could happen is to get a 2 (and it could turn out better for player 2, if player 1 mistakenly chose B.) -3 -1 P1 4 2 The situation is that an optimal strategy for each will be mixed strategies because, the worst that could happen to each is not the same value… that is to say, based on the maximin and minimax strategies of each, each knows the worst that could happen assuming the other plays their best. Each player will try to do better than that by mixing strategies on repeated plays of the game.

Finding Saddle Points Assign probabilities for repeated play for each strategy. P2 p 1-p q 1-q P1 First, let’s determine P2’s optimal mixed strategy…To do this, we will consider the payoffs that P1 would receive with pure strategies corresponding to P2’s mixed strategy… EA= -3p + 2(1-p) = -3p+2-2p = -5p+2 EB = 4p + -1(1-p) = 4p -1+p = 5p-1

Graphing the Payoff Functions Find intersection by solving equations… 5p-1 = -5p+2 10p = 3 p=3/10 then plug p = 3/10 into either function to get payoff of 1/2 expected value Intersection at (3/10, 1/2) 1/2 represents the expected payoff to the row player (player 1) and is therefore the referred to as the value of the game. Note that the expected payoff to the column player (player 2) would therefore by -1/2). EB = 5p - 1 4 1 p -1 1 -3 EA = -5p + 2

Finding Saddle Points P2 p 1-p q 1-q P1 Now, for player 1… we assume pure strategies for player 2 and find q and 1-q which represent the optimal mixed strategy for player 1… EX= -3q + 4(1-q) = -3q + 4 – 4q = -7q + 4 EY = 2q -1(1-q) = 2q – 1 + q = 3q - 1

Graphing the Payoff Functions Find intersection by solving equations… -7q + 4 = 3q -1 10q = 5 q = 1/2 The goal for player 1 is to maximize his own payoff. Because it is a zero sum game, that’s the same as minimizing the other player’s payoff. expected value Intersection at (1/2, 1/2) For example, if player 1 chose a value of q = 1 which corresponds to playing strategy A, then the best strategy for player 2 is to pick strategy X. Player 1 finds that q=1/2 is optimal because player 2 can not get a lower payoff at this point. 4 EY = 3q -1 1 q -1 1 -3 EX = -7q + 4

Graphing the Payoff Functions Find intersection by solving equations… -7q + 4 = 3q -1 10q = 5 q = 1/2 The conclusion is that an optimal mixed strategy for player 1 is play strategy A with probability 1/2 and strategy B with probability 1/2. expected value Intersection at (1/2, 1/2) 4 EY = 3q -1 1 q -1 1 -3 EX = -7q + 4

Writing the Answer We were seeking an optimal strategy for each player in this 2-player zero-sum game. The answer is that the optimal strategies for each player lie in mixed strategies. We’ve found that the optimal strategy for player 2 is play strategy X with probability 3/10 and therefore play strategy Y with probability 7/10. And we found that the optimal strategy for player 1 is to play strategy A with probability 1/2 and strategy B with probability 1/2. Player 2 Player 1 We’ve also found that the value of the same is 1/2. Note that the value of the game is the payoff to the row player (and thus negating the value of the game is the payoff to the column player). Also note that the value of the game is always the second coordinate in both calculations of an intersection point for the payoff functions.

Writing the Answer We were seeking an optimal strategy for each player in this 2-player zero-sum game. Player 2 Here is a succinct way to write the final answer…. Optimal strategies: Player 2: (3/10, 7/10) Player 1: (1/2, 1/2) Value of the game: 1/2 Player 1 In writing it this way, it is understood that the first coordinate is the probability for playing the first strategy and the second coordinate is the probability for playing the other strategy when reading the strategies from the given table in the natural order – from left to right and top to bottom.

Two-Player Zero-Sum Games

Two-Player Zero-Sum Games

Presentation Transcript

Notes 6: Two-Player Games and Search

Chapter 7 Two Player Perfect-Information Games

Zero-Sum Superheroes

Multi-player, non-zero-sum games

Two Player Zero Sum Games

Zero-Sum Company Overview

Notes 6: Two-Player Games and Search

On zero-sum problems

General-sum games

TWO-PERSON, ZERO-SUM GAMES

Two-Person, Zero-Sum Game: Advertising

ZERO-SUM (TOTAL CONFLICT) GAMES

Experts Learning and The Minimax Theorem for Zero-Sum Games

Two-player Games (2) ZUI 2013/2014

TWO-PERSON ZERO-SUM GAMES

Intelligent Strategies for Several Zero-, One- and Two-Player Games

Equilibrium Concepts in Two Player Games

Zero-sum square matrices

CPS 570: Artificial Intelligence Two-player, zero-sum, perfect-information Games

Algorithms for solving sequential (zero-sum) complete-information games

Stochastic Zero-sum and Nonzero-sum  -regular Games

Two-player games overview