740 likes | 910 Vues
Game Theory and Strategy. Content. Two-persons Zero-Sum Games Two-Persons Non-Zero-Sum Games N-Persons Games. Introduction. At least 2 players Strategies Outcome Payoffs. Two-persons Zero-Sum Games. Payoffs of each outcome add to zero Pure conflict between 2 players.
E N D
Content • Two-persons Zero-Sum Games • Two-Persons Non-Zero-Sum Games • N-Persons Games
Introduction • At least 2 players • Strategies • Outcome • Payoffs
Two-persons Zero-Sum Games • Payoffs of each outcome add to zero • Pure conflict between 2 players
Dominance and Dominance Principle • Definition: A strategy S dominates a strategy T if every outcome in S is at least as good as the corresponding outcome in T, and at least one outcome in S is strictly better than the corresponding outcome in T. • Dominance Principle: A rational player would never play a dominated strategy.
Saddle Points and Saddle Points Principle • Definition: An outcome in a matrix game is called a Saddle Point if the entry at that outcome is both less than or equal to any in its row, and greater than or equal to any entry in its column. • Saddle Point Principle: If a matrix game has a saddle point, both players should play a strategy which contains it.
Value • Definition: For a matrix game, if there is a number such that player A has a strategy which guarantees that he will win at least v and player B has a strategy which guarantees player A will win no more than v, then v is called the value of the game.
Saddle Points Minimax
Saddle Points • 0 saddle point • 1 saddle point • more than 1 saddle points
Mixed Strategy • Colin plays with probability x for A, (1-x) for B • Rose A: x(2) + (1-x)(-3) = -3 + 5x • Rose B: x(0) + (1-x)(3) = 3 - 3x • if -3 + 5x = 3 - 3x => x = 0.75 • Rose A: 0.75(2) + 0.25(-3) = 0.75 • Rose B: 0.75(0) + 0.25(3) = 0.75
Mixed Strategy • Rose plays with probability x for A, (1-x) for B • Colin A: x(2) + (1-x)(0) = 2x • Colin B: x(-3) + (1-x)(3) = 3 - 6x • if 2x = 3 - 6x => x = 0.375 • Colin A: 0.375(2) + 0.625(0) = 0.75 • Colin B: 0.375(-3) + 0.625(3) = 0.75
Mixed Strategy • 0.75 as the value of the game • 0.75A, 0.25B as Colin’s optimal strategy • 0.375A. 0.625B as Rose’s optimal strategy
Minimax Theorem • Every m x n matrix game has a solution. There is a unique number v, called the value of game, and optimal strategy for the players such that • i) player A’s expected payoff is no less that v, no matter what player B does, and • ii) player B’s expected payoff is no more that v, no matter what player A does • The solution can always be found in k x k subgame of the original game
Minimax Theorem (example) • There is no dominance in the above example • From arrows in the graph, Colin will only choose A, B or C, but not D or E. • So the game is reduced into a 3 x 3 subgame
Utility Theory • Rose’s order is u, w, x, z, y, v • Colin’s order is v, y, z, x, w, u
Utility Theory • Transformation can be done using a positive linear function, f(x) = ax + b • in this example, f(x) = 0.5(x - 17) -------->
Two-Persons Non-Zero-Sum Games • Equilibrium outcomes in non-zero-sum games ~ saddle points in zero-sum games
Nash Equilibrium • If there is a set of strategies with the property that no player can benefit by changing her strategy while the other players keep their strategies unchanged, then that set of strategies and the corresponding payoffs constitute the Nash Equilibrium
Dominant Strategy Equilibrium • If every player in the game has a dominant strategy, and each player plays the dominant strategy, then that combination of strategies and the corresponding payoffs are said to constitute the dominant strategy equilibrium for that game.
Pareto-optimal • If an outcome cannot be improved upon, ie. no one can be made better off without making somebody else worse off, then the outcome is Pareto-optimal
Pareto Principle • To be acceptable as a solution to a game, an outcome should be Pareto-optimal.
Prudential Strategy, Security Level and Counter-Prudential Strategy • In a non-zero-sum game, player A’s optimal strategy in A’s game is called A’s prudential strategy. • The value of A’s game is called A’s security level • A’s counter-prudential strategy is A’s optimal response to his opponent’s prudential strategy.
Example • consider only Rose’s strategy • saddle point at AB
Example • consider only Colin’s strategy
Example BB AA Equilibrium BA AB
Co-operative Solution Negotiation Set
Co-operative Solution Negotiation Set
Co-operative Solution • Concerns are Trust and Suspicion
N-Person Games • More important and common in real life • n is assumed to be at least three