Oskar Morgenstern
1902—1977
“Sherlock Holmes, pursued by his opponent, Moriarty, leaves London for Dover. The train stops at a station on the way, and he alights there rather than travelling on to Dover. He has seen Moriarty at the railway station, recognizes that he is very clever and expects that Moriarty will take a faster special train in order to catch him in Dover. Holmes’s anticipation turns out to be correct. But what if Moriarty had been still more clever, had estimated Holmes’s mental abilities better and had foreseen his actions accordingly? Then, obviously, he would have travelled to the intermediate station [Canterbury]. Holmes again would have had to calculate that, and he himself would have decided to go on to Dover. Whereupon, Moriarty would again have ‘reacted’ differently,” (p.173-4).
Morgenstern, Oskar, 1935, “Perfect Foresight and Economic Equilibrium,”
“‘All that I have to say has already crossed your mind,’ said he. ‘Then possibly my answer has crossed yours,’ I replied. ‘You stand fast?’ ‘Absolutely.’”
— Arthur Conan Doyle, 1893, The Final Problem
E(X)=k∑i=1pixi
E(X)=k∑i=1pixi
E(X)=p1x1+p2x2+⋯+pkxk
A probability-weighted average of X, with each possible X value, xi, weighted by its associated probability pi
Also called the "mean" or "expectation" of X, always denoted either E(X) or μX
Example: Suppose you lend your friend $100 at 10% interest. If the loan is repaid, you receive $110. You estimate that your friend is 99% likely to repay, but there is a default risk of 1% where you get nothing. What is the expected value of repayment?
Pure strategy: is a complete strategy profile that a player will play
Mixed strategy is a probability distribution over a strategy profile
The logic of mixed strategies is best understood in the context of repeated constant-sum games
If you play one strategy repeatedly (i.e. a pure strategy), your opponent can exploit your (predictable) strategy with their best response
You want to “keep your opponent guessing”
We have already seen Nash equilibrium in pure strategies (PSNE)
Nash (1950) proved that any n-player game with a finite number of pure strategies has at least one equilibrium
Consider the following game between a Kicker and a Goalie during a penalty kick
A constant sum game (in this case, zero-sum)
Palacios-Huerta (2003) calculated average success rates in English, Spanish, & Italian leagues (1995-2000)
If both Kicker and Goalie choose same direction, Kicker's payoff is higher if he chooses his natural side (often Right)
Palacios-Huerta, Ignacio, 2003, “Professionals Play Minimax,” Review of Economic Studies 70(2): 395–415
What if Kicker were to randomize strategies
Let p be probability that Kicker plays Kick Left
Then Goalie wants to maximize his expected payoff, given Kicker plays Kick Left with p=0.50
If Goalie plays Dive Left: E[Dive Left]=42(p)+7(1−p)=42(0.50)+7(1−0.50)
Then Goalie wants to maximize his expected payoff, given Kicker plays Kick Left with p=0.50
If Goalie plays Dive Right: E[Dive Right]=5(p)+30(1−p)=5(0.50)+30(1−0.50)
Then Goalie wants to maximize his expected payoff, given Kicker plays Kick Left with p=0.50
Goalie will play Dive Left to maximize his expected payoff (24.5 ≻ 17.5)
Now consider Kicker's expected payoff under this mixed strategy
Since Goalie will Dive Left to maximize his expected payoff, Kicker can expect to earn:
58(p)+93(1−p)58(0.50)+93(1−0.50)75.5
In constant sum games, note that even in mixed strategies, one player increases their own (expected) payoff by pulling down the other player's (expected) payoff!
In this game, even expected payoffs always sum to 100
von Neumann & Morgenstern’s minimax theorem (simplified): in a 2-person, constant sum game, each player maximizes their own expected payoff by minimizing their opponent's expected payoff
The name “minimax” is a popular strategy in games, trying to minimize the risk of your maximum possible loss
Kicker's “randomizing” 50:50 (Kick Left, Kick Right) was not random enough!
Goalie recognizing this pattern can exploit it and hold down Kicker's expected payoff
Kicker can do better by picking a better p (and similarly, so can Goalie)
Want to find the optimal probability mix that leaves your opponent(s) indifferent between their strategies to respond
In constant sum games (i.e. sports, board games, etc)
This principle is the same in non-constant sum games too!
Implies game is played repeatedly
Not always intuitive, but a simple principle
We want to find Kicker's optimal mixed strategy that leaves Goalie indifferent between his (pure) strategies
Suppose Kicker plays Kick Left with probability p
We want to find Kicker's optimal mixed strategy that leaves Goalie indifferent between his (pure) strategies
Suppose Kicker plays Kick Left with probability p
Goalie's expected payoff of playing Dive Left: 42p+7(1-p)
We want to find Kicker's optimal mixed strategy that leaves Goalie indifferent between his (pure) strategies
Suppose Kicker plays Kick Left with probability p
Goalie's expected payoff of playing Dive Left: 42p+7(1-p)
Goalie's expected payoff of playing Dive Right: 5p+30(1-p)
We want to find Kicker's optimal mixed strategy that leaves Goalie indifferent between his (pure) strategies
Suppose Kicker plays Kick Left with probability p
Goalie's expected payoff of playing Dive Left: 42p+7(1-p)
Goalie's expected payoff of playing Dive Right: 5p+30(1-p)
What value of p would make Goalie indifferent between Dive Left and Dive Right?
E[Left]=E[Right]E[42p+7(1−p)]=E[5p+30(1−p)]
E[Left]=E[Right]E[42p+7(1−p)]=E[5p+30(1−p)]
E[Left]=E[Right]E[42p+7(1−p)]=E[5p+30(1−p)]
p⋆=0.383
Kicker plays Kick Left with p=0.383 and Kick Right with 1−p=0.617
E[Left]=E[Right]E[42p+7(1−p)]=E[5p+30(1−p)]
p⋆=0.383
Kicker plays Kick Left with p=0.383 and Kick Right with 1−p=0.617
E[Left]=E[Right]E[42p+7(1−p)]=E[5p+30(1−p)]
p⋆=0.383
Kicker plays Kick Left with p=0.383 and Kick Right with 1−p=0.617
Goalie's expected payoff of Dive Left: 42(0.383)+7(0.617)≈20.41
Goalie's expected payoff of Dive Right: 5(0.383)+30(0.617)≈20.41
We want to find Goalie's optimal mixed strategy that leaves Kicker indifferent between his (pure) strategies
Suppose Goalie plays Dive Left with probability q
We want to find Goalie's optimal mixed strategy that leaves Kicker indifferent between his (pure) strategies
Suppose Goalie plays Dive Left with probability q
Kicker's expected payoff of playing Dive Left: 58q+95(1-q)
We want to find Goalie's optimal mixed strategy that leaves Kicker indifferent between his (pure) strategies
Suppose Goalie plays Dive Left with probability q
Kicker's expected payoff of playing Dive Left: 58q+95(1-q)
Kicker's expected payoff of playing Dive Right: 93q+70(1-q)
We want to find Goalie's optimal mixed strategy that leaves Kicker indifferent between his (pure) strategies
Suppose Goalie plays Dive Left with probability q
Kicker's expected payoff of playing Dive Left: 58q+95(1-q)
Kicker's expected payoff of playing Dive Right: 93q+70(1-q)
What value of p would make Kicker indifferent between Kick Left and Kick Right?
E[Left]=E[Right]E[58q+95(1−q)]=E[93q+70(1−q)]
E[Left]=E[Right]E[58q+95(1−q)]=E[93q+70(1−q)]
E[Left]=E[Right]E[58q+95(1−q)]=E[93q+70(1−q)]
q⋆=0.417
Goalie plays Dive Left with q=0.417 and Dive Right with 1−q=0.583
E[Left]=E[Right]E[58q+95(1−q)]=E[93q+70(1−q)]
q⋆=0.417
Goalie plays Dive Left with q=0.417 and Dive Right with 1−q=0.583
E[Left]=E[Right]E[58q+95(1−q)]=E[93q+70(1−q)]
q⋆=0.417
Goalie plays Dive Left with q=0.417 and Dive Right with 1−q=0.583
Kicker's expected payoff of Kick Left: 58(0.417)+95(0.583)≈79.57
Kicker's expected payoff of Kick Right: 93(0.417)+70(0.583)≈79.57
Goalie is indifferent between Dive Left and Dive Right when Kicker plays Kick Left with p=0.383
Kicker is indifferent between Kick Left and Kick Right when Goalie plays Dive Left with q=0.417
Goalie is indifferent between Dive Left and Dive Right when Kicker plays Kick Left with p=0.383
Kicker is indifferent between Kick Left and Kick Right when Goalie plays Dive Left with q=0.417
Mixed Strategy Nash Equilibrium (MSNE): (p, q) = (0.383, 0.417)
Goalie is indifferent between Dive Left and Dive Right when Kicker plays Kick Left with p=0.383
Kicker is indifferent between Kick Left and Kick Right when Goalie plays Dive Left with q=0.417
Mixed Strategy Nash Equilibrium (MSNE): (p, q) = (0.383, 0.417)
q = pr(Goalie dives Left)
Goalie's Best Response ={Rightif p<0.383Indifferentif p=0.383Leftif p>0.383
q = pr(Goalie dives Left)
Goalie's Best Response ={Rightif p<0.383Indifferentif p=0.383Leftif p>0.383
Kicker's Best Response ={Leftif q<0.417Indifferentif q=0.417Rightif q>0.417
q = pr(Goalie dives Left)
Goalie's Best Response ={Rightif p<0.383Indifferentif p=0.383Leftif p>0.383
Kicker's Best Response ={Leftif q<0.417Indifferentif q=0.4173Rightif q>0.417
Like any Nash equilibrium, players are playing mutual best responses to each other (probabilistically)
Goalie's Best Response ={Rightif p<0.383Indifferentif p=0.383Leftif p>0.383
A two player game with three strategies available to each
Graphically more difficult, but same principle to find MSNE
Game is symmetric, so only need to find one player's optimal mixed strategy
Define for Column:
Column must choose r,p that make Row indifferent between their strategies
List the expected payoffs to Row from Column's mix of r,p
Row's expected payoff must equal for all three strategies
2r+p−1=p−r
List the expected payoffs to Row from Column's mix of r,p
Row's expected payoff must equal for all three strategies
2r+p−1=p−r
The necessity of MSNE is easy to see for constant-sum games with no PSNE
But MSNE also exist for non-constant sum games, and for games with one or more PSNE
We know an assurance game has two PSNE
Let's solve for MSNE
Let p=pr(Harry goes to Whitaker)
Let q=pr(Sally goes to Whitaker)
Let p=pr(Harry goes to Whitaker)
Let q=pr(Sally goes to Whitaker)
Let p=pr(Harry goes to Whitaker)
Let q=pr(Sally goes to Whitaker)
p⋆=13
q⋆=13
MSNE: (p, q) = (13,13)
Calculate expected payoffs to Harry and Sally with (p, q) MSNE
Problem: MSNE is even worse than either PSNE in this game!
Sally's BR ={Starbucksif p<13Indifferentif p=13Whitakerif p>13
Harry's BR ={Starbucksif q<13Indifferentif q=13Whitakerif q>13
All intersections of best response functions are Nash equilibria
Interior solution: MSNE
Corner solutions: PSNE
Hawk-Dove/Chicken game: 2 PSNE
Let's solve for MSNE
Let p=pr(Row plays Hawk)
Let q=pr(Column plays Hawk)
Let p=pr(Row plays Hawk)
Let q=pr(Column plays Hawk)
Let p=pr(Row plays Hawk)
Let q=pr(Column plays Hawk)
p⋆=0.5
q⋆=0.5
MSNE: (p, q) = (0.5,0.5)
Calculate expected payoffs to Row and Column with (p, q) MSNE
Expected payoff in MSNE is:
Column's BR ={Hawkif p<0.5Indifferentif p=0.5Doveif p>0.5
Row's BR ={Hawkif q<0.5Indifferentif q=0.5Doveif q>0.5
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
o | Tile View: Overview of Slides |
Esc | Back to slideshow |
Oskar Morgenstern
1902—1977
“Sherlock Holmes, pursued by his opponent, Moriarty, leaves London for Dover. The train stops at a station on the way, and he alights there rather than travelling on to Dover. He has seen Moriarty at the railway station, recognizes that he is very clever and expects that Moriarty will take a faster special train in order to catch him in Dover. Holmes’s anticipation turns out to be correct. But what if Moriarty had been still more clever, had estimated Holmes’s mental abilities better and had foreseen his actions accordingly? Then, obviously, he would have travelled to the intermediate station [Canterbury]. Holmes again would have had to calculate that, and he himself would have decided to go on to Dover. Whereupon, Moriarty would again have ‘reacted’ differently,” (p.173-4).
Morgenstern, Oskar, 1935, “Perfect Foresight and Economic Equilibrium,”
“‘All that I have to say has already crossed your mind,’ said he. ‘Then possibly my answer has crossed yours,’ I replied. ‘You stand fast?’ ‘Absolutely.’”
— Arthur Conan Doyle, 1893, The Final Problem
E(X)=k∑i=1pixi
E(X)=k∑i=1pixi
E(X)=p1x1+p2x2+⋯+pkxk
A probability-weighted average of X, with each possible X value, xi, weighted by its associated probability pi
Also called the "mean" or "expectation" of X, always denoted either E(X) or μX
Example: Suppose you lend your friend $100 at 10% interest. If the loan is repaid, you receive $110. You estimate that your friend is 99% likely to repay, but there is a default risk of 1% where you get nothing. What is the expected value of repayment?
Pure strategy: is a complete strategy profile that a player will play
Mixed strategy is a probability distribution over a strategy profile
The logic of mixed strategies is best understood in the context of repeated constant-sum games
If you play one strategy repeatedly (i.e. a pure strategy), your opponent can exploit your (predictable) strategy with their best response
You want to “keep your opponent guessing”
We have already seen Nash equilibrium in pure strategies (PSNE)
Nash (1950) proved that any n-player game with a finite number of pure strategies has at least one equilibrium
Consider the following game between a Kicker and a Goalie during a penalty kick
A constant sum game (in this case, zero-sum)
Palacios-Huerta (2003) calculated average success rates in English, Spanish, & Italian leagues (1995-2000)
If both Kicker and Goalie choose same direction, Kicker's payoff is higher if he chooses his natural side (often Right)
Palacios-Huerta, Ignacio, 2003, “Professionals Play Minimax,” Review of Economic Studies 70(2): 395–415
What if Kicker were to randomize strategies
Let p be probability that Kicker plays Kick Left
Then Goalie wants to maximize his expected payoff, given Kicker plays Kick Left with p=0.50
If Goalie plays Dive Left: E[Dive Left]=42(p)+7(1−p)=42(0.50)+7(1−0.50)
Then Goalie wants to maximize his expected payoff, given Kicker plays Kick Left with p=0.50
If Goalie plays Dive Right: E[Dive Right]=5(p)+30(1−p)=5(0.50)+30(1−0.50)
Then Goalie wants to maximize his expected payoff, given Kicker plays Kick Left with p=0.50
Goalie will play Dive Left to maximize his expected payoff (24.5 ≻ 17.5)
Now consider Kicker's expected payoff under this mixed strategy
Since Goalie will Dive Left to maximize his expected payoff, Kicker can expect to earn:
58(p)+93(1−p)58(0.50)+93(1−0.50)75.5
In constant sum games, note that even in mixed strategies, one player increases their own (expected) payoff by pulling down the other player's (expected) payoff!
In this game, even expected payoffs always sum to 100
von Neumann & Morgenstern’s minimax theorem (simplified): in a 2-person, constant sum game, each player maximizes their own expected payoff by minimizing their opponent's expected payoff
The name “minimax” is a popular strategy in games, trying to minimize the risk of your maximum possible loss
Kicker's “randomizing” 50:50 (Kick Left, Kick Right) was not random enough!
Goalie recognizing this pattern can exploit it and hold down Kicker's expected payoff
Kicker can do better by picking a better p (and similarly, so can Goalie)
Want to find the optimal probability mix that leaves your opponent(s) indifferent between their strategies to respond
In constant sum games (i.e. sports, board games, etc)
This principle is the same in non-constant sum games too!
Implies game is played repeatedly
Not always intuitive, but a simple principle
We want to find Kicker's optimal mixed strategy that leaves Goalie indifferent between his (pure) strategies
Suppose Kicker plays Kick Left with probability p
We want to find Kicker's optimal mixed strategy that leaves Goalie indifferent between his (pure) strategies
Suppose Kicker plays Kick Left with probability p
Goalie's expected payoff of playing Dive Left: 42p+7(1-p)
We want to find Kicker's optimal mixed strategy that leaves Goalie indifferent between his (pure) strategies
Suppose Kicker plays Kick Left with probability p
Goalie's expected payoff of playing Dive Left: 42p+7(1-p)
Goalie's expected payoff of playing Dive Right: 5p+30(1-p)
We want to find Kicker's optimal mixed strategy that leaves Goalie indifferent between his (pure) strategies
Suppose Kicker plays Kick Left with probability p
Goalie's expected payoff of playing Dive Left: 42p+7(1-p)
Goalie's expected payoff of playing Dive Right: 5p+30(1-p)
What value of p would make Goalie indifferent between Dive Left and Dive Right?
E[Left]=E[Right]E[42p+7(1−p)]=E[5p+30(1−p)]
E[Left]=E[Right]E[42p+7(1−p)]=E[5p+30(1−p)]
E[Left]=E[Right]E[42p+7(1−p)]=E[5p+30(1−p)]
p⋆=0.383
Kicker plays Kick Left with p=0.383 and Kick Right with 1−p=0.617
E[Left]=E[Right]E[42p+7(1−p)]=E[5p+30(1−p)]
p⋆=0.383
Kicker plays Kick Left with p=0.383 and Kick Right with 1−p=0.617
E[Left]=E[Right]E[42p+7(1−p)]=E[5p+30(1−p)]
p⋆=0.383
Kicker plays Kick Left with p=0.383 and Kick Right with 1−p=0.617
Goalie's expected payoff of Dive Left: 42(0.383)+7(0.617)≈20.41
Goalie's expected payoff of Dive Right: 5(0.383)+30(0.617)≈20.41
We want to find Goalie's optimal mixed strategy that leaves Kicker indifferent between his (pure) strategies
Suppose Goalie plays Dive Left with probability q
We want to find Goalie's optimal mixed strategy that leaves Kicker indifferent between his (pure) strategies
Suppose Goalie plays Dive Left with probability q
Kicker's expected payoff of playing Dive Left: 58q+95(1-q)
We want to find Goalie's optimal mixed strategy that leaves Kicker indifferent between his (pure) strategies
Suppose Goalie plays Dive Left with probability q
Kicker's expected payoff of playing Dive Left: 58q+95(1-q)
Kicker's expected payoff of playing Dive Right: 93q+70(1-q)
We want to find Goalie's optimal mixed strategy that leaves Kicker indifferent between his (pure) strategies
Suppose Goalie plays Dive Left with probability q
Kicker's expected payoff of playing Dive Left: 58q+95(1-q)
Kicker's expected payoff of playing Dive Right: 93q+70(1-q)
What value of p would make Kicker indifferent between Kick Left and Kick Right?
E[Left]=E[Right]E[58q+95(1−q)]=E[93q+70(1−q)]
E[Left]=E[Right]E[58q+95(1−q)]=E[93q+70(1−q)]
E[Left]=E[Right]E[58q+95(1−q)]=E[93q+70(1−q)]
q⋆=0.417
Goalie plays Dive Left with q=0.417 and Dive Right with 1−q=0.583
E[Left]=E[Right]E[58q+95(1−q)]=E[93q+70(1−q)]
q⋆=0.417
Goalie plays Dive Left with q=0.417 and Dive Right with 1−q=0.583
E[Left]=E[Right]E[58q+95(1−q)]=E[93q+70(1−q)]
q⋆=0.417
Goalie plays Dive Left with q=0.417 and Dive Right with 1−q=0.583
Kicker's expected payoff of Kick Left: 58(0.417)+95(0.583)≈79.57
Kicker's expected payoff of Kick Right: 93(0.417)+70(0.583)≈79.57
Goalie is indifferent between Dive Left and Dive Right when Kicker plays Kick Left with p=0.383
Kicker is indifferent between Kick Left and Kick Right when Goalie plays Dive Left with q=0.417
Goalie is indifferent between Dive Left and Dive Right when Kicker plays Kick Left with p=0.383
Kicker is indifferent between Kick Left and Kick Right when Goalie plays Dive Left with q=0.417
Mixed Strategy Nash Equilibrium (MSNE): (p, q) = (0.383, 0.417)
Goalie is indifferent between Dive Left and Dive Right when Kicker plays Kick Left with p=0.383
Kicker is indifferent between Kick Left and Kick Right when Goalie plays Dive Left with q=0.417
Mixed Strategy Nash Equilibrium (MSNE): (p, q) = (0.383, 0.417)
q = pr(Goalie dives Left)
Goalie's Best Response ={Rightif p<0.383Indifferentif p=0.383Leftif p>0.383
q = pr(Goalie dives Left)
Goalie's Best Response ={Rightif p<0.383Indifferentif p=0.383Leftif p>0.383
Kicker's Best Response ={Leftif q<0.417Indifferentif q=0.417Rightif q>0.417
q = pr(Goalie dives Left)
Goalie's Best Response ={Rightif p<0.383Indifferentif p=0.383Leftif p>0.383
Kicker's Best Response ={Leftif q<0.417Indifferentif q=0.4173Rightif q>0.417
Like any Nash equilibrium, players are playing mutual best responses to each other (probabilistically)
Goalie's Best Response ={Rightif p<0.383Indifferentif p=0.383Leftif p>0.383
A two player game with three strategies available to each
Graphically more difficult, but same principle to find MSNE
Game is symmetric, so only need to find one player's optimal mixed strategy
Define for Column:
Column must choose r,p that make Row indifferent between their strategies
List the expected payoffs to Row from Column's mix of r,p
Row's expected payoff must equal for all three strategies
2r+p−1=p−r
List the expected payoffs to Row from Column's mix of r,p
Row's expected payoff must equal for all three strategies
2r+p−1=p−r
The necessity of MSNE is easy to see for constant-sum games with no PSNE
But MSNE also exist for non-constant sum games, and for games with one or more PSNE
We know an assurance game has two PSNE
Let's solve for MSNE
Let p=pr(Harry goes to Whitaker)
Let q=pr(Sally goes to Whitaker)
Let p=pr(Harry goes to Whitaker)
Let q=pr(Sally goes to Whitaker)
Let p=pr(Harry goes to Whitaker)
Let q=pr(Sally goes to Whitaker)
p⋆=13
q⋆=13
MSNE: (p, q) = (13,13)
Calculate expected payoffs to Harry and Sally with (p, q) MSNE
Problem: MSNE is even worse than either PSNE in this game!
Sally's BR ={Starbucksif p<13Indifferentif p=13Whitakerif p>13
Harry's BR ={Starbucksif q<13Indifferentif q=13Whitakerif q>13
All intersections of best response functions are Nash equilibria
Interior solution: MSNE
Corner solutions: PSNE
Hawk-Dove/Chicken game: 2 PSNE
Let's solve for MSNE
Let p=pr(Row plays Hawk)
Let q=pr(Column plays Hawk)
Let p=pr(Row plays Hawk)
Let q=pr(Column plays Hawk)
Let p=pr(Row plays Hawk)
Let q=pr(Column plays Hawk)
p⋆=0.5
q⋆=0.5
MSNE: (p, q) = (0.5,0.5)
Calculate expected payoffs to Row and Column with (p, q) MSNE
Expected payoff in MSNE is:
Column's BR ={Hawkif p<0.5Indifferentif p=0.5Doveif p>0.5
Row's BR ={Hawkif q<0.5Indifferentif q=0.5Doveif q>0.5