430 likes | 439 Views
Explore Bayesian decision theory, Nash equilibrium, and strategies in large worlds to make rational decisions. Learn innovative approaches with practical implications.
E N D
Making Decisions in Large Worlds Ken Binmorek.binmore@ucl.ac.uk
What is a small world? Bayesian decision theory applies only in a small world, where you can always: Look before you leap Leonard Savage, Foundations of Statistics
What is a small world? Bayesian decision theory applies only in a small world, where you can always: Look before you leap But… The look-before-you-leap principleis preposterous if carried to extremes. Leonard Savage, Foundations of Statistics
What is a large world? In a small world, you can always Look before you leap. In a large world, you must sometimes Cross that bridge when you come to it. Leonard Savage, Foundations of Statistics
John Harsanyi’s problem What is the rational solution of the Battle of the Sexes in a symmetric environment? The symmetric Nash equilibrium only yieldsthe players’ security levels. So why don’t they play their security strategies instead of their equilibrium strategies?
Analogous problems were solved by extending the set of pure strategies to the set of mixed strategies. Can we similarly resolve Harsanyi’s problem byextending the set of mixed strategies to a largerset of muddled strategies?
probability p 1 1 0 0 0 1 0 1 mixing box
Richard von Mises probability p 1 1 0 0 0 1 0 1 mixing box
no probability 1 1 0 0 0 1 0 1 muddling box?
No probability 1 1 0 0 0 1 0 1 upper probability p*(x) lower probability p*(x) muddling box?
Randomizing boxes When evaluating a muddling box x, I want only p*(x) and p*(x) to be relevant. What criterion makes this reasonable?
Randomizing boxes When evaluating a muddling box x, I want only p*(x) and p*(x) to be relevant. What criterion makes this reasonable? where the inf and sup are taken over all finite sets
Randomizing boxes When evaluating a muddling box x, I want only p*(x) and p*(x) to be relevant. What criterion makes this reasonable? A box is muddled if the inf and sup are achieved for all {n1, n2,… nk} where the inf and sup are taken over all finite sets
Upper and Lower Probability A (non-measurable) event E has a lower probability(inner measure) p* and an upper probability (outermeasure) p*. Let worst best G = gamble
Upper and Lower Probability A (non-measurable) event E has a lower probability(inner measure) p* and an upper probability (outermeasure) p*. Let worst best G = gamble How to evaluate G?
Upper and Lower Probability A non-measurable event E has a lower probability (inner measure) p* and an upper probability (outermeasure) p*. Let worst best G = only probabilisticinformation used u(G) = U(p*, p*)
John Milnor’s axioms fordecisions under complete ignorance state action b a a c consequence
compatible principle ofinsufficientreason maximincriterion Hurwiczcriterion minimaxregret criterion characterizingproperty ordering symmetry strong domination continuity linearity row adjunction column linearity column duplication convexity special row adjunction
compatible principle ofinsufficientreason maximincriterion Hurwiczcriterion minimaxregret criterion characterizingproperty ordering symmetry strong domination continuity linearity row adjunction column linearity column duplication convexity special row adjunction
compatible principle ofinsufficientreason maximincriterion Hurwiczcriterion minimaxregret criterion characterizingproperty ordering symmetry strong domination continuity linearity row adjunction column linearity column duplication U(p*,p*) = hp* + (1-h)p* convexity special row adjunction
Ellsberg paradox $1m $0m $0m J = $0m $1m $0m K = With the Hurwicz criterion: $0m $1m $1m u(J) = 1/3 u(K) = 2(1-h)/3 L = u(L) = 2/3 u(M) = h/3 + (1-h) $1m $0m $1m M = uncertainty (or ambiguity) aversion
compatible principle ofinsufficientreason maximincriterion Hurwiczcriterion minimaxregret criterion characterizingproperty ordering symmetry strong domination continuity linearity row adjunction column linearity column duplication convexity special row adjunction
Upper and Lower Probability With some mild extra assumptions, the product form U(p*,p*) = {p*}h{p*}1-h follows from retaining the multiplicative property of the probabilities of independent events:
ball box 0 1 box 0 2 0 2 ball 1 0 Battle of the Sexes
Nashequilibrium outcomes Eve’spayoff 1-q q (1,2) * 0 1 1-p 0 2 0 2 p 0 1 * (2,1) Battle of the Sexes * . (0,0) Adam’spayoff
Eve’spayoff 1-q q . (1,2) 0 1 1-p 0 2 0 2 p . 0 1 (2,1) Battle of the Sexes p=0 . (0,0) Adam’spayoff
Eve’spayoff 1-q q . (1,2) 0 1 1-p 0 2 0 2 p . 0 1 (2,1) p=1/6 Battle of the Sexes . (0,0) Adam’spayoff
Eve’spayoff 1-q q . (1,2) 0 1 1-p 0 2 0 2 p . 0 1 (2,1) p=1/3 Battle of the Sexes . (0,0) Adam’spayoff
Eve’spayoff 1-q q . (1,2) 0 1 1-p 0 2 0 2 p . 0 1 (2,1) p=1/2 Battle of the Sexes . (0,0) Adam’spayoff
Eve’spayoff 1-q q . (1,2) 0 1 1-p 0 2 0 2 p . 0 1 (2,1) p=2/3 Battle of the Sexes . (0,0) Adam’spayoff
Eve’spayoff 1-q q . (1,2) 0 1 1-p 0 2 0 2 p p=5/6 . 0 1 (2,1) Battle of the Sexes . (0,0) Adam’spayoff
Eve’spayoff 1-q q . (1,2) 0 1 1-p 0 2 0 2 p=1 p . 0 1 (2,1) Battle of the Sexes . (0,0) Adam’spayoff
Eve’spayoff 1-q q . (1,2) 0 1 1-p 0 2 0 2 p q=0 . 0 1 (2,1) Battle of the Sexes . (0,0) Adam’spayoff
Eve’spayoff 1-q q . (1,2) 0 1 1-p 0 2 0 2 p . 0 1 (2,1) q=1/3 Battle of the Sexes . (0,0) Adam’spayoff
Eve’spayoff 1-q q . (1,2) 0 1 1-p 0 2 0 2 p . 0 1 (2,1) q=1/2 Battle of the Sexes . (0,0) Adam’spayoff
Eve’spayoff 1-q q . (1,2) 0 1 1-p 0 2 0 2 p . 0 1 (2,1) q=2/3 Battle of the Sexes . q=1/3 (0,0) Adam’spayoff
Eve’spayoff 1-q q . (1,2) 0 1 1-p 0 2 0 2 p . 0 1 (2,1) q=5/6 Battle of the Sexes . (0,0) Adam’spayoff
Eve’spayoff 1-q q . (1,2) 0 1 1-p 0 2 0 2 p . 0 1 (2,1) Battle of the Sexes q=1 . (0,0) Adam’spayoff
Nashequilibriumoutcomes Eve’spayoff (1,2) q 1-q * 0 1 1-p 0 2 0 2 p 0 1 * Battle of the Sexes * (2,1) . (2/3,2/3) (0,0) Adam’spayoff (3/4,3/4)
If is close enough to and U(p,P)=phP1-h we can find a symmetric Nash equilibrium in muddled strategies that pays each player more than the securitylevel of 2/3. In the case whenh=1/2, each player uses amuddling box with
If is close enough to and U(p,P)=phP1-h we can find a symmetric Nash equilibrium in muddled strategies that pays each player more than the securitylevel of 2/3. In the case whenh=1/2, each player uses amuddling box with The corresponding payoffs to the players exceed the maximum symmetric payoff of 3/4 available if only mixed strategies are used.
Eve’spayoff 1-q q . (1,2) 0 1 1-p 0 2 0 2 p . 0 1 (2,1) * Battle of the Sexes . (0,0) Adam’spayoff