300 likes | 311 Views
Explore the foundational principles and modeling framework for decision analysis without explicit consequences. Learn about Bayesianism, Savage's SEU theory, and the essential axioms that guide decision-making processes.
E N D
Decision Analysis Without Consequences,Bayesianism Without Probabilities Robert Nau Fuqua School of Business Duke University Presentation for RUD 2005 June 2005
Savage’s SEU theory: the foundation of “Bayesian” inference & decision analysis Key primitives: • States of the world • Consequences Key Axioms: • Complete order (P1) • Independence (P2) • Values can be disentangled from beliefs (P3) • Beliefs can be revealed by preferences (P4)
What is “Bayesianism”? • Does it mean having beliefs that are represented by unique probabilities (i.e., does it require Savage’s P3 and P4)? • Does it mean that individuals can observe each other’s “true” probabilities, or that they should (at least some times) agree on them? • Or, does Bayesianism just mean having preferences that satisfy independence (P2), thus having an additive representation that supports conditionalization?
What are “consequences”? • A natural consequence is the result of a feasible act of the decision maker and a possible state of the world • Thus, acts logically precede consequences • Savage does it the other way around: consequences are psychologically primitive “states of the person”, and acts are arbitrary (often counterfactual) mappings of states-of-the-world to states-of-the-person
What’s wrong with primitive consequences? • States of the person are endogenous and unobservable—not a good foundation for descriptive and/or interactive decision theory • Acts in which states of the world are mapped to counterfactual states of the person are purely imaginary—preferences can’t be validated by choices • Preferences among acts yielding “constant” consequences still do not reveal “true” probabilities (e.g., utilities could have state-dependent scale factors)
This paper: • Axiomatizes a simple additive representation of preferences without explicit consequences • The only primitives are discrete “moves” of two players—the DM and nature—and monetary gambles • Implicitly every combination of a move of the DM, a move of nature, and a monetary payoff, is treated as a distinct consequence • Doesn’t yield “true” probabilities—but doesn’t need to! • Provides a sufficient foundation for Bayesian methods of decision analysis and game theory based on no-arbitrage and risk neutral probabilities (à la de Finetti)
Modeling framework • Let A = {a1, …, aI} denote a finite set of strategies (alternative courses of action) • Let S = {s1, …, sJ} denote a finite set of states. Non-empty subsets of S are events. • Let w = {w1, …, wJ} denote an observable wealth distribution over states (e.g., a side-gamble) • An act is a strategy-wealth pair (a, w) • An outcome is a triple (a, s, w) This is essentially the same state-preference framework used in Nau’s (Manag. Sci. 2003) model of generalized risk aversion, augmented by a finite set of strategies
States Acts Each quantity of money received in a given state in conjuction with a given strategy is effectively treated as a distinct consequence
Properties of the decision maker’s preference relation over acts • Axiom 1: (Ordering) is a weak order, and for every strategy the restricted preference relation over wealth distributions is a continuous weak order. • Axiom 2: (Strict monotonicity) w*j > wj (a, w*jw-j) (a, w) for all a,j, w, w*j. (More money preferred to less, and no null states)
Axiom 3: (Strategic generalized triple cancellation) For every pair of strategies a and b, wealth distributions w, x,y, z, state j, and constants wj*, xj*: If (a, w) (b, x) and (a, w*jw-j) (b, x*jx-j) and (a, wjy-j) (b, xj,z-j) then (a, w*jy-j) (b, x*jz-j). In words: if changing wjto wj* is no worse than changing xj to xj* when the background is a comparison of (a, w) vs. (b, x) , then changing wj to wj* cannot be strictly worse than changing xjto xj* when the background comparison is (a, wjy-j) vs. (b, xjz-j).
Definition: Strategies a and boverlap if there exist w and x in the interior of such that (a, w) ~ (b, x). There is a set of overlapping relationships that covers all the strategies if for any two strategies a and b, there exists a sequence {a, i1, i2, …, ik, b} such that a overlaps i1, i1 overlaps i2, etc., up to ik overlaps b. • Axiom 4: (Overlapping strategies) There is a set of overlapping relationships that covers all the strategies.
Main result: general additive model Theorem 1: Axioms 1-4 hold iff is represented by a utility function U having the strategy-dependent additive-across-states form: U(a, w) = , in which {uaj} are strictly increasing, continuous utility functions for money that are unique up to joint transformations of the form uaj + j. Note: preferences among strategies without any side-gambles are represented by {j uaj(0)}.
Special cases of the general additive model • SEU with constant linear utility for $ uaj(wj) = pj(vaj+wj) • SEU with state-dependent linear utility for $uaj(wj) = pj(vaj+jwj) • SEU with state-indep. utility and prior stakesuaj(wj) = pju(vaj+wj+Wj) • General state-dependent SEU uaj(wj) = pjvaj(wj)
What SEU preference parameters are “globally” observable (if entire preference order is known)? • The DM’s “true” subjective probabilities for all events are observable only in special case 1. • True probabilities are distorted by state-dependence and/or effects of prior stakes in cases 2 and 3 • True probabilities are theoretically irrelevant in special case 4 (hopelessly entangled with utilities)! • But… true probabilities are not needed for Bayesian decision analysis (or game theory)
Claim: decision analysis can be carried out in terms of “local” observations • The entire preference order does not have to be observed—just a finite number of acceptable small gambles in conjunction with available acts • Beliefs and values turn out to be only partly separable (i.e., “true” probabilities and utilities are not observed), but this doesn’t matter.
Local observables I: risk neutral probabilities • The decision maker’s risk neutral (betting) probabilities are the normalized derivatives of the state-dependent utility functions for money: • Note that they are, in general, act-dependent • In the event that (a, w) is chosen, for whatever reason, z is an acceptable gamble if z ·(a, w) 0
Local observables II: “value gambles” • In the event act (a,w) is chosen when (b, x) is available, the DM should accept a gamble (a,w; b, x) whose state-by-state increments of cardinal utility are proportional to the state-by-state differences in utility between (a,w) and (b, x), evaluated from the perspective of the local marginal utilities for money that apply under the chosen act (a, w). This implies acceptability of the “value gamble” (a,w; b, x) determined by:
Decision analysis in terms of acceptable gambles • Note: (a,w)(a,w; b, x) U(a, w) U(b, x), • Hence the direction of preference between any two acts can be determined from observations of acceptable gambles. • This suffices to determine the optimal decision, even though “true” probabilities have not been observed (and may not exist…)
Prelude to Bayesian updating: conditional preference Let the decision maker be endowed with a conditional preference relationE for every event E that reflects the updating of preferences with the receipt of new information. • Axiom 4*: (Conditionally overlapping strategies) For any two events E and F, each consisting of 2 or more states, there is a set of overlapping relationships under E that covers all the strategies and which is identical to a set of overlapping relationships under F that covers all the strategies.
Conditional preference axioms • Axiom 5: (Independence of irrelevant payoffs) (a, w) E(b, x) (a, wE w*‑E) E (b, xE x*‑E) for all w*‑E and x*‑E. • Axiom 6: (Aggregation/sure thing principle) For disjoint events E and F, (a, w) E (b, x) and (a, w) F (b, x) (a, w) EF (b, x)
Theorem 2: Given axioms 1-2-3-4*-5-6,for any event E consisting of 2 or more states, the conditional preference relation Eis represented by UE(a, w) = where{uij} are the same state- and strategy-dependent utility functions that represent .
Hence, updating of preferences is accomplished simply by dropping ruled-out states from the utility summation • This is consistent with the usual Bayesian updating procedure, even though probabilities are not separated from utilities.
Preferences conditional on {1, 2} These states are dropped from the utility summation
Preferences conditional on {1, 2} These states are dropped from the utility summation, and probabilities are usually renormalized (but they needn’t be)
Special case: experimental information • In a scientific experiment, the DM typically has no intrinsic in the experimental result given the hypothesis, in which case… • The risk-neutral likelihood function is act-independent (reflective of “true” beliefs), while the risk-neutral prior is (as usual) contaminated by state-dependent utility, prior stakes, etc. • Payoffs of value gambles depend only on the hypothesis, not the experimental result, and hence can be written in “reduced form”
Special case: experimental information • Upon receipt of information, the risk neutral prior (for each available act) is updated via Bayes’ rule, by applying the common likelihood function. • Preferences among acts are then updated by applying the risk neutral posteriors to the reduced-form value gambles. • Hence, Bayesian methods do not actually require knowledge of the decision maker’s “true” prior probabilities!
Extensions/connections • Risk aversion under the general additive model can be modeled via the methods in Nau 2003 (Management Science), though additivity rules out uncertainty aversion. • The general additive preference model is also sufficent for “Bayesian” modeling of noncooperative games, where it leads to species of correlated equilibrium as the natural solution concepts (Nau and McCardle 1990 JET, Nau 1992 Management Science, Nau 1995 w.p.)