180 likes | 319 Views
Price of Anarchy in Games of Incomplete Information. Tim Roughgarden. Alon Ardenboim. Full Information Games. The players payoffs are common knowledge.
E N D
Price of Anarchy in Games of Incomplete Information Tim Roughgarden AlonArdenboim
Full Information Games • The players payoffs are common knowledge. • Pure (Mixed) Nash equilibrium – each players maximizes his utility (in expectation) when sticking with his current (probabilistic) strategy.
Price of Anarchy • Choose a goal function (e.g. welfare maximization). • How bad can an equilibrium be w.r.t. the optimal outcome (e.g. maximum welfare)? .
Incomplete Information Games • Players are uncertain about each other payoffs. • For example, auctions (eBay), VCG mechanisms. • Assume players’ private preferences are drawn independently from prior distributions. • Distributions ARE common knowledge.
Bayes-Nash Equilibrium • Type space . • Action space . • sampled from . is common knowledge. • A strategy is a function from type space to a distribution over actions . • A strategy profile is a Bayes-Nash equilibrium if for every , type and action ,
Bayes-Nash PoA • The corresponding PoA of such a games measures how bad is the worst Bayes-Nash equilibrium w.r.t the optimal value. • That is, • When is a product dist., this is iPoA (independent). • Otherwise, we talk about cPoA (correlated).
Smooth Full Information Games • Def: A game is -smooth w.r.toutcome and a maximization objective function if for every , • W is payoff-dominating if it bounds the sum of players’ payoffs from above (non-negative transfers). • Thm: if a game is -smooth w.r.t. an optimal outcome for a payoff-dominating then PoA. • Let be a Nash Eq., we have:
Smooth Incomplete Information Games • Def: Let be a game structure and a maximization objective function. The structure is -smooth w.r.t. social choice function if for every and feasible to , we have • Thm: If a game structure is -smooth w.r.t. an optimal choice function for a payoff-dominating , then the iPoAof the game w.r.t. .
Proof of Theorem • Let be an optimal choice function (that is, if every player plays we get ). • Let be a Bayes-Nash equilibrium. • In strategy player samples and plays .
Proof Cont. • We have: (Payoff dominant) (Lin. of Exp.) (Equilibrium) Bayes-Nash (Def.) (Lin. of Exp.) (Smooth) OPT
Application to GSP • In the Generalized Second Prize (GSP) auction there are ad slots in a web page. Each with an associated click-through rate. • Each bidder has a private information – valuation per click . • No player overbids (feasible space of bids is ). • Assume .
GSP (cont.) • Assume player gets bids the highest bid. • Allocation: assign the slot with CTR . • Payment: Charge player the highest bid. • Payoff: if . otherwise (if bid is feasible).
Smoothness of GSP • Thm: The GSP is a -smooth game (and therefore the iPoA is ) w.r.t. welfare maximization goal function. • Proof: • Consider welfare maximization (payoff dominant). • Let’s take the social choice function (). • Easy to see it’s optimal. • Fix a type vector of players valuations and an outcome (arbitrary bids). Assume . • Let denote the index of the highest bidder.
Smoothness proof (cont.) • Claim: for every . • : …
Smoothness proof (cont.) • Claim: for every . • : …
Smoothness proof (cont.) • Claim: for every . • : …
Smoothness proof (cont.) • Summing over all players we get:
Directions • Application to other games. • Other smoothness variants. • What to do with correlated type distributions? • Is there a relation between cPoA and sPoA?