Zhu Han, Dusit Niyato, Walid Saad, and Tamer Basar

Game Theory in Wireless and Communication Networks: Theory, Models, and ApplicationsLecture 11Game with Bounded Rationality Zhu Han, Dusit Niyato, Walid Saad, and Tamer Basar

Overview of Lecture Notes • Introduction to Game Theory: Lecture 1, book 1 • Non-cooperative Games: Lecture 1, Chapter 3, book 1 • Bayesian Games: Lecture 2, Chapter 4, book 1 • Differential Games: Lecture 3, Chapter 5, book 1 • Evolutionary Games: Lecture 4, Chapter 6, book 1 • Cooperative Games: Lecture 5, Chapter 7, book 1 • Auction Theory: Lecture 6, Chapter 8, book 1 • Matching Game: Lecture 7, Chapter 2, book 2 • Contract Theory, Lecture 8, Chapter 3, book 2 • Learning in Game, Lecture 9, Chapter 6, book 2 • Stochastic Game, Lecture 10, Chapter 4, book 2 • Game with Bounded Rationality, Lecture 11, Chapter 5, book 2 • Equilibrium Programming with Equilibrium Constraint, Lecture 12, Chapter 7, book 2 • Zero Determinant Strategy, Lecture 13, Chapter 8, book 2 • Mean Field Game, Lecture 14, book 2 • Network Economy, Lecture 15, book 2

Overview • All players are assumed to be fully rational, which might not true with human and low capable devices. • Outline • Introduce the concept of bounded rationality and its implication on game theory. • develop into the fundamental details of one of the most important frameworks that can capture bounded rationality, prospect theory, which essentially deals with subjective perceptions. • conclude by shedding some light on other related notions of bounded rationality. • An example

games with bounded rationality • In a behavioral game, it is customary to study how certain real-world limits, such as the bounds on the cognitive abilities of a human, can impact the rational tenets of game theory and alter its predicted equilibrium outcomes. • For Wireless CPS • the natural, cognitive bounds on the rationality of human players (users, administrators, hackers, etc.) that interact with the wireless or cyber-physical system, • the limited resources and computational capabilities of certain devices (e.g., IoT sensors) that prevent them from following the conventional, rational path of game theory.

Example: Allais’s Paradox Fear of disappointment. Risk averse. High Probability • 95% chance to win $10,000 • 100% chance to win $9,499 Hope to avoid loss. Risk seeking 0.95*10000>9499 Low Probability Gain High Probability • 5% chance to win $10,000 • 100% chance to win $501 Lose • 95% chance to lose $10,000 • 100% chance to lose $9,499 0.05*10000<501 0.95*10000>9499 Hope of large gain. Risk seeking Low Probability • 5% chance to lose $10,000 • 100% chance to lose $501 Fear of large loss. Risk averse 0.05*10000<501

Example: evolutionary game • limit on the informational gathering capabilities of the players. • due to informational asymmetry and uncertainty, • instead of acting as typical optimizers, the players will simply copy strategies from others, based on observable payoffs. • In essence, evolutionary games take an “evolutionary biology" approach to optimization, through mutations and natural selection

Prospect Theory • Nobel-prize winning • PT provides one with mathematical tools to understand how decision making, in real life, can deviate from the tenets of expected utility theory (EUT), a conventional game-theoretic notion which is guided strictly by objective notions of gains and losses, player rationality, conformity to pre-determined decision making rules that are unaffected by real-life perceptions of benefits and risk. • people cannot be counted on to always choose optimally among alternatives if merely stating the alternatives differently influences their choices • Solution: use prospect-theoretic notions to refine existing game-theoretic mechanisms

Prospect Theory • Expected utility theory (EUT) cannot explain the deviations due to end-user subjectivity • Prospect theory (PT) [Kahneman,Tversky’79] as a Nobel prize winning theory explains the deviations in monetary transactions: • People usually over-weigh low probability outcomes and underweigh their outcomes with high probabilities • Loss looms larger than gains • Prospect theory has recently been applied in many contexts: • Social sciences [Gao’10] [Harrison’09] [Tanaka’16] • Communication networks [Li’14] [Yu’14] [Yang’14] [Lee’15] • Smart energy management [Wang’14] [Xiao’14]

Fundamentals Tenets of Prospect Theory: Weighting and Framing

weighting function: Prelec function

－α=1 －α=0.5 Probability Weighting Functions • Probability weighting function models the subjectivity of a player • Subjective probability for a player to weigh the outcome with a probability p • S-shaped and asymmetrical, ranging in [0,1] • Objective weight decreases with the player’s subjective evaluation distortion • Prelec function [Prelec’98]:

Subjective Perceptions of Utility Functions The Framing Effect • utility framing or reference points: • each player frames its gains or losses with respect to a possibly different reference point • Example: save $10 per month for bill for rich and poor people • subjective value functions - concave in gains, convex in losses - over the possible outcomes. Rk is a raw utility reference point and vk is a so-called value function that transforms the raw utility of the player into a PT utility that is framed with respect to a reference point

Framing Effect Illustration of the prospect-theoretic framing effect: how objective utilities are viewed subjectively by human participants. The utility function value changes depending on a certain reference point that highlights the individual perceptions of gains and losses.

Impact of PT on Game Analysis • properties may no longer hold under PT • requiring new analysis • existence of an equilibrium may be jeopardized by PT effects • overall operation of optimization mechanisms can be affected • traditional equilibrium analysis might not work • no general rules for analyzing games under PT exist • some application-specific approaches exit

Related Work • PT-based channel access game between two subjective end-users in wireless network [Li’12] • Wireless operator invests spectrum to users under uncertain spectrum supply using PT [Yu’14] • PT-based random access game between two users choosing the transmission probabilities on a radio channel [Li’14] • Stackelberg game between the SP offering the bandwidth and subjective end-users to choose services [Yang’14] • PT-based microgrids energy trading game in smart grids [Xiao’15]

Other Notions of Bounded Rationality • Satisficing: describe a procedure for constructing expectations of how good a potential game solution might reasonably be achieved. Once such a reasonably good solution is achieved, players may stop their search for improvements • Satisficing equilibrium: • defined as a state at which each player has already reached a “target" utility level or cannot achieve this target by unilaterally changing its action, given the other players actions across all games. • a refinement of the idea of a Nash equilibrium, that takes into account the satisficing behavior of the players: instead of seeking a best response, the players will seek a target outcome that satisfies their requirements.

quantal best response • players become more likely to make errors as those errors become less costly, a notion that is often known as cost-proportional errors. • instead of using a conventional best response, players will react quantally, rather than via strict maximization

quantal response equilibrium (QRE) • generalized the Nash equilibrium to the case in which no player can unilaterally improve its utility by using a quantal (rather than a fully rational) best response • A QRE is guaranteed to exist for any normal form game and non-negative precision parameter • QRE points are not guaranteed to be unique • human players will play games differently depending on the magnitudes of the payoffs involved

Summary • PT provides a rigorous framework for incorporating bounded rationality in game theory. • PT focuses on how game-theoretic decision making is affected by risk and uncertainty, through the use of the weighting and framing effects. • The incorporation of PT effects will, however, lead to non-linearities in the utility functions, hence requiring new analysis for the equilibrium or other outcomes of the game. • Other types: satisfaction equilibrium and quantal • Daniel Kahneman Nobel 2002, "for having integrated insights from psychological research into economic science, especially concerning human judgment and decision-making under uncertainty"

Zhu Han, Dusit Niyato, Walid Saad, and Tamer Basar

Zhu Han, Dusit Niyato, Walid Saad, and Tamer Basar

Presentation Transcript

Zhu Han, Dusit Niyato, Walid Saad, Tamer Basar, and Are Hjorungnes

Saad Nauman

Dusit Niyato 1 and Ekram Hossain 2

Marguerite Tamer

Zhu Zhu Pet Fashion Show

Zhu Han, Dusit Niyato, Walid Saad, Tamer Basar, and Are Hjorungnes

Xuding Zhu

Zhu Han, Dusit Niyato, Walid Saad, Tamer Basar, and Are Hjorungnes

Presented by: Jia (Jasmine) Meng Advisor: Dr. Zhu Han

Zhu Han, Dusit Niyato, Walid Saad, Tamer Basar, and Are Hjorungnes

Zhu Han, Dusit Niyato, Walid Saad, Tamer Basar, and Are Hjorungnes

Zhu Han, Dusit Niyato, Walid Saad, Tamer Basar, and Are Hjorungnes

Zhu Han, Dusit Niyato, Walid Saad, Tamer Basar, and Are Hjorungnes

Prof/ Walid ELshazly