Compressing Mental Model Spaces and Modeling Human Strategic Intent
Prashant Doshi, University of Georgia, USA
http://thinc.cs.uga.edu
Yifeng Zeng, Reader, Teesside Univ. (previously Assoc. Prof., Aalborg Univ.)
Yingke Chen, doctoral student
Muthu Chandrasekaran, doctoral student
Hua Mao, doctoral student
How large is the behavioral model space?
• General definition: a mapping from the agent’s history of observations to its actions
• The space of such mappings, Δ(Aj)^H, is uncountably infinite
• Let’s assume computable models: the space becomes countable, but a very large portion of the model space is not computable!
Daniel Dennett, philosopher and cognitive scientist. Intentional stance: ascribe beliefs, preferences and intent to explain others’ actions (analogous to theory of mind, ToM)
Organize the mental models:
• Intentional models, e.g., POMDP = ⟨bj, Aj, Tj, Ωj, Oj, Rj, OCj⟩; BDI, ToM. Intentional modeling may give rise to recursive modeling.
• Subintentional models, e.g., Δ(Aj), a finite state controller, a plan
Each model has a frame.
Growth in the model space: the other agent may receive any one of |Ωj| observations, so the candidate model set grows as
|Mj| → |Mj||Ωj| → |Mj||Ωj|² → ... → |Mj||Ωj|^t
at steps 0, 1, 2, ..., t. The growth is exponential.
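To make the growth concrete, here is a minimal sketch (the function and variable names are ours, not from the talk) that tallies the candidate-model count |Mj|·|Ωj|^t:

```python
def reachable_models(m0: int, num_obs: int, t: int) -> int:
    """Number of candidate models after t steps when each of the other
    agent's |Omega_j| possible observations spawns an updated model:
    |M_j| * |Omega_j|**t."""
    return m0 * num_obs ** t

# With 10 initial models and 6 observations, growth is exponential:
counts = [reachable_models(10, 6, t) for t in range(4)]
# t = 0, 1, 2, 3  ->  10, 60, 360, 2160
```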
ACC (absolute continuity condition): the subjective distribution over histories must be absolutely continuous with respect to the true distribution over histories
ACC is a sufficient and necessary condition for Bayesian update of belief over models
How do we satisfy ACC?
• Cautious beliefs (full-support prior)
• Grain of truth assumption
A prior with a grain of truth is sufficient but not necessary.
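To illustrate why ACC matters for the Bayesian update, here is a hedged sketch (the models, their action likelihoods, and the helper name are hypothetical): if every model in the support assigns zero probability to the observed behavior, the update has no normalizer, which is exactly what a cautious, full-support prior guards against.

```python
def update_belief(prior, likelihoods, action):
    """One Bayesian update of the belief over candidate models m, given
    the other agent's observed action:  b'(m) ∝ b(m) * P(action | m).
    `likelihoods` maps model -> {action: probability}."""
    unnorm = {m: prior[m] * likelihoods[m].get(action, 0.0) for m in prior}
    z = sum(unnorm.values())
    if z == 0.0:
        # No model in the support explains the observation: ACC is violated
        raise ValueError("observed action impossible under every model")
    return {m: w / z for m, w in unnorm.items()}

# Cautious prior over two hypothetical models of agent j
prior = {"m1": 0.5, "m2": 0.5}
likelihoods = {"m1": {"L": 0.9, "R": 0.1}, "m2": {"L": 0.2, "R": 0.8}}
posterior = update_belief(prior, likelihoods, "L")
# m1 gains weight: 0.45 / (0.45 + 0.10) ≈ 0.818
```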
General model space is large and grows exponentially as the interaction progresses
It would be great if we could compress this space!
• Lossless: no loss in value to the modeler
• Lossy: flexible loss in value in exchange for greater compression
Model space compression is broadly useful in many areas:
• Sequential decision making (dt-planning) in multiagent settings
• Bayesian plan recognition
• Games of imperfect information
1. Sequential decision making in multiagent settings: the interactive POMDP framework (Gmytrasiewicz & Doshi 05) includes models of the other agent in the state space and updates beliefs over both the physical state and the models
General and domain-independent approach for compression: establish equivalence relations that partition the model space, and retain representative models from each equivalence class
Approach #1: Behavioral equivalence (Rathanasabapathy et al. 06, Pynadath & Marsella 07). Intentional models whose complete solutions are identical are considered equivalent.
Approach #1: Behavioral equivalence Behaviorally minimal set of models
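The behaviorally minimal set can be sketched as grouping models by their complete solutions and keeping one representative per class. This is an illustrative sketch only: `solve` stands in for a hypothetical model solver, and policies are encoded as hashable values.

```python
def behaviorally_minimal(models, solve):
    """Partition models by their complete solutions and keep one
    representative per behavioral-equivalence class.  `solve` is a
    hypothetical solver returning a hashable policy encoding."""
    reps = {}
    for m in models:
        policy = solve(m)           # identical solutions -> equivalent
        reps.setdefault(policy, m)  # first model found becomes the representative
    return list(reps.values())

# Three models, two of which solve to the same policy
solutions = {"m1": "listen->OL", "m2": "listen->OL", "m3": "listen->OR"}
minimal = behaviorally_minimal(["m1", "m2", "m3"], solutions.get)
# -> ["m1", "m3"]
```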
Approach #1: Behavioral equivalence. Lossless. Works even when intentional models have differing frames.
Approach #1: Behavioral equivalence. Impact on dt-planning in multiagent settings (results on the multiagent tiger and multiagent MM problems).
Approach #1: Behavioral equivalence. Utilize model solutions (policy trees) to mitigate model growth. Model representatives that are not BE may become BE from the next step onwards; preemptively identify such models and avoid updating all of them separately.
Approach #2: ε-Behavioral equivalence (Zeng et al. 11, 12). Redefine BE.
Approach #2: Revisit BE (Zeng et al. 11, 12). Intentional models whose partial depth-d solutions are identical, and whose vectors of updated beliefs at the leaves of the partial trees are identical, are considered equivalent. Lossless if frames are identical; sufficient but not necessary.
Approach #2: (ε, d)-Behavioral equivalence. Two models are (ε, d)-BE if their partial depth-d solutions are identical and the vectors of updated beliefs at the leaves of the partial trees differ by at most ε. (The models in the slide’s example are (0.33, 1)-BE.) Lossy.
Approach #2: ε-Behavioral equivalence.
• Lemma (Boyen & Koller 98): the KL divergence between two distributions in a discrete Markov stochastic process reduces or remains the same after a transition, with the mixing rate acting as a discount factor
• The mixing rate represents the minimal amount by which the posterior distributions agree with each other after one transition
• It is a property of the problem and may be pre-computed
Approach #2: ε-Behavioral equivalence. Given the mixing rate and a bound ε on the divergence between two belief vectors, the lemma allows computing the depth d at which the bound is reached. Compare two solutions up to depth d for equality.
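Reading the lemma as a per-step contraction of the KL divergence by the mixing rate F, the depth d for a bound ε can be computed as sketched below. This is our interpretation of the slide: the names, the initial divergence D0, and the exact contraction form (1 − F)^d · D0 ≤ ε are assumptions.

```python
import math

def depth_for_bound(d0: float, eps: float, mixing_rate: float) -> int:
    """Smallest depth d with (1 - F)**d * D0 <= eps, treating the
    Boyen-Koller mixing rate F as a per-step discount on the KL
    divergence (our reading of the lemma on the slide)."""
    if eps >= d0:
        return 0
    if mixing_rate >= 1.0:   # divergence vanishes after one step
        return 1
    if mixing_rate <= 0.0:   # no contraction: fall back to the horizon
        raise ValueError("F = 0: set d to the planning horizon instead")
    return math.ceil(math.log(eps / d0) / math.log(1.0 - mixing_rate))

# With F = 0.5, an initial divergence of 1.0 drops below 0.1 by depth 4
# (0.5**4 = 0.0625 <= 0.1, while 0.5**3 = 0.125 > 0.1)
```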
Approach #2: ε-Behavioral equivalence. Impact on dt-planning in multiagent settings (multiagent concert problem; discount factor F = 0.5). On a UAV reconnaissance problem in a 5x5 grid, the approach allows the solution to scale to a 10-step lookahead in 20 minutes.
Approach #2: ε-Behavioral equivalence. What is the value of d when a problem exhibits an F of 0 or 1?
• F = 1 implies that the KL divergence is 0 after one step: set d = 1
• F = 0 implies that the KL divergence does not reduce: arbitrarily set d to the horizon
Approach #3: Action equivalence (Zeng et al. 09, 12). Intentional or subintentional models whose predictions (action distributions) at time step t are identical are considered equivalent at t.
Approach #3: Action equivalence. Lossy. Works even when intentional models have differing frames.
Approach #3: Action equivalence. Impact on dt-planning in multiagent settings: AE bounds the model space at each time step to the number of distinct actions (results on the multiagent tiger problem).
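Action equivalence can be sketched as bucketing models by their current action distribution, so the number of classes is bounded by the number of distinct distributions. This is an illustrative sketch: `predict` and the model names are hypothetical.

```python
from collections import defaultdict

def action_equivalence_classes(models, predict):
    """Group models by their predicted action distribution at the
    current step.  `predict` is a hypothetical function returning a
    tuple of action probabilities; identical tuples -> equivalent."""
    classes = defaultdict(list)
    for m in models:
        classes[predict(m)].append(m)
    return list(classes.values())

# Four models collapse into two classes, one per distinct distribution
preds = {"m1": (1.0, 0.0), "m2": (1.0, 0.0), "m3": (0.5, 0.5), "m4": (0.5, 0.5)}
groups = action_equivalence_classes(["m1", "m2", "m3", "m4"], preds.get)
# -> [["m1", "m2"], ["m3", "m4"]]
```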
Approach #4: Influence equivalence (related to Witwicki & Durfee 11). Intentional or subintentional models whose predictions at time step t influence the subject agent’s plan identically are considered equivalent at t. Example: regardless of whether the other agent opened the left or the right door, the tiger resets, thereby affecting the agent’s plan identically.
Approach #4: Influence equivalence. Influence may be measured as the change in the subject agent’s belief due to the action. Groups more models at time step t than AE. Lossy.
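One way to operationalize influence equivalence, as a hedged sketch, is to group models whose predicted actions induce nearly the same change in the subject agent’s belief. The L1 distance, the tolerance, and all names below are our choices, not the talk’s.

```python
def influence_classes(models, belief_change, tol=1e-6):
    """Greedily group models whose induced change in the subject
    agent's belief (compared here by L1 distance between belief
    vectors, our choice) agrees within `tol`."""
    classes = []
    for m in models:
        delta = belief_change(m)
        for rep_delta, members in classes:
            if sum(abs(a - b) for a, b in zip(delta, rep_delta)) <= tol:
                members.append(m)
                break
        else:
            classes.append((delta, [m]))
    return [members for _, members in classes]

# Opening either door resets the tiger: both induce the same belief change
changes = {"open_left": (0.5, 0.5), "open_right": (0.5, 0.5), "listen": (0.85, 0.15)}
groups = influence_classes(changes, changes.get)
# -> [["open_left", "open_right"], ["listen"]]
```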
Compression due to approximate equivalence may violate ACC. Regain ACC by appending a covering model to the compressed set of representatives.
N > 2 agents Under what conditions could equivalent models belonging to different agents be grouped together into an equivalence class?
Can we avoid solving models by using heuristics for identifying approximately equivalent models?
Adam Goodie, Professor of Psychology, UGA
Yifeng Zeng, Reader, Teesside Univ. (previously Assoc. Prof., Aalborg Univ.)
Yingke Chen, doctoral student
Xia Qu, doctoral student
Roi Ceren, doctoral student
Muthu Chandrasekaran, doctoral student
Matthew Meisel, doctoral student
Hua Mao, doctoral student
Computational modeling of probability judgment in stochastic games Computational modeling of human recursive thinking in sequential games