1.11k likes | 1.29k Views
Can Negotiation Breakdown Probabilities of Laissez-Faire Agents Be Derived A Priori?. R. Loui, R. Ratkowski, J. Rosen Department of Computer Science and Engineering Washington University St. Louis USA. Trailers/Previews. DataMining on OC192 Data Streams (10Gbps) A.k.a. "Streaming AI"
E N D
Can Negotiation Breakdown Probabilities of Laissez-Faire Agents Be Derived A Priori? R. Loui, R. Ratkowski, J. RosenDepartment of Computer Science and EngineeringWashington UniversitySt. LouisUSA r.p.loui AIVR UIUC
Trailers/Previews • DataMining on OC192 Data Streams (10Gbps) • A.k.a. "Streaming AI" • With John Lockwood (from UIUC), PI • 1M x speedup of classified intelligence task • Better performance than available software • Better performance than SVM methods • Related to FPgrep, FPsed, FPawk patent US 7093023 • Methods, systems, and devices using reprogrammable hardware for high-speed processing of streaming data to find a redefinable pattern and respond thereto • W/Moshe Looks, papers on hierarchical streaming clustering r.p.loui AIVR UIUC
Trailers/Previews • Scripting Languages and The New Programming Pragmatics • http://www.cse.wustl.edu/~loui/praiseieee.txt • The real shock is that academia continues to reject the sea change in programming practices brought about by scripting. • Scripting was not enervating but was actually renewing: programmers who viewed code generation as tedious and tiresome … viewed scripting as rewarding self-expression or recreation. • I personally believe that CS1 java is the greatest single mistake in the history of computing curricula. • Linguists recognize something above syntax and semantics, and they call it "pragmatics". We are entering an era of comparative programming language study when the issues are higher-level, social, and cognitive too. r.p.loui AIVR UIUC
Trailers/Previews • Moshe Looks, D.Sc. 06 (expected) • Externally advised by David Goldberg (UIUC) and Martin Pelikan (formerly UIUC) • COMPETENT PROGRAM EVOLUTION • My thesis is that the properties of programs and program spaces can be leveraged as inductive bias to reduce the burden of manual representation-building, leading to competent program evolution. • The central contributions of this dissertation are • a view of the requirements for competent program evolution, and • the design of a procedure, meta-optimizing semantic evolutionary search (MOSES) r.p.loui AIVR UIUC
Trailers/Previews • Dynamics of Rule Revision and Strategy Revision • With M. Looks, B. Cynamon (U Chicago), A. Schiller (Princeton) • A.k.a. Legislature vs. Population games • H.L.A. Hart: There is a limit, inherent in the nature of language, to the guidance which general language can provide. • Abridgement = Projection of a veridical utility function • Scenario extinction • Sheep, Weasels, and Gorillas r.p.loui AIVR UIUC
Today's Abstract We present a different AI model of negotiation where agents are driven by dynamic expectations (there is no solution concept and there is no recursive modeling of beliefs). We require two assumptions to paint a new picture: (1) there is an empirically observable objective probability of breakdown that is monotonic (at some granularity) in elapsed time since last progress; (2) there is a nonstandard utility attached to the act of unilateral breakdown: a process utility that models the satisfaction of breaking down on a non-cooperative negotiating partner. This is a procedural fairness adjustment, not the substantive distributive fairness effect that has been trendy in the economics literature. We observe the variety of behaviors that can be generated by agents constructing action according to such Pessimism-Punishment (PP) negotiation models. We define a laissez-faire path for two PP agents starting in a given position and the proper calibration of their breakdown probabilities conditioned only on position. Finally, we discuss what iterative process could be used to reduce a priori miscalibration of breakdown probabilities. r.p.loui AIVR UIUC
Negotiation Behavior • Social Psychology: Dean Pruitt • Logrolling issues • Management Sci: Howard Raiffa / Max Bazerman - Margaret Neale • Integrative agreement • Law: Roger Fisher – William Ury – Bruce Patton • Principled negotiation • Artificial Intelligence: • Problem solving: R Davis – R Smith / V Lesser • Shared planning dialogue: S Carberry / G Ferguson – J Allen • Non-ideal game theory: E Durfee / S Rosenschein / T Sandholm • Argumentation: K Sycara / S Parsons – C Sierra – N Jennings • Economics: John Nash / Ariel Rubinstein • Solution concept • Equilibrium r.p.loui AIVR UIUC
Negotiation Behavior: Equilibrium • Mathematical curiosity (cf. Axelrod) • Starts with "Solution concept (I or II)": reduction of uncertainty to a distinct outcome • Epistemologically far-fetched • Empirically ridiculous • Philosophically indefensible • Useless in the design of negotiating agents r.p.loui AIVR UIUC
Negotiation Behavior: Equilibrium • Mathematical curiosity (cf. Axelrod) • Starts with "Solution concept (I or II)": reduction of uncertainty to a distinct outcome • Epistemologically far-fetched • Empirically ridiculous • Philosophically indefensible • Useless in the design of negotiating agents r.p.loui AIVR UIUC
Negotiation Behavior: AI • A place for language / argument • A place for introspection on utilities • A range of interesting & reasonable behaviors • Participation in the process of negotiation: • Exchanging proposals • Reacting to proposals r.p.loui AIVR UIUC
My Theory Pt. I • Observation: parties to a negotiation (can) construct a probability distribution over potential settlements r.p.loui AIVR UIUC
Party 1'saspiration Party 2'saspiration r.p.loui AIVR UIUC
Party 1'sproposals at t Party 2'sproposals at t r.p.loui AIVR UIUC
inadmissible(dominated)at t inadmissible(dominated)at t r.p.loui AIVR UIUC
In black: admissiblesettlementsat t(probabilityof agreement Is non-zero) r.p.loui AIVR UIUC
1's aspiration 2's aspiration r.p.loui AIVR UIUC
Breakdown (BATNA) r.p.loui AIVR UIUC
Breakdown (BATNA) r.p.loui AIVR UIUC
Breakdown column Breakdown row r.p.loui AIVR UIUC
Breakdownwould occurhere (BATNA) r.p.loui AIVR UIUC
1's security level 1 would rather breakdown 2's security level 2 would rather breakdown r.p.loui AIVR UIUC
Prob(bd) = ? Eu1|s = 51 Eu2|s = 49α +54(1-α) r.p.loui AIVR UIUC
My Theory Pt. I • Observation: parties to a negotiation (can) construct a probability distribution over potential settlements r.p.loui AIVR UIUC
My Theory Pt. I • Observation: parties to a negotiation (can) construct a probability distribution over potential settlements • What kind of probability? r.p.loui AIVR UIUC
My Theory Pt. I • Observation: parties to a negotiation (can) construct anobjective, empirical or epistemological*(NOT SUBJECTIVE*) probability distribution over potential settlements from past experience in similar settings • OBJECTIVE: • Constructed from data • Agree on P, given K • Not committed to P(a) until queried about P(a) • SUBJECTIVE: • "Bayesian" (but not necessarily what AI people mean) • Can change by new prior, new conditioning, or shift in feelings • Total and consistent prior to querying r.p.loui AIVR UIUC
My Theory Pt. I • Observation: from a probability distribution over potential settlements, there is an expected utility given settlement • Observation: there is a probability of breakdown p(bd) r.p.loui AIVR UIUC
P(bd) gap probability of break down r.p.loui AIVR UIUC
My Theory Pt. I • Observation: from a probability distribution (at t) over potential settlements, there is an expected utility given settlement (at t) • Observation: there is a probability of breakdown pt(bd) r.p.loui AIVR UIUC
My Theory Pt. I • Observation: At t, calculate 1. An expected utility given settlement (Eut|s) and 2. An expected utility given continued negotiation,Eut = (Eut |s) (1 - pt(bd)) + u(bd) pt(bd) • Definition: Rationality requires the agent, at t, to: 1. Extend an offer, o, if Eut < u(o) 2. Accept an offer, a, if Eut < u(a), a offers-to-you(t) 3. Break down unilaterally if Eut < u(bd) r.p.loui AIVR UIUC
My Theory Pt. I • Why not iff? • Extend an offer, o, if Eut < u(o) • Withhold an offer?, o, if Eut > u(o) • There may be other reasons for acting earlier • Constructivism: • Multiple ways of constructing probability • Multiple ways of deriving/justifying/motivating action r.p.loui AIVR UIUC
My Theory Pt. I • Empirical Observation: At sufficient granularity, p(bd) is decreasing in the time since last progress r.p.loui AIVR UIUC
My Theory Pt. I PessimismFor sufficiently large Δ, where LP(t0) denotes last progress at t0 pt+Δ(bd | LP(t0)) > pt(bd | LP(t0))What is progress?A non-trivial offer by the other partyWhat does this mean?(at some granularity, the past record implies that:)If there are no offers, the probability of breakdown rises r.p.loui AIVR UIUC
My Theory Pt. I PessimismLinear Pessimismp(bd | NP(t)) = π tExponential Pessimismp(bd | NP(t)) = 1 - e-πt Delayed Linear Pessimismp(bd | NP(t)) = π max(0, t - t0) Whatever fits the empirical record! r.p.loui AIVR UIUC
Pessimism causes Eu to fall Next offer is made at this timeand prob(bd) resets to 0 Expectation starts to fall again r.p.loui AIVR UIUC
offers reciprocated offers r.p.loui AIVR UIUC
Agreement reached as Eu < u1 r.p.loui AIVR UIUC
Whenever u(acc) > security, acceptance occurs before breakdown! Best offer received security r.p.loui AIVR UIUC
Would you accept an 11-cent offer if yoursecurity were 10-cents? Best offer received security r.p.loui AIVR UIUC
My Theory Pt. II • Observation: You wouldn't accept 11¢ over 10 ¢ security, nor 51 ¢ over 50 ¢ security • Observation: You wouldn't let your kid do it • Observation: Your Mother wouldn't let you do it • Observation: Your lawyer wouldn't let you do it • Observation: Your accountant wouldn't let you do it • Proposition: We shouldn't automate our agents to do it r.p.loui AIVR UIUC
My Theory Pt. II • Question: Isn't this an issue of distributive justice • Answer: Substantive fairness is trivial to model by transforming utilities • Observation: There may be a procedural fairness issue r.p.loui AIVR UIUC
My Theory Pt. II • Procedural fairness: • the more the other party withholds progress, the more you will punish • When the other party resumes cooperation, you are willing to forgo punishment r.p.loui AIVR UIUC
My Theory Pt. II Resentmentu(bd) = security + resentment(t) What is resentment?1. Dignity2. Pride3. Investment in society4. Protection against non-progressive manipulators 5. A GENUINE source of satisfaction: non-material, transactional, personal(?), transitory(?) r.p.loui AIVR UIUC
My Theory Pt. II Resentmentut(bd) = security + resentment(t) = u(bd) + r(t) for NP(t), non-progress for a period tWhat is resentment?6. Attached to a speech/dialogue act: BATNA through breaking down vs. BATNA through agreement7. A nonstandard utility (process utility)8. Specific or indifferent (I-bd-you vs. you-bd-me) r.p.loui AIVR UIUC
My Theory Pt. II Resentmentlinear resentmentr(t) = ρtsigmoid resentmentr(t) = rmax(2/(1+e-ρt) -1) You either feel it or you don't – you can't fake it! r.p.loui AIVR UIUC
Eu never falls to u1 r.p.loui AIVR UIUC
Actually accepts becauseresentment resets with progress Nontrivial progess Resentment resets to zero each time there is progress r.p.loui AIVR UIUC
Agent breaks down before accepting Resentment might not reset to zero if there is memory r.p.loui AIVR UIUC
P&P Agents Pessimism + Punishment"purely" probablistic r.p.loui AIVR UIUC
Variety of Behaviors • Agent can make a series of offers, responds to offers • Agent can wait, then offer, accept, or break down • Agent can accept, offer, or break down immediately • Agent can offer before accepting and vice versa • Agent can breakdown before accepting and vice versa • Agent can offer before breaking down and vice versa • Agent can be on path to breakdown, then on path to acceptance • because received offer changes Eu or resentment • because extended offer changes Eu • I wouldn't use this as my agent on ebay quite yet… r.p.loui AIVR UIUC
(Assumes no progress) Linear pess/linear specific pun low-valued ρ high-valued ρ r.p.loui AIVR UIUC