430 likes | 594 Views
The DEFACTO System: Training Incident Commanders. Nathan Schurr Janusz Marecki, Milind Tambe, Nikhil Kasinadhuni, and J. P. Lewis University of Southern California Paul Scerri Carnegie Mellon University. Outline. Motivation and Domain DEFACTO Team Level Adjustable Autonomy
E N D
The DEFACTO System:Training Incident Commanders Nathan Schurr Janusz Marecki, Milind Tambe, Nikhil Kasinadhuni, and J. P. Lewis University of Southern California Paul Scerri Carnegie Mellon University
Outline • Motivation and Domain • DEFACTO • Team Level Adjustable Autonomy • Experiments with DEFACTO • Conclusions
Motivation: Help Incident Commanders • Incident Commander • First Response • Disaster Rescue Scenario • Urban Environment • Large Scale • Crime Scene • Incident commander must control situation, monitor situation, and allocate resources • Goal: Initially a Training Simulation • Later: Decision Support/Replacement
Aims of DEFACTO • LAFD Exercise Challenges • Personnel Heavy • Smaller Scale • Low Fidelity Environment • Key Exercise Components • Communication • Allocation • Agent-teams replace people playing roles • Demonstrating Effective Flexible Agent Coordination of Teams via Omnipresence
Outline • Motivation and Domain • DEFACTO • Team Level Adjustable Autonomy • Experiments with DEFACTO • Conclusions
Disaster Rescue Simulation:USC Map, Different underlying simulators Statistics
Challenges in Extending to Human-Agent Teams • Teamwork • Communication • Role Allocation • Agent team to incorporate human • Adjustable Autonomy (Scerri et al JAIR 2002) • Interface
DEFACTO • Teamwork Proxies • Machinetta • Continued development with CMU • Used in many other domains – UAVs, sensor nets etc. • Flexible Interaction • Team Level Adjustable Autonomy Strategies • Dynamic Strategy Selection • Omni-Viewer • 2D – Standard with Simulator • 3D – Developed by us • Interaction
Proxy Architecture • Abstracted Theories of Teamwork (Scerri et al AAMAS 03) • Communication: communication with other proxies • Coordination: reasoning about team plans and communication • State: the working memory of the proxy • Adjustable Autonomy: reasoning about whether to act autonomously or pass control to the team member • RAP Interface: communication with the team member Other RAP Communication RAP Interface Proxies State Coordination Adjustable Autonomy
Teamwork Proxies • Higher level TOP • Reuse across domain • Flexible Teamwork (Tambe JAIR 97) • Communication • Joint Intentions (Cohen & Levesque 1991) • Allocation • Role allocation algorithms (Xu et al AAMAS 2005) • Machinetta • Platform Independent • Modular Structure • Downloadable – Free, Publicly available
Outline • Motivation and Domain • DEFACTO • Team Level Adjustable Autonomy • Experiments with DEFACTO • Conclusions
Adjustable Autonomy(AA) Strategies for Teams • Agents dynamically adjust own level of autonomy • Agents act autonomously, but also... • Give up autonomy, transferring control to humans • When to transfer decision-making control • Whenever human has superior expertise • Yet, too many interrupts also problematic • Previous: Individual agent-human interaction
AA: Novel Challenges in Teams • Transfer of control strategies for AA in teams • Planned sequence of transfers of control • AT - Team level A strategy • H - Human strategy for all tasks • AH - Individual A followed by H • ATH - Team level A strategy followed by H • Goal: Improve Team Performance
Outline • Motivation and Domain • DEFACTO • Team Level Adjustable Autonomy • Experiments with DEFACTO • Conclusions
Experiments • Initial evaluation of system and of strategies • Details • 3 Subjects • Allocation Viewer • Same Map for each scenario • Building size and location • Initial position of fires • 4, 6, and 10 agents • A, H, AH, ATH Strategies • Averaged over 3 runs
Conclusions from Results • No strategy dominates through all cases • Humans may sometimes degrade agent team results • Slope of strategy A > Slope of H • Humans are not as good at exploiting additional agents resources • If EQH is low, then as we grow to larger numbers of agents, A will dominate AH, ATH and H • Dip at 6… • LAFD – “Not surprising.”
Summary • DEFACTO • Teamwork • Team Level Adjustable Autonomy Strategies • Interface • Experimented with strategies for adjustable autonomy • Future Directions • Experiments with LAFD • Study strategy behavior • Train the “system” • Training today, real response in the future.
Thank You • Email: schurr@usc.edu • Web Site: http://teamcore.usc.edu • Machinetta • http://teamcore.usc.edu/doc/Machinetta/ • Thanks • CREATE Center • Fred Pighin and Pratik Patil
Related Work: Disaster Response Simulations • LA County Fire Department Simulators • DEFACTO focuses on “incident commander” • “Environment” simulators: • E.g., Terrasim, EPICS • Not provide on agent behaviors • “Agent-based” simulators • E.g., Battlefield simulators • Adjustable autonomy
Outline • Motivation • Objectives • CREATE Research Center • Current State of the Art • DEFACTO • Simulator • Teamwork Proxies • 3D Visualization • Team Level Adjustable Autonomy • Models • Predictions • Experiments with DEFACTO • Conclusions
DEFACTO: Key Research Areas • Enable effective interactions of agents with humans • Research: Adjustable autonomy • Previous work: Often single agent-single human interactions • Scale-up to 100s of agents with fire engines, ambulances, police • Research: Scale-up in team coordination • Previous work: Limited numbers of agents coordinating in teams • Visualization • Robust 3D visualization
Adjustable Autonomy:Novel Challenges in Teams • Previous transfer-of-control fails in teams: • Ignore costs to team (just concerned about individual) • One shot transfers of control, too rigid • Transfer control to a human (H) or agent (A) • If human fails to make a decision, miscoordination!! • Forcing agent to decide can cause a poor decision • Expensive lesson learned in the “Electric-Elves” project • Major errors by software assistants • Hence need more flexible transfer of control
Predictions • EQh: Expected quality of human decision • AGH: How many agents human can control • A Strategy has constant slope
CREATE Research Center • Center for Risk and Economic Analysis of Terrorism Events • MANPAD Scenario • Large Scale Disaster • Limited Resources • First Response • Help incident commander control situation • Large Scale • Crime Scene
Simulator • Robocup Rescue • 10 different Simulators • Multiple Agent Types
Team Level AA Model • How to select the strategy among many? • Key idea: Calculate expected utility of different strategies • Mathematical model of strategies • EQ: Quality of an entity’s decision • P: Probability of response of that entity • W: Cost of miscoordination • Traditional Expected Utility • Probability of response * decision quality • Integrate over time
Agents Per Fire Subject A Subject B Subject C
Fire starts on 1st floor Spreads to Attic LA City Fire Dept Exercise: Fire Progression
LAFD officials simulate fire progression and the resource availability Battalion Chief allocates available resources to tasks LAFD Exercise: Simulations by People Playing Roles
Proxy Architecture • Abstracted Theories of Teamwork (Machinetta) • Platform Independent • Modular Structure Other RAP Communication RAP Interface Proxies State Coordination Adjustable Autonomy
Objectives: Agent-based Simulation Tools for Disaster Response • Improve training and decision making • Present • Teach and evaluate LAFD response tactics • Future • Agent/Robot disaster response • Key research questions in: • Multiagent coordination, Adjustable Autonomy • Visualization of multiagent systems