Randomized Sensing in Adversarial Environments

Randomized Sensing in Adversarial Environments Andreas Krause Joint work with Daniel Golovin and Alex Roper International Joint Conference on Artificial Intelligence 2011 TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAAAAA

Motivation Want to manage sensing resources to enable robust monitoring under uncertainty Roboticenvironmentalmonitoring Coordinatecameras todetect intrusions Detectsurvivors afterdisaster

A Sensor Selection Problem Select two cameras to query, in order to detect the most people. People Detected: Duplicates only counted once 4 2

A Sensor Selection Problem NP-hard… Set V of sensors, |V| = n Select a set of k sensors Sensing quality model

Submodularity Diminishing returns property for adding more sensors. +2 +1 For all , and a sensor , Many objectives are submodular[K, Guestrin ‘07] Detection, coverage, mutual information, and others

Greedy algorithm Lets choose sensors S = {v1, … , vk} greedily [Nemhauser et al ‘78] If F is submodular, greedy algorithm gives constant factor approx.:

Sensing in Adversarial Environments • Set I of m intrusion scenarios • For scenario i: Fi(A) is sensing utility when selecting A • Intruder chooses worst-case scenario, knowing the sensors 2 1

Deterministic minimax solution One approach: Want to solve [K, McMahan, Guestrin, Gupta ’08]: • NP-hard • Greedy algorithm fails arbitrarily badly • Saturate algorithm provides near-optimal solution

Disadvantage of minimax approach Suppose we pick {3} and {5} with probability 1/2 Randomization can perform arbitrarily better! 1 2

The randomized sensing problem Given submodular functions F1,…,Fm, want to find NP-hard! Even representing the optimal solution may require exponential space! 

Existing approaches • Many techniques for solving matrix games • Typically don’t scale to combinatorially large strategy sets • Security games [Tambe et al] • Solve large scale Stackelberg games for security applications • Cannot capture general submodular objective functions • LP based approach [Halvorson et al ‘09] • Double oracle with approximate best response • No polynomial time convergence convergence guarantee

Randomized sensing Define Distributionover sensingactions Distribution over intrusions Thus, can minimize over q instead of over p!

Equivalent problem: Finding q* Want to solve Use multiplicative update algorithm [Freund & Schapire ‘99] Initialize For t = 1:T NP-hard  But submodular! 

The RSense algorithm Initialize For t=1:T • Use greedy algorithm to computebased on objective function • Update Return

Performance guarantee Theorem: Let Suppose RSense runs for iterations. For the resulting distribution it holds that

Handling more general constraints So far: wanted Many application may require more complex constraints: Examples: Informative path planning: Controlling PTZ cameras: Nonuniform cost: Can replace greedy algorithm by - best response RSenseguarantees

(often) submodular [Das & Kempe ’08] Example: Lake monitoring Monitor pH values using robotic sensor transect Prediction at unobservedlocations Observations A True (hidden) pH values pH value Var(s | A) Use probabilistic model(Gaussian processes)to estimate prediction error Position s along transect Where should we sense to minimize our maximum error?

Experimental results Randomized sensing outperforms deterministic solutions

Running time RSense outperforms existing LP based method

pSPIEL Results: Search & Rescue Map from Robocup Research Challenge • Coordination of multiple mobile sensors to detect survivors of major urban disaster • Buildings obstruct viewfield of camera • Fi(A) = Expected # of people detected at location i Detection Range Detected Survivors

Experimental results • Randomization outperforms deterministic solution • RSense finds solution faster than existing methods

Worst- vs. average case Given: Possible locations V, submodular functions F1,…,Fm Want to optimize both average- and worst-case score! Can modify RSense to solve this problem! Compute best response to

Knee in tradeoff curve Tradeoff results Worst casescore Worst casescore Envtl. monitoring Search &rescue Average case score Average case score Can find good compromise between average- and worst-case score!

Conclusions • Wish to find randomized strategy for maximizing an adversarially-chosen submodular function • Developed RSense, which provides near-optimal performance • Performs well on two real applications • Search and rescue • Environmental monitoring

Randomized Sensing in Adversarial Environments

Randomized Sensing in Adversarial Environments

Presentation Transcript

Adversarial Search

Adversarial Search

Multi-Vehicle Exploration and Routing in Adversarial Environments

Adversarial Games

Wireless Sensing Applications for Critical Industrial Environments

PermaSense Sensing in Disruptive Environments

Adversarial Search

Bacterial Navigation and Applications to Sensing in Marine Environments

Adversarial Search

Adversarial Search

Opponent Modeling In Interesting Adversarial Environments

Sketching in Adversarial Environments Or Sublinearity and Cryptography

Adversarial Search

Adversarial Search

Multispectral Remote Sensing of Benthic Environments

Adversarial Search

Adversarial Search

Adversarial Search

Smart Environments for Occupancy Sensing and Services

Adversarial Search

Adversarial Search