410 likes | 610 Views
Analytic Synchronization A concept for understanding emergent behaviors in asynchronous multi-agent systems. A. S. Morse Yale University. Napa Valley August 2, 2005. www.swarms.org. Research Objectives. To study multi-agent motion control problems with the objective of better
E N D
Analytic Synchronization A concept for understanding emergent behaviors in asynchronous multi-agent systems A. S. Morse Yale University Napa Valley August 2, 2005 www.swarms.org
Research Objectives To study multi-agent motion control problems with the objective of better understanding the process of devising provably correct local control strategies which can coordinate and otherwise induce desired emergent {group-wide} behaviors. Formation control:With the objective of understanding how to form, maintain, and otherwise manage a formation of mobile autonomous agents, we are actively studying a variety of algorithms based on the classical concept of graph rigidity. Much of this research has been done with Brian Anderson who will be giving a talk at the workshop on this subject. Sensor localization:As a direct spin-off from our original work formation control, we are also actively studying how global graph rigiditycan be used to advantage to deal with long-standing localization problems which arise in sensor networks. Distributed control:We are heavily involved in the study of high-level distributed control strategies of all types with the goal of understanding how to analyze and synthesize them. Earlier work with Ali Jadbabaie on heading synchronization as well as more recent work on rendezvousing are representative directions. SWARMS
We have been specifically interested in the development of techniques for analyzing asynchronous multi-agent systems. One such technique, called analytic synchronization, will be outlined later in the talk. We will talk about two problems: The Asynchronous Rendezvous Problem The Asynchronous Heading Synchronization Problem SWARMS
Agents r agenti sensing region SWARMS The Asynchronous Rendezvous Problem Consider a set of n mobile autonomous agents which can all move in the plane. Each agent is able to continuously sense the relative positions of all other agents in its “sensing region” where by agent i’s sensing region is meant a closed disk of radius r centered at agent i’s current position. Problem: Devise local control strategies, one for each agent, which without active communication between agents, cause all members of the group to eventually rendezvous at a single unspecified point. Synchronized strategies proposed in Unsynchronized Strategies ! H. Ando, Y. Oasa, I. Suzuki, M. Yamashita, October 1999 IEEE Trans. Rob. & Auto. Lin, Morse, Anderson, CDC 2003
fixed positive numbers sensing period maneuvering period is an upper bound on the time it takes for agent i to move from one way-point to the next. SWARMS Stop and Go Maneuvers {of Agent i} A stop and go maneuver takes place within a time interval consisting of two successive subintervals: Each agent is stationary during each of its sensing periods. During a maneuvering period, an agent moves from its current rest position to its next “way-point” and again comes to rest. Successive way-points for each agent are constrained to be within r units of each other. Details of maneuvering to way-points not considered.
sensing period kth kth maneuvering period tik A design parameter called a sensing timeS is chosen to satisfy SWARMS Stop and Go Maneuvers {of Agent i} A stop and go maneuver takes place within a time interval consisting of two successive subintervals: Agent i’s event times: ti1, ti2, …. Not synchronized with other agents’ event times Registered neighbors of agent i at event time tikare those agents which during agent i’s kth sensing period are stationary for at least S seconds at positions within agent i’s sensing region. It is assumed that no agent stops for as long as S seconds during any of its maneuvering periods.
sensing period kth kth maneuvering period tik SWARMS Registered neighbor is a symmetric relation: If agent j is a registered neighbor of agent i at tik and q is the sensing/maneuvering period of agent j during which registration takes place, then agent i is a registered neighbor of agent j at tjq. Registered position of neighbor j at tik is the position of agent j during the last interval of at length at least S within agent i’s kth sensing period, during which agent j is stationary. Registered neighbors of agent i at event time tikare those agents which during agent i’s kth sensing period are stationary for at least S seconds at positions within agent i’s sensing region.
Retaining Neighbors j i SWARMS Agent i is said to satisfy the motion constraint induced by neighbor j at time tik if the point to which agent i moves at the end of its kth maneuvering period is within a closed disk of diameter r centered at the mean of its position at tik and the registered position of neighbor j at tik. Neighbor Retention: Suppose that agents i and j satisfy the motion constraints induced by their registered neighbors. If agent j is a registered neighbor of agent i at tik, the agent j is also a registered neighbor of agent i at ti(k+1)
Retaining Neighbors SWARMS Agents is said to satisfy the motion constraint induced by neighbor j at time tik if the point to which agent i moves at the end of its kth maneuvering period is within a closed disk of diameter r centered at the mean of its position at tik and the registered position of neighbor j at tik. Cooperation Assumption: Each agent satisfies the motion constrains induced by each of its registered neighbors. Neighbor Retention: Suppose that agents i and j satisfy the motion constraints induced by their registered neighbors. If agent j is a registered neighbor of agent i at tik, the agent j is also a registered neighbor of agent i at ti(k+1) Implies that each agent will retain all of its current registered neighbors forever.
sensing region constraint disc Agent A neighbors constraint disc constraint disc constraint disc constraint set SWARMS Agent A’s Motion Constraints Induced by its Neighbors Either contains just agent A or is strictly convex with nonempty interior
SWARMS Way Point Selection Rules By the kth local convex hull of agent i is meant the convex hull of the set consisting of agent i’s position at time tik and the registered positions of agent i’s neighbors at time tik. Agent i’s kth way point 1. must be the same as its position at time tik if agent i has no neighbors at time tik. 2. must be within its kth local convex hull. 3. must not be a corner of its kth local convex hull unless the local convex hull is a single point. 4. must be within the motion constraint sets induced by its neighbors.
convex hull target constraint set SWARMS 0 is in the Interior of Constraint Set
convex hull target constraint set SWARMS 0 is on Boundary of Constraint Set
SWARMS MAIN RESULT Despite the lack of synchronization it is possible to define a simple undirected graph at each event time tik of each agent which characterizes which agents are in which other agents’ sensing regions at time tik. It can be shown that if there is ever an event time of any one agent at which this neighbor graph is connected, then the use of the aforementioned type of strategy by all n agents will cause them to eventually rendezvous at one point. Proving that this is so is challenging because the n agent system under consideration isasynchronous.
SWARMS Rendezvousing Asynchronously Each agent’s behavior can be modeled as a system Ai evolving on its event time sequence ti1, ti2,…. where, as we said before, tik is the time at which agent i’s kth maneuvering period begins. The n interacting Ai constitute the asynchronous multi-agent system of interest. Since event time sequences of different agents are not synchronized, standard dynamical systems analysis tools cannot be directly applied. ANALYTIC SYNCHRONIZATION The n agent asynchronous system is analyzed by “embedding” all the Ai into a single synchronous system S evolving on the time set T which results when the n individual agent event time sequences are merged into a single ordered set. This is accomplished by appropriately extending the domain of definition of each agent subsystem Ai from the event time sequence ti1, ti2,… to all of T. Convergence is then established using familiar ideas applicable to S.
SWARMS We will talk about two problems: The Asynchronous Rendezvous Problem The Asynchronous Heading Synchronization Problem
SWARMS The Asynchronous Heading Synchronization Problem
Motivated by simulation results reported in the paper: SWARMS The Asynchronous Heading Synchronization Problem
qi i = heading s = speed s SWARMS Vicsek et al. simulated a flock of n agents {particles} all moving in the plane at the same speed s, but with different headings 1,2, ….n Each agent’s heading is updated using a local rule based on the average of its own current heading plus the headings of its “neighbors.” Vicsek’s simulations demonstrated that these nearest neighbor rules can cause all agents to eventually move in the same direction despite the absence of a leader and/or centralized coordination and despite the fact that each agent’s set of neighbors changes with time. A theoretical explanation for this observed behavior can be found in Jadbabaie, Lin & Morse, IEEE TAC, June 2003
ri neighbors of agent i agent i SWARMS each agent is a neighbor of itself
s = speed s i i = heading Average at time t of headings of neighbors of agent i. SWARMS HEADING UPDATE EQUATIONS Ni(t) = set of indices of agents i’s “neighbors” at time t ni(t) = number of indices in Ni(t)
SWARMS Jadbabaie, Lin & Morse, IEEE TAC, June 2003 Interesting property: Neighbor graphs change with time Feature of the original Vicsek model considered in the above paper No leader All ri = r. Synchronous operation No delays in sensing of headings Main technical tools exploited Simple {undirected} graphs to describe neighbor relationships Algebraic graph theory A theorem of J. Wolfowitz in Proc. AMS, 1963 which gives conditions for an infinite product of left stochastic matrices to converge to a rank one matrix.
SWARMS Subsequent research by B. Francis, L. Moreau, V. Blondell, J. Tsitsiklis, D. Angeli, D. Spielman, M. Cao, B. Anderson, G. Tanner, G. Pappas, R. Beard, and others. No leader convergence rates ri r All ri = r. sensing delays Synchronous operation asynchronous operation No delays in sensing of headings Additional technical tools exploited include directed graphs scrambling matrices, Sarymsakov matrices, and random walks a special partial Lyapunov function tailored for Markov Chains – Doob {1953}, Senta {1981}, Tsisiklis {1984}, Bertsekas & Tsisiklis {1989}.
SWARMS The Asynchronous Vicsek Flocking Problem A modified version of the Vicsek flocking problem in which each agent updates its heading at times determined by its own clock. Groups’ clocks are not assumed to be synchronized together. The times at which any one agent updates its heading are not assumed to be evenly spaced.
s= speed s qi qi = heading SWARMS The Asynchronous Vicsek Model {Here t denotes real continuous time} There are n agents labeled 1 through n. Each agent i maintains a constant heading i(t) for ti(k -1)<t ·tik, k¸ 1 where ti0 = 0 and tik is agent i’s kth update event time. Event time sequences are not assumed to be synchronized Ni(tik) = set of indices of agentsi’sneighbors including itself, at timetik ni(tik) = number of indices inNi(tik)
SWARMS Flocking Asynchronously Each agent’s behavior can be modeled as a system Ai evolving on its event time sequence ti1, ti2,…. where, as we said before, tik is the time at which agent i instantaneously changes its heading from one value to the next. The n interacting Ai constitute the asynchronous multi-agent system of interest. Since event time sequences of different agents are not synchronized, standard dynamical systems analysis tools cannot be directly applied. ANALYTIC SYNCHRONIZATION The n agent asynchronous system is analyzed by “embedding” all the Ai into a single synchronous system S evolving on the index set of the time set T which results when the n individual agent event time sequences are merged into a single ordered set. This is accomplished by appropriately extending the domain of definition of each agent subsystem Ai from the event time sequence ti1, ti2,… to all of T. Convergence is then established using familiar ideas applicable to S.
t21 t22 t11 t23 t14 t27 t12 t26 t25 t15 t24 t16 t13 agent 2 agent 1 SWARMS interacting
t1 t21 t22 t2 t11 t3 t4 t23 t8 t14 t27 t12 t5 t12 t11 t26 t25 t9 t15 t10 t6 t24 t16 t13 t13 t7 agent 2 agent 1 SWARMS T =
Analytic Synchronization Define Since it must be true for any value of for which t is an event time of agent i, that where 0 is such that t 0 = next event time of agent i after t . But Therefore because i(t) is constant for t < t·t 0. SWARMS LetT= set of all event times tik of all n agents. Re-label elements of T as t0, t1, t2, … so that t + 1 > t = 0, 1, 2,...
if t is an event time of agent i But i(t) is constant between event times so if tis not an event time of agent i So if we define if tis not an event time of agent i then if tis not an event time of agent i Therefore all¸ 0 SWARMS At this point:
if tis an event time of agent i if tis not an event time of agent i all¸ 0 At its event times, agent i has the same neighbors as before embedding Between its event times, agent i has only itself as a neighbor SWARMS Summary LetT= set of all event times tik of all n agents. Re-label elements of T as t0, t1, t2, … so that t + 1 > t = 0, 1, 2,...
NEIGHBOR CONFIGURATIONS 1 6 2 5 7 4 3 (1,2) SWARMS P = index set of all possible neighbor configurations. V = agent index set ={1, 2, …, n} For each p2P Gp= {V, Ap} - a directed graph with vertex set V and arc set Ap (i, j) 2Apif agent i is a neighbor of agent j in configuration p All vertices have self-arcs G = set of all directed graphs with vertex set V and self-arcs at all vertices Since agent i has no neighbors other than itself between its event times, at such times vertex i has only one incoming arc.
7 4 1 3 5 2 6 (1,2) in-degree = 2 in-degree = 1 SWARMS Matrix Representation of Gp = {V, Ap } adjacency matrix Ap =[aij]n£n aij(p) = 1 if i is a neighbor of j aij(p) = 0 otherwise Dp = diagonal {d1(p), d2(p), …., dn(p)}n£n di(p) = in-degree of vertex i
7 4 1 3 5 2 6 (1,2) SWARMS State Space Equation adjacency matrix Ap =[aij]n £n aij(p) = 1 if i is a neighbor of j aij(p) = 0 otherwise Dp = diagonal {d1(p), d2(p), …., dn(p)}n£ n s() = index inPof neighbor configuration at time.
if tis an event time of agent i if tis not an event time of agent i SWARMS Switching Among Neighbor Graphs Constraints induced by analytic synchronization If t is not an event time of agent i, then vertex i in G() cannot have any incoming arcs other than its own.
Some Graph Concepts SWARMS Call a graph G 2 Grooted if there is at least one vertex v for which, for each vertex i, there is a directed path from v to i. Rooted graphs are important because if the graphs in the sequence G(0), G(0) ... encountered along a system trajectory are rooted, then convergence to a common heading occurs. Although rooted graphs could easily arise along a trajectory in the synchronous flocking problem, in the unsynchronized version of the problem under consideration here, this can never occur except possibly in the highly unusual situation when different agents have some of their event times occurring at exactly the same instant. By the composition of a graph G1 with a graph G2, written G2 ± G1, is meant that graph which has an arc from i to j just in case there is a vertex k for which G1 has an arc from i to k and G2 has an arc from k to j. Call a finite sequence of graphs G1, G2,….Gm in Gjointly rooted, if the composition Gm±±G1 is a rooted graph. Call an infinite sequence of graphs G1,G2,….Gj, … in Grepeatedlyjointly rooted, if there is a finite integer m such that each successive subsequence Gm(k-1) +1, G(m(k-1)+2), … Gmk, k¸ 1, is jointly rooted.
If the sequence of graphs G (0), G(1),... encountered along a trajectory is repeatedly jointly rooted, then at an exponentially fast convergence rate. SWARMS Reaching a Common Heading Asynchronously The hypothesis above is not vacuous: Despite the constraints imposed on the neighbor sets by analytic synchronization, there are in fact plenty of trajectories of the asynchronous system of interest which generate sequences of graphs which are repeatedly jointly rooted.
Observations SWARMS New data structures, models, etc. are needed to represent large groups of mobile autonomous agents and sensors at various degrees of granularity, for purposes of simulations, management, analysis and control. Such representations will exploit tools from both graph theory and from thetheory of dynamical systems At least initially, individual agent/sensor descriptions usingsimple kinematic and dynamic models will suffice. System complexitywill stem more from thenumber of agent/sensor models being studied than from the detailed properties of the individual agent models.