Ergodic Specifications for Flexible Swarm Control: From User Commands to Persistent Adaptation

Ahalya Prabhakar, Ian Abraham, Annalisa Taylor, Millicent Schlafly, Katarina Popovic, Giovani Diniz, Brendan Teich, Borislava Simidchieva, Shane Clark, Todd Murphey

Abstract

This paper presents a formulation for swarm control and high-level task planning that is dynamically responsive to user commands and adaptable to environmental changes. We design an end-to-end pipeline from a tactile tablet interface for user commands to onboard control of robotic agents based on decentralized ergodic coverage. Our approach demonstrates reliable and dynamic control of a swarm collective through the use of ergodic specifications for planning and executing agent trajectories as well as responding to user and external inputs. We validate our approach in a virtual reality simulation environment and in real-world experiments at the DARPA OFFSET Urban Swarm Challenge FX3 field tests with a robotic swarm where user-based control of the swarm and mission-based tasks require a dynamic and flexible response to changing conditions and objectives in real-time.

Live Paper Discussion Information

	Start Time	End Time
	07/15 15:00 UTC	07/15 17:00 UTC

Virtual Conference Presentation

Paper Reviews

Review 1

Overall comment: I am not entirely convinced this should be referred to as a swarm controller. It appears, to be effective, a lot of infomation must be passed between agents which gives the intuition it would not scale well. It may be more appropriate calling this a multi-agent control strategy. Introduction: -The first paragraph could be make the problem space more clear. -E.G. "One of the biggest problems in multi-agent control of robotic systems is the management and individualized control of the swarm of robots." The way this could read is the problem is; to enable an individual human operator single out and control individual agents in the swarm or design individual agent control laws that coalesce into cooperative behaviors or design interfaces enabling collective management and control of a swarm by a single human. -E.G."However, it is still necessary to develop a method that integrates both a framework to incorporate a user command into the supervision of a swarm and individual robot-level planning algorithms." Like the previous example this is also unclear. An example may clear this up. -"Our approach motivates the use of flexible density descriptions where each agent is responsible for coverage of the full area, but can communicate its past and intended trajectory to the other agents. This allows for each agent to prioritize local exploration while ensuring coverage specifications are robust to network dynamics." There is a lot of information presented here that make the approach that will be presented unclear and how it is different than [3]. E.g. What is a flexible density description, how the robots are covering the full area (patrolling?) while locally exloring, coverage specifications robust to network dynamics (do robots need to stay in proximity for communication)? Algorithm: -Eq (1) there is nothing wrong with this but the standard variables used for control affine systems are typically f(x) + g(x)u. Having the natural dynamics as g may go against the intuition of many readers. -Eq (2) what is s? -What is a sine arbitrary spatial distribution? I'm assuming this is a Fourier series for a spatial density function but it should be made clear in the text. -Eq (5) u should be defined differently as the ensemble control. -The agent dynamics being independent is a fine assumption but this reviewer believes this statement is not entirely true due to the obstacle avoidance and potentially the RRT planner that is dependent on the other agents (negligable at low density but probably not at high density). -Second to last paragraph of left column of page 4: "Minimizing the ergodic metric thus avoids issues often faced with multimodal optimizations as the robot will allocate proportional amounts of its time within some allotted time depending on the measure of importance specified by all the elements that are desirable (e.g., easter eggs and overriding user commands)." I think this should be allocated proportional amounts of time within some allotted space? Results: -The communication topology of the swarm is incredibly important to these results but is not really investigated or explained. Is everything presented here a star topology through a central computer, a fully connected graph, is there a communication range? -An interesting experiment would be to see how the cost function behaves with respect to some appropriate network topology metric. Figures: Figure 1 is never referred to in the text. There is no figure 2. Figure 3 is not referred to in the text.

Review 2

Strengths: The video was helpful in understanding how the spatial allocation of robots behaved in practice. This was a nice complement to the heat maps used in Figures 9 and 10 and helped improve the clarity of the paper. The work appears to be original and has the potential for being significant. I'd personally like to read more about the approach. The derivation of the ergodic control law appeared to be solid and well thought through. The math made sense, though there were some assumptions that had to be made (e.g., I had to assume that saying that v<n in the paragraph between equation (2) and equation (3) meant that the distribution could apply to 2D even though robot dynamics were in 6D (3 spatial dimensions plus roll, tilt, and yaw)). Areas for Improvement: There was a mismatch between some of the claims of the paper and the evidence provided to support those claims. Mismatches include: * Footnote 2 and the introductory paragraph if Section III assert that the work applies to heterogeneous agents, but the derivation and demonstration were only for homogeneous agents. Without the details, the claim about heterogeneity is not supported. * The last paragraph under figure 9 claimed that the swarm uniformly covered the workspace, but there was no quantitative data to support this so it is difficult to have confidence in the claim (or even fully understand what "uniformly covered" means). * The discussion claimed that the formulation could minimize human operator workload, but no data was gathered to support this assertion. An expert in human factors might point out that understanding why the agents were doing what they were doing might require more workload than giving inputs to another algorithm like a sheepdog steering algorithm. * The discussion claimed that as the number of agents was reduced the algorithm adapted, but no data was used to quantify what this meant. Similarly, claims were made about scalability and the effect of various communication topologies that were not quantified, * A claim was made in the introduction that adjusting swarm behavior by influencing individual agents becomes less effective as the number of agents grows. This depends on the way individual agents are selected, how the individual agents affect the other agents, and so on. Approaches such as those from PB Sujit's lab and MA Goodrich's lab show the ability to influence a lot of agents using very few individuals, so the claim needs to be better scoped and explained. There were some details that were missing that made it difficult to evaluate how generalizable the results were. Missing details include: * How did the RRT* planning algorithms ensure that agents would not collide? It didn't appear that the ergodic part of the algorithm addressed collisions, so the collision-avoidance must have been done in the RRT* algorithm. * The ergodic specification was decentralized, but it depended on all agents knowing the c_k parameters. A claim was made that the result was robust to varying ways of communicating C_k, but details were not included to explain how this would work.

Review 3

The paper is overall well written and is well structured. The formulation in section III was described in detail. Sufficient description was provided for each figure. The idea of combining the ergodic planner’s tendency to explore recently unvisited locations and user specified locations seems to be beneficial. This allows the user to concentrate on finding interesting locations, rather than spending time managing the swarm to spread across the area. The authors mention that they intend “to test human cognitive load” using the developed system in the future. Personally, I am interested in whether users would find certain aspects of the developed system easy or difficult to use and what might be the reason for those outcomes. For example, the usefulness of the tactile tablet seems trivial without a user study showcasing its advantages. For this paper’s purpose, perhaps an ordinary touchscreen tablet could have been sufficient to control the swarm? It is not clearly explained how a user explores within the virtual reality environment. The paper suggests that the user is able to navigate within the simulated environment. However, there is no description of how this is done. The ability to move around to change the viewpoint is a distinct type of control that adds further workload to the user while monitoring the swarm. The presentation quality of the work requires input from a senior author. Examples below. The meaning of the term ergodic should be introduced to help the reader. The first figure should not be placed before the abstract. Also on the first page, a figure should not span both columns. The abbreviation VR does not need to be introduced in the abstract, as it is not used there. It needs to be introduced in Section II, when first used. Figures 4, 5 and 8 are not well formatted. There are several minor typos/errors in the paper: In page 1, “Our approach attempts to mitigate these issues through a decentralized strategy which is independent of an central control hub ...” This should be “a” instead of “an”. In page 1, “[3] presents a decentralized, density-based coverage approach which influences multiples robots in a swarm from user commands.” This should be “multiple” instead of “multiples”. In page 1, “Other planners that attempt to replan based on updates swarm often rely on” - this is not clear. In page 5, “The parameter Σ is the width of the region of attraction (or repulsion) that can be tuned basd on the size of the task space and the desired granularity.” This should be “based” instead of “basd”. In page 7, “The ergodic specification enables each agent to constantly generates actions ... ” This should be “generate” instead of “generates”. Some references are incomplete, for example, [13]. Some titles in references do not use correct lower/upper casing, for example, [12].