Learning to control self-assembling morphologies: A study of generalization via modularity

Contemporary sensorimotor learning approaches typically start with an existing complex agent (e.g., a robotic arm), which they learn to control. In contrast, this paper investigates a modular co-evolution strategy: a collection of primitive agents learns to dynamically self-assemble into composite b...

Full description

Bibliographic Details
Main Author: Isola, Phillip John (Author)
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor)
Format: Article
Language:English
Published: 2021-01-12T18:40:29Z.
Subjects:
Online Access:Get fulltext
LEADER 01776 am a22001573u 4500
001 129385
042 |a dc 
100 1 0 |a Isola, Phillip John  |e author 
100 1 0 |a Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science  |e contributor 
245 0 0 |a Learning to control self-assembling morphologies: A study of generalization via modularity 
260 |c 2021-01-12T18:40:29Z. 
856 |z Get fulltext  |u https://hdl.handle.net/1721.1/129385 
520 |a Contemporary sensorimotor learning approaches typically start with an existing complex agent (e.g., a robotic arm), which they learn to control. In contrast, this paper investigates a modular co-evolution strategy: a collection of primitive agents learns to dynamically self-assemble into composite bodies while also learning to coordinate their behavior to control these bodies. Each primitive agent consists of a limb with a motor attached at one end. Limbs may choose to link up to form collectives. When a limb initiates a link-up action, and there is another limb nearby, the latter is magnetically connected to the 'parent' limb's motor. This forms a new single agent, which may further link with other agents. In this way, complex morphologies can emerge, controlled by a policy whose architecture is in explicit correspondence with the morphology. We evaluate the performance of these dynamic and modular agents in simulated environments. We demonstrate better generalization to test-time changes both in the environment, as well as in the structure of the agent, compared to static and monolithic baselines. Project video and code are available at https://pathak22.github.io/modular-assemblies/. 
546 |a en 
655 7 |a Article 
773 |t Advances in Neural Information Processing Systems