Learning to control self-assembling morphologies: A study of generalization via modularity

Contemporary sensorimotor learning approaches typically start with an existing complex agent (e.g., a robotic arm), which they learn to control. In contrast, this paper investigates a modular co-evolution strategy: a collection of primitive agents learns to dynamically self-assemble into composite b...

Full description

Bibliographic Details
Main Author:	Isola, Phillip John (Author)
Other Authors:	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor)
Format:	Article
Language:	English
Published:	2021-01-12T18:40:29Z.
Subjects:	Article
Online Access:	Get fulltext


LEADER	01776 am a22001573u 4500
001	129385
042			\|a dc
100	1	0	\|a Isola, Phillip John \|e author
100	1	0	\|a Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science \|e contributor
245	0	0	\|a Learning to control self-assembling morphologies: A study of generalization via modularity
260			\|c 2021-01-12T18:40:29Z.
856			\|z Get fulltext \|u https://hdl.handle.net/1721.1/129385
520			\|a Contemporary sensorimotor learning approaches typically start with an existing complex agent (e.g., a robotic arm), which they learn to control. In contrast, this paper investigates a modular co-evolution strategy: a collection of primitive agents learns to dynamically self-assemble into composite bodies while also learning to coordinate their behavior to control these bodies. Each primitive agent consists of a limb with a motor attached at one end. Limbs may choose to link up to form collectives. When a limb initiates a link-up action, and there is another limb nearby, the latter is magnetically connected to the 'parent' limb's motor. This forms a new single agent, which may further link with other agents. In this way, complex morphologies can emerge, controlled by a policy whose architecture is in explicit correspondence with the morphology. We evaluate the performance of these dynamic and modular agents in simulated environments. We demonstrate better generalization to test-time changes both in the environment, as well as in the structure of the agent, compared to static and monolithic baselines. Project video and code are available at https://pathak22.github.io/modular-assemblies/.
546			\|a en
655	7		\|a Article
773			\|t Advances in Neural Information Processing Systems

Learning to control self-assembling morphologies: A study of generalization via modularity

Similar Items