Statistical models and decision making for robotic scientific information gathering

Thesis: S.M., Joint Program in Applied Ocean Physics and Engineering (Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science; and the Woods Hole Oceanographic Institution), 2018. === This electronic version was submitted by the student author. The certified...

Full description

Bibliographic Details
Main Author: Flaspohler, Genevieve Elaine
Other Authors: Yogesh Girdhar and Nicholas Roy.
Format: Others
Language:English
Published: Massachusetts Institute of Technology 2019
Subjects:
Online Access:http://hdl.handle.net/1721.1/120607
id ndltd-MIT-oai-dspace.mit.edu-1721.1-120607
record_format oai_dc
collection NDLTD
language English
format Others
sources NDLTD
topic Joint Program in Applied Ocean Physics and Engineering.
Electrical Engineering and Computer Science.
Woods Hole Oceanographic Institution.
Oceanography
Robotics
Algorithms
Artificial intelligence
Data collection platforms
Automatic data collection systems
spellingShingle Joint Program in Applied Ocean Physics and Engineering.
Electrical Engineering and Computer Science.
Woods Hole Oceanographic Institution.
Oceanography
Robotics
Algorithms
Artificial intelligence
Data collection platforms
Automatic data collection systems
Flaspohler, Genevieve Elaine
Statistical models and decision making for robotic scientific information gathering
description Thesis: S.M., Joint Program in Applied Ocean Physics and Engineering (Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science; and the Woods Hole Oceanographic Institution), 2018. === This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. === Cataloged from student-submitted PDF version of thesis. === Includes bibliographical references (pages 97-107). === Mobile robots and autonomous sensors have seen increasing use in scientific applications, from planetary rovers surveying for signs of life on Mars, to environmental buoys measuring and logging oceanographic conditions in coastal regions. This thesis makes contributions in both planning algorithms and model design for autonomous scientific information gathering, demonstrating how theory from machine learning, decision theory, theory of optimal experimental design, and statistical inference can be used to develop online algorithms for robotic information gathering that are robust to modeling errors, account for spatiotemporal structure in scientific data, and have probabilistic performance guarantees. This thesis first introduces a novel sample selection algorithm for online, irrevocable sampling in data streams that have spatiotemporal structure, such as those that commonly arise in robotics and environmental monitoring. Given a limited sampling capacity, the proposed periodic secretary algorithm uses an information-theoretic reward function to select samples in real-time that maximally reduce posterior uncertainty in a given scientific model. Additionally, we provide a lower bound on the quality of samples selected by the periodic secretary algorithm by leveraging the submodularity of the information-theoretic reward function. Finally, we demonstrate the robustness of the proposed approach by employing the periodic secretary algorithm to select samples irrevocably from a seven-year oceanographic data stream collected at the Martha's Vineyard Coastal Observatory off the coast of Cape Cod, USA. Secondly, we consider how scientific models can be specified in environments - such as the deep sea or deep space - where domain scientists may not have enough a priori knowledge to formulate a formal scientific model and hypothesis. These domains require scientific models that start with very little prior information and construct a model of the environment online as observations are gathered. We propose unsupervised machine learning as a technique for science model-learning in these environments. To this end, we introduce a hybrid Bayesian-deep learning model that learns a nonparametric topic model of a visual environment. We use this semantic visual model to identify observations that are poorly explained in the current model, and show experimentally that these highly perplexing observations often correspond to scientifically interesting phenomena. On a marine dataset collected by the SeaBED AUV on the Hannibal Sea Mount, images of high perplexity in the learned model corresponded, for example, to a scientifically novel crab congregation in the deep sea. The approaches presented in this thesis capture the depth and breadth of the problems facing the field of autonomous science. Developing robust autonomous systems that enhance our ability to perform exploratory science in environments such as the oceans, deep space, agricultural and disaster-relief zones will require insight and techniques from classical areas of robotics, such as motion and path planning, mapping, and localization, and from other domains, including machine learning, spatial statistics, optimization, and theory of experimental design. This thesis demonstrates how theory and practice from these diverse disciplines can be unified to address problems in autonomous scientific information gathering. === by Genevieve Elaine Flaspohler. === S.M.
author2 Yogesh Girdhar and Nicholas Roy.
author_facet Yogesh Girdhar and Nicholas Roy.
Flaspohler, Genevieve Elaine
author Flaspohler, Genevieve Elaine
author_sort Flaspohler, Genevieve Elaine
title Statistical models and decision making for robotic scientific information gathering
title_short Statistical models and decision making for robotic scientific information gathering
title_full Statistical models and decision making for robotic scientific information gathering
title_fullStr Statistical models and decision making for robotic scientific information gathering
title_full_unstemmed Statistical models and decision making for robotic scientific information gathering
title_sort statistical models and decision making for robotic scientific information gathering
publisher Massachusetts Institute of Technology
publishDate 2019
url http://hdl.handle.net/1721.1/120607
work_keys_str_mv AT flaspohlergenevieveelaine statisticalmodelsanddecisionmakingforroboticscientificinformationgathering
_version_ 1719031678677024768
spelling ndltd-MIT-oai-dspace.mit.edu-1721.1-1206072019-05-02T15:57:38Z Statistical models and decision making for robotic scientific information gathering Flaspohler, Genevieve Elaine Yogesh Girdhar and Nicholas Roy. Woods Hole Oceanographic Institution. Joint Program in Applied Ocean Physics and Engineering. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. Woods Hole Oceanographic Institution. Joint Program in Applied Ocean Physics and Engineering. Electrical Engineering and Computer Science. Woods Hole Oceanographic Institution. Oceanography Robotics Algorithms Artificial intelligence Data collection platforms Automatic data collection systems Thesis: S.M., Joint Program in Applied Ocean Physics and Engineering (Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science; and the Woods Hole Oceanographic Institution), 2018. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Cataloged from student-submitted PDF version of thesis. Includes bibliographical references (pages 97-107). Mobile robots and autonomous sensors have seen increasing use in scientific applications, from planetary rovers surveying for signs of life on Mars, to environmental buoys measuring and logging oceanographic conditions in coastal regions. This thesis makes contributions in both planning algorithms and model design for autonomous scientific information gathering, demonstrating how theory from machine learning, decision theory, theory of optimal experimental design, and statistical inference can be used to develop online algorithms for robotic information gathering that are robust to modeling errors, account for spatiotemporal structure in scientific data, and have probabilistic performance guarantees. This thesis first introduces a novel sample selection algorithm for online, irrevocable sampling in data streams that have spatiotemporal structure, such as those that commonly arise in robotics and environmental monitoring. Given a limited sampling capacity, the proposed periodic secretary algorithm uses an information-theoretic reward function to select samples in real-time that maximally reduce posterior uncertainty in a given scientific model. Additionally, we provide a lower bound on the quality of samples selected by the periodic secretary algorithm by leveraging the submodularity of the information-theoretic reward function. Finally, we demonstrate the robustness of the proposed approach by employing the periodic secretary algorithm to select samples irrevocably from a seven-year oceanographic data stream collected at the Martha's Vineyard Coastal Observatory off the coast of Cape Cod, USA. Secondly, we consider how scientific models can be specified in environments - such as the deep sea or deep space - where domain scientists may not have enough a priori knowledge to formulate a formal scientific model and hypothesis. These domains require scientific models that start with very little prior information and construct a model of the environment online as observations are gathered. We propose unsupervised machine learning as a technique for science model-learning in these environments. To this end, we introduce a hybrid Bayesian-deep learning model that learns a nonparametric topic model of a visual environment. We use this semantic visual model to identify observations that are poorly explained in the current model, and show experimentally that these highly perplexing observations often correspond to scientifically interesting phenomena. On a marine dataset collected by the SeaBED AUV on the Hannibal Sea Mount, images of high perplexity in the learned model corresponded, for example, to a scientifically novel crab congregation in the deep sea. The approaches presented in this thesis capture the depth and breadth of the problems facing the field of autonomous science. Developing robust autonomous systems that enhance our ability to perform exploratory science in environments such as the oceans, deep space, agricultural and disaster-relief zones will require insight and techniques from classical areas of robotics, such as motion and path planning, mapping, and localization, and from other domains, including machine learning, spatial statistics, optimization, and theory of experimental design. This thesis demonstrates how theory and practice from these diverse disciplines can be unified to address problems in autonomous scientific information gathering. by Genevieve Elaine Flaspohler. S.M. 2019-03-01T19:33:40Z 2019-03-01T19:33:40Z 2018 2018 Thesis http://hdl.handle.net/1721.1/120607 1088412583 eng MIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission. http://dspace.mit.edu/handle/1721.1/7582 107 pages application/pdf Massachusetts Institute of Technology