Visual tasks beyond categorization for training convolutional neural networks

Bibliographic Details
Main Author:	Lee, Hyo-Dong
Other Authors:	James J. DiCarlo.
Format:	Others
Language:	English
Published:	Massachusetts Institute of Technology 2016
Subjects:	Electrical Engineering and Computer Science.
Online Access:	http://hdl.handle.net/1721.1/106095

Description
Summary:	Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2016. === Cataloged from PDF version of thesis. === Includes bibliographical references (pages 21-23). === Humans can perceive a variety of visual properties of objects besides their category. In this paper, we explore whether convolutional neural networks (CNNs) can also learn object-related variables. The models are trained for object position, size, and pose, respectively, from synthetic images and tested on unseen held-out objects. First, we show that some object properties come "for free" from learning others, and that a pose-optimized model can generalize to both categorical and non-categorical variables. Second, we demonstrate that pre-training the model with pose facilitates learning object categories from both synthetic and realistic images. === by Hyodong Lee. === S.M.

Similar Items