The dynamics of invariant object and action recognition in the human visual system

Thesis: Ph. D., Massachusetts Institute of Technology, Computational and Systems Biology Program, 2015. === Cataloged from PDF version of thesis. === Includes bibliographical references (pages 123-138). === Humans can quickly and effortlessly recognize objects, and people and their actions from comp...

Bibliographic Details
Main Author: Isik, Leyla
Other Authors: Tomaso Poggio.
Format: Others
Language: English
Published: Massachusetts Institute of Technology 2015
Subjects: Computational and Systems Biology Program.
Online Access: http://hdl.handle.net/1721.1/98000
id ndltd-MIT-oai-dspace.mit.edu-1721.1-98000
record_format oai_dc
spelling ndltd-MIT-oai-dspace.mit.edu-1721.1-98000 2019-05-02T16:05:48Z The dynamics of invariant object and action recognition in the human visual system Isik, Leyla Tomaso Poggio. Massachusetts Institute of Technology. Computational and Systems Biology Program. Massachusetts Institute of Technology. Computational and Systems Biology Program. Computational and Systems Biology Program. Thesis: Ph. D., Massachusetts Institute of Technology, Computational and Systems Biology Program, 2015. Cataloged from PDF version of thesis. Includes bibliographical references (pages 123-138). Humans can quickly and effortlessly recognize objects, and people and their actions from complex visual inputs. Despite the ease with which the human brain solves this problem, the underlying computational steps have remained enigmatic. What makes object and action recognition challenging are identity-preserving transformations that alter the visual appearance of objects and actions, such as changes in scale, position, and viewpoint. The majority of visual neuroscience studies examining visual recognition either use physiology recordings, which provide high spatiotemporal resolution data with limited brain coverage, or functional MRI, which provides high spatial resolution data from across the brain with limited temporal resolution. High temporal resolution data from across the brain is needed to break down and understand the computational steps underlying invariant visual recognition. In this thesis I use magnetoencephalography, machine learning, and computational modeling to study invariant visual recognition. I show that a temporal association learning rule for learning invariance in hierarchical visual systems is very robust to manipulations and visual disruptions that happen during development (Chapter 2).
I next show that object recognition occurs very quickly, with invariance to size and position developing in stages beginning around 100 ms after stimulus onset (Chapter 3), and that action recognition occurs on a similarly fast time scale, 200 ms after video onset, with this early representation being invariant to changes in actor and viewpoint (Chapter 4). Finally, I show that the same hierarchical feedforward model can explain both the object and action recognition timing results, putting this timing data in the broader context of computer vision systems and models of the brain. This work sheds light on the computational mechanisms underlying invariant object and action recognition in the brain and demonstrates the importance of using high temporal resolution data to understand neural computations. by Leyla Isik. Ph. D. 2015-07-31T19:13:20Z 2015-07-31T19:13:20Z 2015 2015 Thesis http://hdl.handle.net/1721.1/98000 914481107 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 138 pages application/pdf Massachusetts Institute of Technology
collection NDLTD
language English
format Others
sources NDLTD
topic Computational and Systems Biology Program.
spellingShingle Computational and Systems Biology Program.
Isik, Leyla
The dynamics of invariant object and action recognition in the human visual system
description Thesis: Ph. D., Massachusetts Institute of Technology, Computational and Systems Biology Program, 2015. === Cataloged from PDF version of thesis. === Includes bibliographical references (pages 123-138). === Humans can quickly and effortlessly recognize objects, and people and their actions from complex visual inputs. Despite the ease with which the human brain solves this problem, the underlying computational steps have remained enigmatic. What makes object and action recognition challenging are identity-preserving transformations that alter the visual appearance of objects and actions, such as changes in scale, position, and viewpoint. The majority of visual neuroscience studies examining visual recognition either use physiology recordings, which provide high spatiotemporal resolution data with limited brain coverage, or functional MRI, which provides high spatial resolution data from across the brain with limited temporal resolution. High temporal resolution data from across the brain is needed to break down and understand the computational steps underlying invariant visual recognition. In this thesis I use magnetoencephalography, machine learning, and computational modeling to study invariant visual recognition. I show that a temporal association learning rule for learning invariance in hierarchical visual systems is very robust to manipulations and visual disruptions that happen during development (Chapter 2). I next show that object recognition occurs very quickly, with invariance to size and position developing in stages beginning around 100 ms after stimulus onset (Chapter 3), and that action recognition occurs on a similarly fast time scale, 200 ms after video onset, with this early representation being invariant to changes in actor and viewpoint (Chapter 4).
Finally, I show that the same hierarchical feedforward model can explain both the object and action recognition timing results, putting this timing data in the broader context of computer vision systems and models of the brain. This work sheds light on the computational mechanisms underlying invariant object and action recognition in the brain and demonstrates the importance of using high temporal resolution data to understand neural computations. === by Leyla Isik. === Ph. D.
author2 Tomaso Poggio.
author_facet Tomaso Poggio.
Isik, Leyla
author Isik, Leyla
author_sort Isik, Leyla
title The dynamics of invariant object and action recognition in the human visual system
title_short The dynamics of invariant object and action recognition in the human visual system
title_full The dynamics of invariant object and action recognition in the human visual system
title_fullStr The dynamics of invariant object and action recognition in the human visual system
title_full_unstemmed The dynamics of invariant object and action recognition in the human visual system
title_sort dynamics of invariant object and action recognition in the human visual system
publisher Massachusetts Institute of Technology
publishDate 2015
url http://hdl.handle.net/1721.1/98000
work_keys_str_mv AT isikleyla thedynamicsofinvariantobjectandactionrecognitioninthehumanvisualsystem
AT isikleyla dynamicsofinvariantobjectandactionrecognitioninthehumanvisualsystem
_version_ 1719034628881252352