Invariant recognition drives neural representations of action sequences.

Recognizing the actions of others from visual stimuli is a crucial aspect of human perception that allows individuals to respond to social cues. Humans are able to discriminate between similar actions despite transformations, like changes in viewpoint or actor, that substantially alter the visual ap...

Full description

Bibliographic Details
Main Authors:	Andrea Tacchetti, Leyla Isik, Tomaso Poggio
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2017-12-01
Series:	PLoS Computational Biology
Online Access:	https://doi.org/10.1371/journal.pcbi.1005859

id	doaj-1d6f2227161a47b5b8f8107c4b4ceda5
record_format	Article
spelling	doaj-1d6f2227161a47b5b8f8107c4b4ceda52021-04-21T15:10:03ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582017-12-011312e100585910.1371/journal.pcbi.1005859Invariant recognition drives neural representations of action sequences.Andrea TacchettiLeyla IsikTomaso PoggioRecognizing the actions of others from visual stimuli is a crucial aspect of human perception that allows individuals to respond to social cues. Humans are able to discriminate between similar actions despite transformations, like changes in viewpoint or actor, that substantially alter the visual appearance of a scene. This ability to generalize across complex transformations is a hallmark of human visual intelligence. Advances in understanding action recognition at the neural level have not always translated into precise accounts of the computational principles underlying what representations of action sequences are constructed by human visual cortex. Here we test the hypothesis that invariant action discrimination might fill this gap. Recently, the study of artificial systems for static object perception has produced models, Convolutional Neural Networks (CNNs), that achieve human level performance in complex discriminative tasks. Within this class, architectures that better support invariant object recognition also produce image representations that better match those implied by human and primate neural data. However, whether these models produce representations of action sequences that support recognition across complex transformations and closely follow neural representations of actions remains unknown. Here we show that spatiotemporal CNNs accurately categorize video stimuli into action classes, and that deliberate model modifications that improve performance on an invariant action recognition task lead to data representations that better match human neural recordings. Our results support our hypothesis that performance on invariant discrimination dictates the neural representations of actions computed in the brain. These results broaden the scope of the invariant recognition framework for understanding visual intelligence from perception of inanimate objects and faces in static images to the study of human perception of action sequences.https://doi.org/10.1371/journal.pcbi.1005859
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Andrea Tacchetti Leyla Isik Tomaso Poggio
spellingShingle	Andrea Tacchetti Leyla Isik Tomaso Poggio Invariant recognition drives neural representations of action sequences. PLoS Computational Biology
author_facet	Andrea Tacchetti Leyla Isik Tomaso Poggio
author_sort	Andrea Tacchetti
title	Invariant recognition drives neural representations of action sequences.
title_short	Invariant recognition drives neural representations of action sequences.
title_full	Invariant recognition drives neural representations of action sequences.
title_fullStr	Invariant recognition drives neural representations of action sequences.
title_full_unstemmed	Invariant recognition drives neural representations of action sequences.
title_sort	invariant recognition drives neural representations of action sequences.
publisher	Public Library of Science (PLoS)
series	PLoS Computational Biology
issn	1553-734X 1553-7358
publishDate	2017-12-01
description	Recognizing the actions of others from visual stimuli is a crucial aspect of human perception that allows individuals to respond to social cues. Humans are able to discriminate between similar actions despite transformations, like changes in viewpoint or actor, that substantially alter the visual appearance of a scene. This ability to generalize across complex transformations is a hallmark of human visual intelligence. Advances in understanding action recognition at the neural level have not always translated into precise accounts of the computational principles underlying what representations of action sequences are constructed by human visual cortex. Here we test the hypothesis that invariant action discrimination might fill this gap. Recently, the study of artificial systems for static object perception has produced models, Convolutional Neural Networks (CNNs), that achieve human level performance in complex discriminative tasks. Within this class, architectures that better support invariant object recognition also produce image representations that better match those implied by human and primate neural data. However, whether these models produce representations of action sequences that support recognition across complex transformations and closely follow neural representations of actions remains unknown. Here we show that spatiotemporal CNNs accurately categorize video stimuli into action classes, and that deliberate model modifications that improve performance on an invariant action recognition task lead to data representations that better match human neural recordings. Our results support our hypothesis that performance on invariant discrimination dictates the neural representations of actions computed in the brain. These results broaden the scope of the invariant recognition framework for understanding visual intelligence from perception of inanimate objects and faces in static images to the study of human perception of action sequences.
url	https://doi.org/10.1371/journal.pcbi.1005859
work_keys_str_mv	AT andreatacchetti invariantrecognitiondrivesneuralrepresentationsofactionsequences AT leylaisik invariantrecognitiondrivesneuralrepresentationsofactionsequences AT tomasopoggio invariantrecognitiondrivesneuralrepresentationsofactionsequences
_version_	1714667741593468928

Invariant recognition drives neural representations of action sequences.

Similar Items