Object detection through search with a foveated visual system.

Humans and many other species sense visual information with varying spatial resolution across the visual field (foveated vision) and deploy eye movements to actively sample regions of interests in scenes. The advantage of such varying resolution architecture is a reduced computational, hence metabol...

Full description

Bibliographic Details
Main Authors:	Emre Akbas, Miguel P Eckstein
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2017-10-01
Series:	PLoS Computational Biology
Online Access:	http://europepmc.org/articles/PMC5669499?pdf=render

id	doaj-377f6d152d8048fe831654978aa98252
record_format	Article
spelling	doaj-377f6d152d8048fe831654978aa982522020-11-25T01:33:53ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582017-10-011310e100574310.1371/journal.pcbi.1005743Object detection through search with a foveated visual system.Emre AkbasMiguel P EcksteinHumans and many other species sense visual information with varying spatial resolution across the visual field (foveated vision) and deploy eye movements to actively sample regions of interests in scenes. The advantage of such varying resolution architecture is a reduced computational, hence metabolic cost. But what are the performance costs of such processing strategy relative to a scheme that processes the visual field at high spatial resolution? Here we first focus on visual search and combine object detectors from computer vision with a recent model of peripheral pooling regions found at the V1 layer of the human visual system. We develop a foveated object detector that processes the entire scene with varying resolution, uses retino-specific object detection classifiers to guide eye movements, aligns its fovea with regions of interest in the input image and integrates observations across multiple fixations. We compared the foveated object detector against a non-foveated version of the same object detector which processes the entire image at homogeneous high spatial resolution. We evaluated the accuracy of the foveated and non-foveated object detectors identifying 20 different objects classes in scenes from a standard computer vision data set (the PASCAL VOC 2007 dataset). We show that the foveated object detector can approximate the performance of the object detector with homogeneous high spatial resolution processing while bringing significant computational cost savings. Additionally, we assessed the impact of foveation on the computation of bottom-up saliency. An implementation of a simple foveated bottom-up saliency model with eye movements showed agreement in the selection of top salient regions of scenes with those selected by a non-foveated high resolution saliency model. Together, our results might help explain the evolution of foveated visual systems with eye movements as a solution that preserves perceptual performance in visual search while resulting in computational and metabolic savings to the brain.http://europepmc.org/articles/PMC5669499?pdf=render
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Emre Akbas Miguel P Eckstein
spellingShingle	Emre Akbas Miguel P Eckstein Object detection through search with a foveated visual system. PLoS Computational Biology
author_facet	Emre Akbas Miguel P Eckstein
author_sort	Emre Akbas
title	Object detection through search with a foveated visual system.
title_short	Object detection through search with a foveated visual system.
title_full	Object detection through search with a foveated visual system.
title_fullStr	Object detection through search with a foveated visual system.
title_full_unstemmed	Object detection through search with a foveated visual system.
title_sort	object detection through search with a foveated visual system.
publisher	Public Library of Science (PLoS)
series	PLoS Computational Biology
issn	1553-734X 1553-7358
publishDate	2017-10-01
description	Humans and many other species sense visual information with varying spatial resolution across the visual field (foveated vision) and deploy eye movements to actively sample regions of interests in scenes. The advantage of such varying resolution architecture is a reduced computational, hence metabolic cost. But what are the performance costs of such processing strategy relative to a scheme that processes the visual field at high spatial resolution? Here we first focus on visual search and combine object detectors from computer vision with a recent model of peripheral pooling regions found at the V1 layer of the human visual system. We develop a foveated object detector that processes the entire scene with varying resolution, uses retino-specific object detection classifiers to guide eye movements, aligns its fovea with regions of interest in the input image and integrates observations across multiple fixations. We compared the foveated object detector against a non-foveated version of the same object detector which processes the entire image at homogeneous high spatial resolution. We evaluated the accuracy of the foveated and non-foveated object detectors identifying 20 different objects classes in scenes from a standard computer vision data set (the PASCAL VOC 2007 dataset). We show that the foveated object detector can approximate the performance of the object detector with homogeneous high spatial resolution processing while bringing significant computational cost savings. Additionally, we assessed the impact of foveation on the computation of bottom-up saliency. An implementation of a simple foveated bottom-up saliency model with eye movements showed agreement in the selection of top salient regions of scenes with those selected by a non-foveated high resolution saliency model. Together, our results might help explain the evolution of foveated visual systems with eye movements as a solution that preserves perceptual performance in visual search while resulting in computational and metabolic savings to the brain.
url	http://europepmc.org/articles/PMC5669499?pdf=render
work_keys_str_mv	AT emreakbas objectdetectionthroughsearchwithafoveatedvisualsystem AT miguelpeckstein objectdetectionthroughsearchwithafoveatedvisualsystem
_version_	1725075199000313856

Object detection through search with a foveated visual system.

Similar Items