What and Where: A Bayesian inference theory of visual attention

In the theoretical framework of this paper, attention is part of the inference process that solves the visual recognition problem of what is where. The theory proposes a computational role for attention and leads to a model that predicts some of its main properties at the level of psychophysics and...

Full description

Bibliographic Details
Main Authors: Chikkerur, Sharat (Contributor), Serre, Thomas R. (Contributor), Tan, Cheston (Contributor), Poggio, Tomaso A. (Contributor)
Other Authors: Massachusetts Institute of Technology. Center for Biological & Computational Learning (Contributor), Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences (Contributor), McGovern Institute for Brain Research at MIT (Contributor)
Format: Article
Language:English
Published: Elsevier B.V., 2011-06-22T15:57:31Z.
Subjects:
Online Access:Get fulltext
LEADER 02527 am a22003013u 4500
001 64647
042 |a dc 
100 1 0 |a Chikkerur, Sharat  |e author 
100 1 0 |a Massachusetts Institute of Technology. Center for Biological & Computational Learning  |e contributor 
100 1 0 |a Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences  |e contributor 
100 1 0 |a McGovern Institute for Brain Research at MIT  |e contributor 
100 1 0 |a Poggio, Tomaso A.  |e contributor 
100 1 0 |a Chikkerur, Sharat  |e contributor 
100 1 0 |a Serre, Thomas R.  |e contributor 
100 1 0 |a Tan, Cheston  |e contributor 
100 1 0 |a Poggio, Tomaso A.  |e contributor 
700 1 0 |a Serre, Thomas R.  |e author 
700 1 0 |a Tan, Cheston  |e author 
700 1 0 |a Poggio, Tomaso A.  |e author 
245 0 0 |a What and Where: A Bayesian inference theory of visual attention 
260 |b Elsevier B.V.,   |c 2011-06-22T15:57:31Z. 
856 |z Get fulltext  |u http://hdl.handle.net/1721.1/64647 
520 |a In the theoretical framework of this paper, attention is part of the inference process that solves the visual recognition problem of what is where. The theory proposes a computational role for attention and leads to a model that predicts some of its main properties at the level of psychophysics and physiology. In our approach, the main goal of the visual system is to infer the identity and the position of objects in visual scenes: spatial attention emerges as a strategy to reduce the uncertainty in shape information while feature-based attention reduces the uncertainty in spatial information. Featural and spatial attention represent two distinct modes of a computational process solving the problem of recognizing and localizing objects, especially in difficult recognition tasks such as in cluttered natural scenes. We describe a specific computational model and relate it to the known functional anatomy of attention. We show that several well-known attentional phenomena - including bottom-up pop-out effects, multiplicative modulation of neuronal tuning curves and shift in contrast responses - all emerge naturally as predictions of the model. We also show that the Bayesian model predicts well human eye fixations (considered as a proxy for shifts of attention) in natural scenes. 
520 |a United States. Defense Advanced Research Projects Agency (FA8650-06-C-7632) 
520 |a United States. Defense Advanced Research Projects Agency (FA8650-09-1-7946) 
546 |a en_US 
655 7 |a Article 
773 |t Visual Research