Enhanced HMAX model with feedforward feature learning for multiclass categorization

In recent years, the interdisciplinary research between neuroscience and computer vision has promoted the development in both fields. Many biologically inspired visual models are proposed, and among them, the Hierarchical Max-pooling model (HMAX) is a feedforward model mimicking the structures and f...

Full description

Bibliographic Details
Main Authors:	Yinlin eLi, Wei eWu, Bo eZhang, Fengfu eLi
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2015-10-01
Series:	Frontiers in Computational Neuroscience
Subjects:	feedforward HMAX saliency map multiclass categorization Biologically inspired Feature encoding
Online Access:	http://journal.frontiersin.org/Journal/10.3389/fncom.2015.00123/full

id	doaj-137243c78d2b49aa8a2a507747896b81
record_format	Article
spelling	doaj-137243c78d2b49aa8a2a507747896b812020-11-24T23:46:56ZengFrontiers Media S.A.Frontiers in Computational Neuroscience1662-51882015-10-01910.3389/fncom.2015.00123160196Enhanced HMAX model with feedforward feature learning for multiclass categorizationYinlin eLi0Wei eWu1Bo eZhang2Fengfu eLi3State Key Lab of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of SciencesState Key Lab of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of SciencesInstitute of Applied Mathematics, Academy of Mathematics and Systems Science, Chinese Academy of SciencesInstitute of Applied Mathematics, Academy of Mathematics and Systems Science, Chinese Academy of SciencesIn recent years, the interdisciplinary research between neuroscience and computer vision has promoted the development in both fields. Many biologically inspired visual models are proposed, and among them, the Hierarchical Max-pooling model (HMAX) is a feedforward model mimicking the structures and functions of V1 to posterior inferotemporal (PIT) layer of the primate visual cortex, which could generate a series of position- and scale- invariant features. However, it could be improved with attention modulation and memory processing, which are two important properties of the primate visual cortex. Thus, in this paper, based on recent biological research on the primate visual cortex, we still mimic the first 100-150 milliseconds of visual cognition to enhance the HMAX model, which mainly focuses on the unsupervised feedforward feature learning process. The main modifications are as follows: 1) To mimic the attention modulation mechanism of V1 layer, a bottom-up saliency map is computed in the S1 layer of the HMAX model, which can support the initial feature extraction for memory processing; 2) To mimic the learning, clustering and short-term memory to long-term memory conversion abilities of V2 and IT, an unsupervised iterative clustering method is used to learn clusters with multiscale middle level patches, which are taken as long-term memory; 3) Inspired by the multiple feature encoding mode of the primate visual cortex, information including color, orientation, and spatial position are encoded in different layers of the HMAX model progressively. By adding a softmax layer at the top of the model, multiclass categorization experiments can be conducted, and the results on Caltech101 show that the enhanced model with a smaller memory size exhibits higher accuracy than the original HMAX model, and could also achieve better accuracy than other unsupervised feature learning methods in multiclass categorization task.http://journal.frontiersin.org/Journal/10.3389/fncom.2015.00123/fullfeedforwardHMAXsaliency mapmulticlass categorizationBiologically inspiredFeature encoding
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Yinlin eLi Wei eWu Bo eZhang Fengfu eLi
spellingShingle	Yinlin eLi Wei eWu Bo eZhang Fengfu eLi Enhanced HMAX model with feedforward feature learning for multiclass categorization Frontiers in Computational Neuroscience feedforward HMAX saliency map multiclass categorization Biologically inspired Feature encoding
author_facet	Yinlin eLi Wei eWu Bo eZhang Fengfu eLi
author_sort	Yinlin eLi
title	Enhanced HMAX model with feedforward feature learning for multiclass categorization
title_short	Enhanced HMAX model with feedforward feature learning for multiclass categorization
title_full	Enhanced HMAX model with feedforward feature learning for multiclass categorization
title_fullStr	Enhanced HMAX model with feedforward feature learning for multiclass categorization
title_full_unstemmed	Enhanced HMAX model with feedforward feature learning for multiclass categorization
title_sort	enhanced hmax model with feedforward feature learning for multiclass categorization
publisher	Frontiers Media S.A.
series	Frontiers in Computational Neuroscience
issn	1662-5188
publishDate	2015-10-01
description	In recent years, the interdisciplinary research between neuroscience and computer vision has promoted the development in both fields. Many biologically inspired visual models are proposed, and among them, the Hierarchical Max-pooling model (HMAX) is a feedforward model mimicking the structures and functions of V1 to posterior inferotemporal (PIT) layer of the primate visual cortex, which could generate a series of position- and scale- invariant features. However, it could be improved with attention modulation and memory processing, which are two important properties of the primate visual cortex. Thus, in this paper, based on recent biological research on the primate visual cortex, we still mimic the first 100-150 milliseconds of visual cognition to enhance the HMAX model, which mainly focuses on the unsupervised feedforward feature learning process. The main modifications are as follows: 1) To mimic the attention modulation mechanism of V1 layer, a bottom-up saliency map is computed in the S1 layer of the HMAX model, which can support the initial feature extraction for memory processing; 2) To mimic the learning, clustering and short-term memory to long-term memory conversion abilities of V2 and IT, an unsupervised iterative clustering method is used to learn clusters with multiscale middle level patches, which are taken as long-term memory; 3) Inspired by the multiple feature encoding mode of the primate visual cortex, information including color, orientation, and spatial position are encoded in different layers of the HMAX model progressively. By adding a softmax layer at the top of the model, multiclass categorization experiments can be conducted, and the results on Caltech101 show that the enhanced model with a smaller memory size exhibits higher accuracy than the original HMAX model, and could also achieve better accuracy than other unsupervised feature learning methods in multiclass categorization task.
topic	feedforward HMAX saliency map multiclass categorization Biologically inspired Feature encoding
url	http://journal.frontiersin.org/Journal/10.3389/fncom.2015.00123/full
work_keys_str_mv	AT yinlineli enhancedhmaxmodelwithfeedforwardfeaturelearningformulticlasscategorization AT weiewu enhancedhmaxmodelwithfeedforwardfeaturelearningformulticlasscategorization AT boezhang enhancedhmaxmodelwithfeedforwardfeaturelearningformulticlasscategorization AT fengfueli enhancedhmaxmodelwithfeedforwardfeaturelearningformulticlasscategorization
_version_	1725491534465335296

Enhanced HMAX model with feedforward feature learning for multiclass categorization

Similar Items