Feature Selection Module for CNN Based Object Detector

In the field of computer vision, the detection of multiple objects with different scales within a single image is challenging. To target this problem, feature pyramids are a basic component commonly found in multi-scale object detectors. In the construction of standard feature pyramids, different se...

Full description

Bibliographic Details
Main Authors:	Yongjun Ma, Songhua Zhang
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Access
Subjects:	Object detection image recognition multi-layer neural network attention mechanism feature fusion
Online Access:	https://ieeexplore.ieee.org/document/9405654/

id	doaj-2baef96f44cc45c9899aa5bdae2f8ad5
record_format	Article
spelling	doaj-2baef96f44cc45c9899aa5bdae2f8ad52021-05-14T23:00:33ZengIEEEIEEE Access2169-35362021-01-019694566946610.1109/ACCESS.2021.30735659405654Feature Selection Module for CNN Based Object DetectorYongjun Ma0https://orcid.org/0000-0002-0513-1793Songhua Zhang1https://orcid.org/0000-0003-0624-6741Department of Artificial Intelligence, Tianjin University of Science and Technology, Tianjin, ChinaDepartment of Artificial Intelligence, Tianjin University of Science and Technology, Tianjin, ChinaIn the field of computer vision, the detection of multiple objects with different scales within a single image is challenging. To target this problem, feature pyramids are a basic component commonly found in multi-scale object detectors. In the construction of standard feature pyramids, different semantic features are simply connected to rebuild a new feature map, regardless of whether these features have a positive effect to the output or not. In order to avoid introducing too many redundant features within the feature fusion stage, a new feature fusion module called the Feature Selection Module (FSM) was proposed in this paper, which can automatically detect the most representative features for the rebuilding of feature maps. The channel attention mechanism in FSM is able to process and score each channel, filtering out irrelevant features while focusing on features with high contribution. Moreover, FSM can be easily embedded within feature pyramids. Simply adding a small number of trainable parameters to the network can significantly improve the ability of feature extraction. We validated our FSM with the VOC 2007 object detection dataset, based on Yolo series detectors. Findings from the present study demonstrates that for a small computational cost, our method is able to consistently improve the performance of Yolo detectors.https://ieeexplore.ieee.org/document/9405654/Object detectionimage recognitionmulti-layer neural networkattention mechanismfeature fusion
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Yongjun Ma Songhua Zhang
spellingShingle	Yongjun Ma Songhua Zhang Feature Selection Module for CNN Based Object Detector IEEE Access Object detection image recognition multi-layer neural network attention mechanism feature fusion
author_facet	Yongjun Ma Songhua Zhang
author_sort	Yongjun Ma
title	Feature Selection Module for CNN Based Object Detector
title_short	Feature Selection Module for CNN Based Object Detector
title_full	Feature Selection Module for CNN Based Object Detector
title_fullStr	Feature Selection Module for CNN Based Object Detector
title_full_unstemmed	Feature Selection Module for CNN Based Object Detector
title_sort	feature selection module for cnn based object detector
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2021-01-01
description	In the field of computer vision, the detection of multiple objects with different scales within a single image is challenging. To target this problem, feature pyramids are a basic component commonly found in multi-scale object detectors. In the construction of standard feature pyramids, different semantic features are simply connected to rebuild a new feature map, regardless of whether these features have a positive effect to the output or not. In order to avoid introducing too many redundant features within the feature fusion stage, a new feature fusion module called the Feature Selection Module (FSM) was proposed in this paper, which can automatically detect the most representative features for the rebuilding of feature maps. The channel attention mechanism in FSM is able to process and score each channel, filtering out irrelevant features while focusing on features with high contribution. Moreover, FSM can be easily embedded within feature pyramids. Simply adding a small number of trainable parameters to the network can significantly improve the ability of feature extraction. We validated our FSM with the VOC 2007 object detection dataset, based on Yolo series detectors. Findings from the present study demonstrates that for a small computational cost, our method is able to consistently improve the performance of Yolo detectors.
topic	Object detection image recognition multi-layer neural network attention mechanism feature fusion
url	https://ieeexplore.ieee.org/document/9405654/
work_keys_str_mv	AT yongjunma featureselectionmoduleforcnnbasedobjectdetector AT songhuazhang featureselectionmoduleforcnnbasedobjectdetector
_version_	1721440811042734080

Feature Selection Module for CNN Based Object Detector

Similar Items