Feature Selection Module for CNN Based Object Detector

In the field of computer vision, the detection of multiple objects with different scales within a single image is challenging. To target this problem, feature pyramids are a basic component commonly found in multi-scale object detectors. In the construction of standard feature pyramids, different se...

Full description

Bibliographic Details
Main Authors: Yongjun Ma, Songhua Zhang
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9405654/
id doaj-2baef96f44cc45c9899aa5bdae2f8ad5
record_format Article
spelling doaj-2baef96f44cc45c9899aa5bdae2f8ad52021-05-14T23:00:33ZengIEEEIEEE Access2169-35362021-01-019694566946610.1109/ACCESS.2021.30735659405654Feature Selection Module for CNN Based Object DetectorYongjun Ma0https://orcid.org/0000-0002-0513-1793Songhua Zhang1https://orcid.org/0000-0003-0624-6741Department of Artificial Intelligence, Tianjin University of Science and Technology, Tianjin, ChinaDepartment of Artificial Intelligence, Tianjin University of Science and Technology, Tianjin, ChinaIn the field of computer vision, the detection of multiple objects with different scales within a single image is challenging. To target this problem, feature pyramids are a basic component commonly found in multi-scale object detectors. In the construction of standard feature pyramids, different semantic features are simply connected to rebuild a new feature map, regardless of whether these features have a positive effect to the output or not. In order to avoid introducing too many redundant features within the feature fusion stage, a new feature fusion module called the Feature Selection Module (FSM) was proposed in this paper, which can automatically detect the most representative features for the rebuilding of feature maps. The channel attention mechanism in FSM is able to process and score each channel, filtering out irrelevant features while focusing on features with high contribution. Moreover, FSM can be easily embedded within feature pyramids. Simply adding a small number of trainable parameters to the network can significantly improve the ability of feature extraction. We validated our FSM with the VOC 2007 object detection dataset, based on Yolo series detectors. Findings from the present study demonstrates that for a small computational cost, our method is able to consistently improve the performance of Yolo detectors.https://ieeexplore.ieee.org/document/9405654/Object detectionimage recognitionmulti-layer neural networkattention mechanismfeature fusion
collection DOAJ
language English
format Article
sources DOAJ
author Yongjun Ma
Songhua Zhang
spellingShingle Yongjun Ma
Songhua Zhang
Feature Selection Module for CNN Based Object Detector
IEEE Access
Object detection
image recognition
multi-layer neural network
attention mechanism
feature fusion
author_facet Yongjun Ma
Songhua Zhang
author_sort Yongjun Ma
title Feature Selection Module for CNN Based Object Detector
title_short Feature Selection Module for CNN Based Object Detector
title_full Feature Selection Module for CNN Based Object Detector
title_fullStr Feature Selection Module for CNN Based Object Detector
title_full_unstemmed Feature Selection Module for CNN Based Object Detector
title_sort feature selection module for cnn based object detector
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description In the field of computer vision, the detection of multiple objects with different scales within a single image is challenging. To target this problem, feature pyramids are a basic component commonly found in multi-scale object detectors. In the construction of standard feature pyramids, different semantic features are simply connected to rebuild a new feature map, regardless of whether these features have a positive effect to the output or not. In order to avoid introducing too many redundant features within the feature fusion stage, a new feature fusion module called the Feature Selection Module (FSM) was proposed in this paper, which can automatically detect the most representative features for the rebuilding of feature maps. The channel attention mechanism in FSM is able to process and score each channel, filtering out irrelevant features while focusing on features with high contribution. Moreover, FSM can be easily embedded within feature pyramids. Simply adding a small number of trainable parameters to the network can significantly improve the ability of feature extraction. We validated our FSM with the VOC 2007 object detection dataset, based on Yolo series detectors. Findings from the present study demonstrates that for a small computational cost, our method is able to consistently improve the performance of Yolo detectors.
topic Object detection
image recognition
multi-layer neural network
attention mechanism
feature fusion
url https://ieeexplore.ieee.org/document/9405654/
work_keys_str_mv AT yongjunma featureselectionmoduleforcnnbasedobjectdetector
AT songhuazhang featureselectionmoduleforcnnbasedobjectdetector
_version_ 1721440811042734080