Feature Selection Module for CNN Based Object Detector
In the field of computer vision, the detection of multiple objects with different scales within a single image is challenging. To target this problem, feature pyramids are a basic component commonly found in multi-scale object detectors. In the construction of standard feature pyramids, different se...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9405654/ |
id |
doaj-2baef96f44cc45c9899aa5bdae2f8ad5 |
---|---|
record_format |
Article |
spelling |
doaj-2baef96f44cc45c9899aa5bdae2f8ad52021-05-14T23:00:33ZengIEEEIEEE Access2169-35362021-01-019694566946610.1109/ACCESS.2021.30735659405654Feature Selection Module for CNN Based Object DetectorYongjun Ma0https://orcid.org/0000-0002-0513-1793Songhua Zhang1https://orcid.org/0000-0003-0624-6741Department of Artificial Intelligence, Tianjin University of Science and Technology, Tianjin, ChinaDepartment of Artificial Intelligence, Tianjin University of Science and Technology, Tianjin, ChinaIn the field of computer vision, the detection of multiple objects with different scales within a single image is challenging. To target this problem, feature pyramids are a basic component commonly found in multi-scale object detectors. In the construction of standard feature pyramids, different semantic features are simply connected to rebuild a new feature map, regardless of whether these features have a positive effect to the output or not. In order to avoid introducing too many redundant features within the feature fusion stage, a new feature fusion module called the Feature Selection Module (FSM) was proposed in this paper, which can automatically detect the most representative features for the rebuilding of feature maps. The channel attention mechanism in FSM is able to process and score each channel, filtering out irrelevant features while focusing on features with high contribution. Moreover, FSM can be easily embedded within feature pyramids. Simply adding a small number of trainable parameters to the network can significantly improve the ability of feature extraction. We validated our FSM with the VOC 2007 object detection dataset, based on Yolo series detectors. Findings from the present study demonstrates that for a small computational cost, our method is able to consistently improve the performance of Yolo detectors.https://ieeexplore.ieee.org/document/9405654/Object detectionimage recognitionmulti-layer neural networkattention mechanismfeature fusion |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yongjun Ma Songhua Zhang |
spellingShingle |
Yongjun Ma Songhua Zhang Feature Selection Module for CNN Based Object Detector IEEE Access Object detection image recognition multi-layer neural network attention mechanism feature fusion |
author_facet |
Yongjun Ma Songhua Zhang |
author_sort |
Yongjun Ma |
title |
Feature Selection Module for CNN Based Object Detector |
title_short |
Feature Selection Module for CNN Based Object Detector |
title_full |
Feature Selection Module for CNN Based Object Detector |
title_fullStr |
Feature Selection Module for CNN Based Object Detector |
title_full_unstemmed |
Feature Selection Module for CNN Based Object Detector |
title_sort |
feature selection module for cnn based object detector |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2021-01-01 |
description |
In the field of computer vision, the detection of multiple objects with different scales within a single image is challenging. To target this problem, feature pyramids are a basic component commonly found in multi-scale object detectors. In the construction of standard feature pyramids, different semantic features are simply connected to rebuild a new feature map, regardless of whether these features have a positive effect to the output or not. In order to avoid introducing too many redundant features within the feature fusion stage, a new feature fusion module called the Feature Selection Module (FSM) was proposed in this paper, which can automatically detect the most representative features for the rebuilding of feature maps. The channel attention mechanism in FSM is able to process and score each channel, filtering out irrelevant features while focusing on features with high contribution. Moreover, FSM can be easily embedded within feature pyramids. Simply adding a small number of trainable parameters to the network can significantly improve the ability of feature extraction. We validated our FSM with the VOC 2007 object detection dataset, based on Yolo series detectors. Findings from the present study demonstrates that for a small computational cost, our method is able to consistently improve the performance of Yolo detectors. |
topic |
Object detection image recognition multi-layer neural network attention mechanism feature fusion |
url |
https://ieeexplore.ieee.org/document/9405654/ |
work_keys_str_mv |
AT yongjunma featureselectionmoduleforcnnbasedobjectdetector AT songhuazhang featureselectionmoduleforcnnbasedobjectdetector |
_version_ |
1721440811042734080 |