SSD-TSEFFM: New SSD Using Trident Feature and Squeeze and Extraction Feature Fusion

The single shot multi-box detector (SSD) exhibits low accuracy in small-object detection; this is because it does not consider the scale contextual information between its layers, and the shallow layers lack adequate semantic information. To improve the accuracy of the original SSD, this paper propo...

Full description

Bibliographic Details
Published in:Sensors
Main Authors: Young-Joon Hwang, Jin-Gu Lee, Un-Chul Moon, Ho-Hyun Park
Format: Article
Language:English
Published: MDPI AG 2020-06-01
Subjects:
Online Access:https://www.mdpi.com/1424-8220/20/13/3630
_version_ 1849890460064022528
author Young-Joon Hwang
Jin-Gu Lee
Un-Chul Moon
Ho-Hyun Park
author_facet Young-Joon Hwang
Jin-Gu Lee
Un-Chul Moon
Ho-Hyun Park
author_sort Young-Joon Hwang
collection DOAJ
container_title Sensors
description The single shot multi-box detector (SSD) exhibits low accuracy in small-object detection; this is because it does not consider the scale contextual information between its layers, and the shallow layers lack adequate semantic information. To improve the accuracy of the original SSD, this paper proposes a new single shot multi-box detector using trident feature and squeeze and extraction feature fusion (SSD-TSEFFM); this detector employs the trident network and the squeeze and excitation feature fusion module. Furthermore, a trident feature module (TFM) is developed, inspired by the trident network, to consider the scale contextual information. The use of this module makes the proposed model robust to scale changes owing to the application of dilated convolution. Further, the squeeze and excitation block feature fusion module (SEFFM) is used to provide more semantic information to the model. The SSD-TSEFFM is compared with the faster regions with convolution neural network features (RCNN) (2015), SSD (2016), and DF-SSD (2020) on the PASCAL VOC 2007 and 2012 datasets. The experimental results demonstrate the high accuracy of the proposed model in small-object detection, in addition to a good overall accuracy. The SSD-TSEFFM achieved 80.4% mAP and 80.2% mAP on the 2007 and 2012 datasets, respectively. This indicates an average improvement of approximately 2% over other models.
format Article
id doaj-art-dff8f63e034b451bb310dad71d66ffce
institution Directory of Open Access Journals
issn 1424-8220
language English
publishDate 2020-06-01
publisher MDPI AG
record_format Article
spelling doaj-art-dff8f63e034b451bb310dad71d66ffce2025-08-20T01:04:02ZengMDPI AGSensors1424-82202020-06-012013363010.3390/s20133630SSD-TSEFFM: New SSD Using Trident Feature and Squeeze and Extraction Feature FusionYoung-Joon Hwang0Jin-Gu Lee1Un-Chul Moon2Ho-Hyun Park3School of Electrical and Electronics Engineering, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu, Seoul 06974, KoreaSchool of Electrical and Electronics Engineering, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu, Seoul 06974, KoreaSchool of Electrical and Electronics Engineering, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu, Seoul 06974, KoreaSchool of Electrical and Electronics Engineering, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu, Seoul 06974, KoreaThe single shot multi-box detector (SSD) exhibits low accuracy in small-object detection; this is because it does not consider the scale contextual information between its layers, and the shallow layers lack adequate semantic information. To improve the accuracy of the original SSD, this paper proposes a new single shot multi-box detector using trident feature and squeeze and extraction feature fusion (SSD-TSEFFM); this detector employs the trident network and the squeeze and excitation feature fusion module. Furthermore, a trident feature module (TFM) is developed, inspired by the trident network, to consider the scale contextual information. The use of this module makes the proposed model robust to scale changes owing to the application of dilated convolution. Further, the squeeze and excitation block feature fusion module (SEFFM) is used to provide more semantic information to the model. The SSD-TSEFFM is compared with the faster regions with convolution neural network features (RCNN) (2015), SSD (2016), and DF-SSD (2020) on the PASCAL VOC 2007 and 2012 datasets. The experimental results demonstrate the high accuracy of the proposed model in small-object detection, in addition to a good overall accuracy. The SSD-TSEFFM achieved 80.4% mAP and 80.2% mAP on the 2007 and 2012 datasets, respectively. This indicates an average improvement of approximately 2% over other models.https://www.mdpi.com/1424-8220/20/13/3630small-object detectionSSDtrident networksqueeze and excitationfeature fusion
spellingShingle Young-Joon Hwang
Jin-Gu Lee
Un-Chul Moon
Ho-Hyun Park
SSD-TSEFFM: New SSD Using Trident Feature and Squeeze and Extraction Feature Fusion
small-object detection
SSD
trident network
squeeze and excitation
feature fusion
title SSD-TSEFFM: New SSD Using Trident Feature and Squeeze and Extraction Feature Fusion
title_full SSD-TSEFFM: New SSD Using Trident Feature and Squeeze and Extraction Feature Fusion
title_fullStr SSD-TSEFFM: New SSD Using Trident Feature and Squeeze and Extraction Feature Fusion
title_full_unstemmed SSD-TSEFFM: New SSD Using Trident Feature and Squeeze and Extraction Feature Fusion
title_short SSD-TSEFFM: New SSD Using Trident Feature and Squeeze and Extraction Feature Fusion
title_sort ssd tseffm new ssd using trident feature and squeeze and extraction feature fusion
topic small-object detection
SSD
trident network
squeeze and excitation
feature fusion
url https://www.mdpi.com/1424-8220/20/13/3630
work_keys_str_mv AT youngjoonhwang ssdtseffmnewssdusingtridentfeatureandsqueezeandextractionfeaturefusion
AT jingulee ssdtseffmnewssdusingtridentfeatureandsqueezeandextractionfeaturefusion
AT unchulmoon ssdtseffmnewssdusingtridentfeatureandsqueezeandextractionfeaturefusion
AT hohyunpark ssdtseffmnewssdusingtridentfeatureandsqueezeandextractionfeaturefusion