A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images

Thanks to the excellent feature representation capabilities of neural networks, deep learning-based methods perform far better than traditional methods on target detection tasks such as ship detection. Although various network models have been proposed for SAR ship detection such as DRBox-v1, DRBox-...

Full description

Bibliographic Details
Main Authors:	Rong Yang, Zhenru Pan, Xiaoxue Jia, Lei Zhang, Yunkai Deng
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:	Neural network rotatable bounding box (RBox) synthetic aperture radar target detection
Online Access:	https://ieeexplore.ieee.org/document/9316765/

id	doaj-26fa3de8d24e43169381048ed4d78d01
record_format	Article
spelling	doaj-26fa3de8d24e43169381048ed4d78d012021-06-03T23:06:40ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352021-01-01141938195810.1109/JSTARS.2021.30498519316765A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR ImagesRong Yang0https://orcid.org/0000-0001-5620-7615Zhenru Pan1https://orcid.org/0000-0002-8123-8939Xiaoxue Jia2https://orcid.org/0000-0003-0031-933XLei Zhang3Yunkai Deng4Department of Space Microwave Remote Sensing System, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaDepartment of Space Microwave Remote Sensing System, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaDepartment of Space Microwave Remote Sensing System, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaDepartment of Space Microwave Remote Sensing System, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaDepartment of Space Microwave Remote Sensing System, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaThanks to the excellent feature representation capabilities of neural networks, deep learning-based methods perform far better than traditional methods on target detection tasks such as ship detection. Although various network models have been proposed for SAR ship detection such as DRBox-v1, DRBox-v2, and MSR2N, there are still some problems such as mismatch of feature scale, contradictions between different learning tasks, and unbalanced distribution of positive samples, which have not been mentioned in these studies. In this article, an improved one-stage object detection framework based on RetinaNet and rotatable bounding box (RBox), which is referred as R-RetinaNet, is proposed to solve the above problems. The main improvements of R-RetinaNet as well as the contributions of this article are threefold. First, a scale calibration method is proposed to align the scale distribution of the output backbone feature map with the scale distribution of the targets. Second, a feature fusion network based on task-wise attention feature pyramid network is designed to decouple the feature optimization process of different tasks, which alleviates the conflict between different learning goals. Finally, an adaptive intersection over union (IoU) threshold training method is proposed for RBox-based model to correct the unbalanced distribution of positive samples caused by the fixed IoU threshold on RBox. Experimental results show that our method obtains 13.26%, 9.49%, 8.92%, and 4.55% gains in average precision under an IoU threshold of 0.5 on the public SAR ship detection dataset compared with four state-of-the-art RBox-based methods, respectively.https://ieeexplore.ieee.org/document/9316765/Neural networkrotatable bounding box (RBox)synthetic aperture radartarget detection
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Rong Yang Zhenru Pan Xiaoxue Jia Lei Zhang Yunkai Deng
spellingShingle	Rong Yang Zhenru Pan Xiaoxue Jia Lei Zhang Yunkai Deng A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Neural network rotatable bounding box (RBox) synthetic aperture radar target detection
author_facet	Rong Yang Zhenru Pan Xiaoxue Jia Lei Zhang Yunkai Deng
author_sort	Rong Yang
title	A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images
title_short	A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images
title_full	A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images
title_fullStr	A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images
title_full_unstemmed	A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images
title_sort	novel cnn-based detector for ship detection based on rotatable bounding box in sar images
publisher	IEEE
series	IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
issn	2151-1535
publishDate	2021-01-01
description	Thanks to the excellent feature representation capabilities of neural networks, deep learning-based methods perform far better than traditional methods on target detection tasks such as ship detection. Although various network models have been proposed for SAR ship detection such as DRBox-v1, DRBox-v2, and MSR2N, there are still some problems such as mismatch of feature scale, contradictions between different learning tasks, and unbalanced distribution of positive samples, which have not been mentioned in these studies. In this article, an improved one-stage object detection framework based on RetinaNet and rotatable bounding box (RBox), which is referred as R-RetinaNet, is proposed to solve the above problems. The main improvements of R-RetinaNet as well as the contributions of this article are threefold. First, a scale calibration method is proposed to align the scale distribution of the output backbone feature map with the scale distribution of the targets. Second, a feature fusion network based on task-wise attention feature pyramid network is designed to decouple the feature optimization process of different tasks, which alleviates the conflict between different learning goals. Finally, an adaptive intersection over union (IoU) threshold training method is proposed for RBox-based model to correct the unbalanced distribution of positive samples caused by the fixed IoU threshold on RBox. Experimental results show that our method obtains 13.26%, 9.49%, 8.92%, and 4.55% gains in average precision under an IoU threshold of 0.5 on the public SAR ship detection dataset compared with four state-of-the-art RBox-based methods, respectively.
topic	Neural network rotatable bounding box (RBox) synthetic aperture radar target detection
url	https://ieeexplore.ieee.org/document/9316765/
work_keys_str_mv	AT rongyang anovelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT zhenrupan anovelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT xiaoxuejia anovelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT leizhang anovelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT yunkaideng anovelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT rongyang novelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT zhenrupan novelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT xiaoxuejia novelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT leizhang novelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT yunkaideng novelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages
_version_	1721398540462194688

A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images

Similar Items