A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images

Thanks to the excellent feature representation capabilities of neural networks, deep learning-based methods perform far better than traditional methods on target detection tasks such as ship detection. Although various network models have been proposed for SAR ship detection such as DRBox-v1, DRBox-...

Full description

Bibliographic Details
Main Authors: Rong Yang, Zhenru Pan, Xiaoxue Jia, Lei Zhang, Yunkai Deng
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9316765/
id doaj-26fa3de8d24e43169381048ed4d78d01
record_format Article
spelling doaj-26fa3de8d24e43169381048ed4d78d012021-06-03T23:06:40ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352021-01-01141938195810.1109/JSTARS.2021.30498519316765A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR ImagesRong Yang0https://orcid.org/0000-0001-5620-7615Zhenru Pan1https://orcid.org/0000-0002-8123-8939Xiaoxue Jia2https://orcid.org/0000-0003-0031-933XLei Zhang3Yunkai Deng4Department of Space Microwave Remote Sensing System, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaDepartment of Space Microwave Remote Sensing System, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaDepartment of Space Microwave Remote Sensing System, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaDepartment of Space Microwave Remote Sensing System, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaDepartment of Space Microwave Remote Sensing System, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaThanks to the excellent feature representation capabilities of neural networks, deep learning-based methods perform far better than traditional methods on target detection tasks such as ship detection. Although various network models have been proposed for SAR ship detection such as DRBox-v1, DRBox-v2, and MSR2N, there are still some problems such as mismatch of feature scale, contradictions between different learning tasks, and unbalanced distribution of positive samples, which have not been mentioned in these studies. In this article, an improved one-stage object detection framework based on RetinaNet and rotatable bounding box (RBox), which is referred as R-RetinaNet, is proposed to solve the above problems. The main improvements of R-RetinaNet as well as the contributions of this article are threefold. First, a scale calibration method is proposed to align the scale distribution of the output backbone feature map with the scale distribution of the targets. Second, a feature fusion network based on task-wise attention feature pyramid network is designed to decouple the feature optimization process of different tasks, which alleviates the conflict between different learning goals. Finally, an adaptive intersection over union (IoU) threshold training method is proposed for RBox-based model to correct the unbalanced distribution of positive samples caused by the fixed IoU threshold on RBox. Experimental results show that our method obtains 13.26%, 9.49%, 8.92%, and 4.55% gains in average precision under an IoU threshold of 0.5 on the public SAR ship detection dataset compared with four state-of-the-art RBox-based methods, respectively.https://ieeexplore.ieee.org/document/9316765/Neural networkrotatable bounding box (RBox)synthetic aperture radartarget detection
collection DOAJ
language English
format Article
sources DOAJ
author Rong Yang
Zhenru Pan
Xiaoxue Jia
Lei Zhang
Yunkai Deng
spellingShingle Rong Yang
Zhenru Pan
Xiaoxue Jia
Lei Zhang
Yunkai Deng
A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Neural network
rotatable bounding box (RBox)
synthetic aperture radar
target detection
author_facet Rong Yang
Zhenru Pan
Xiaoxue Jia
Lei Zhang
Yunkai Deng
author_sort Rong Yang
title A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images
title_short A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images
title_full A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images
title_fullStr A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images
title_full_unstemmed A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images
title_sort novel cnn-based detector for ship detection based on rotatable bounding box in sar images
publisher IEEE
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
issn 2151-1535
publishDate 2021-01-01
description Thanks to the excellent feature representation capabilities of neural networks, deep learning-based methods perform far better than traditional methods on target detection tasks such as ship detection. Although various network models have been proposed for SAR ship detection such as DRBox-v1, DRBox-v2, and MSR2N, there are still some problems such as mismatch of feature scale, contradictions between different learning tasks, and unbalanced distribution of positive samples, which have not been mentioned in these studies. In this article, an improved one-stage object detection framework based on RetinaNet and rotatable bounding box (RBox), which is referred as R-RetinaNet, is proposed to solve the above problems. The main improvements of R-RetinaNet as well as the contributions of this article are threefold. First, a scale calibration method is proposed to align the scale distribution of the output backbone feature map with the scale distribution of the targets. Second, a feature fusion network based on task-wise attention feature pyramid network is designed to decouple the feature optimization process of different tasks, which alleviates the conflict between different learning goals. Finally, an adaptive intersection over union (IoU) threshold training method is proposed for RBox-based model to correct the unbalanced distribution of positive samples caused by the fixed IoU threshold on RBox. Experimental results show that our method obtains 13.26%, 9.49%, 8.92%, and 4.55% gains in average precision under an IoU threshold of 0.5 on the public SAR ship detection dataset compared with four state-of-the-art RBox-based methods, respectively.
topic Neural network
rotatable bounding box (RBox)
synthetic aperture radar
target detection
url https://ieeexplore.ieee.org/document/9316765/
work_keys_str_mv AT rongyang anovelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages
AT zhenrupan anovelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages
AT xiaoxuejia anovelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages
AT leizhang anovelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages
AT yunkaideng anovelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages
AT rongyang novelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages
AT zhenrupan novelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages
AT xiaoxuejia novelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages
AT leizhang novelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages
AT yunkaideng novelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages
_version_ 1721398540462194688