A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images
Thanks to the excellent feature representation capabilities of neural networks, deep learning-based methods perform far better than traditional methods on target detection tasks such as ship detection. Although various network models have been proposed for SAR ship detection such as DRBox-v1, DRBox-...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9316765/ |
id |
doaj-26fa3de8d24e43169381048ed4d78d01 |
---|---|
record_format |
Article |
spelling |
doaj-26fa3de8d24e43169381048ed4d78d012021-06-03T23:06:40ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352021-01-01141938195810.1109/JSTARS.2021.30498519316765A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR ImagesRong Yang0https://orcid.org/0000-0001-5620-7615Zhenru Pan1https://orcid.org/0000-0002-8123-8939Xiaoxue Jia2https://orcid.org/0000-0003-0031-933XLei Zhang3Yunkai Deng4Department of Space Microwave Remote Sensing System, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaDepartment of Space Microwave Remote Sensing System, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaDepartment of Space Microwave Remote Sensing System, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaDepartment of Space Microwave Remote Sensing System, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaDepartment of Space Microwave Remote Sensing System, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaThanks to the excellent feature representation capabilities of neural networks, deep learning-based methods perform far better than traditional methods on target detection tasks such as ship detection. Although various network models have been proposed for SAR ship detection such as DRBox-v1, DRBox-v2, and MSR2N, there are still some problems such as mismatch of feature scale, contradictions between different learning tasks, and unbalanced distribution of positive samples, which have not been mentioned in these studies. In this article, an improved one-stage object detection framework based on RetinaNet and rotatable bounding box (RBox), which is referred as R-RetinaNet, is proposed to solve the above problems. The main improvements of R-RetinaNet as well as the contributions of this article are threefold. First, a scale calibration method is proposed to align the scale distribution of the output backbone feature map with the scale distribution of the targets. Second, a feature fusion network based on task-wise attention feature pyramid network is designed to decouple the feature optimization process of different tasks, which alleviates the conflict between different learning goals. Finally, an adaptive intersection over union (IoU) threshold training method is proposed for RBox-based model to correct the unbalanced distribution of positive samples caused by the fixed IoU threshold on RBox. Experimental results show that our method obtains 13.26%, 9.49%, 8.92%, and 4.55% gains in average precision under an IoU threshold of 0.5 on the public SAR ship detection dataset compared with four state-of-the-art RBox-based methods, respectively.https://ieeexplore.ieee.org/document/9316765/Neural networkrotatable bounding box (RBox)synthetic aperture radartarget detection |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Rong Yang Zhenru Pan Xiaoxue Jia Lei Zhang Yunkai Deng |
spellingShingle |
Rong Yang Zhenru Pan Xiaoxue Jia Lei Zhang Yunkai Deng A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Neural network rotatable bounding box (RBox) synthetic aperture radar target detection |
author_facet |
Rong Yang Zhenru Pan Xiaoxue Jia Lei Zhang Yunkai Deng |
author_sort |
Rong Yang |
title |
A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images |
title_short |
A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images |
title_full |
A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images |
title_fullStr |
A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images |
title_full_unstemmed |
A Novel CNN-Based Detector for Ship Detection Based on Rotatable Bounding Box in SAR Images |
title_sort |
novel cnn-based detector for ship detection based on rotatable bounding box in sar images |
publisher |
IEEE |
series |
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
issn |
2151-1535 |
publishDate |
2021-01-01 |
description |
Thanks to the excellent feature representation capabilities of neural networks, deep learning-based methods perform far better than traditional methods on target detection tasks such as ship detection. Although various network models have been proposed for SAR ship detection such as DRBox-v1, DRBox-v2, and MSR2N, there are still some problems such as mismatch of feature scale, contradictions between different learning tasks, and unbalanced distribution of positive samples, which have not been mentioned in these studies. In this article, an improved one-stage object detection framework based on RetinaNet and rotatable bounding box (RBox), which is referred as R-RetinaNet, is proposed to solve the above problems. The main improvements of R-RetinaNet as well as the contributions of this article are threefold. First, a scale calibration method is proposed to align the scale distribution of the output backbone feature map with the scale distribution of the targets. Second, a feature fusion network based on task-wise attention feature pyramid network is designed to decouple the feature optimization process of different tasks, which alleviates the conflict between different learning goals. Finally, an adaptive intersection over union (IoU) threshold training method is proposed for RBox-based model to correct the unbalanced distribution of positive samples caused by the fixed IoU threshold on RBox. Experimental results show that our method obtains 13.26%, 9.49%, 8.92%, and 4.55% gains in average precision under an IoU threshold of 0.5 on the public SAR ship detection dataset compared with four state-of-the-art RBox-based methods, respectively. |
topic |
Neural network rotatable bounding box (RBox) synthetic aperture radar target detection |
url |
https://ieeexplore.ieee.org/document/9316765/ |
work_keys_str_mv |
AT rongyang anovelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT zhenrupan anovelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT xiaoxuejia anovelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT leizhang anovelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT yunkaideng anovelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT rongyang novelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT zhenrupan novelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT xiaoxuejia novelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT leizhang novelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages AT yunkaideng novelcnnbaseddetectorforshipdetectionbasedonrotatableboundingboxinsarimages |
_version_ |
1721398540462194688 |