Fine Building Segmentation in High-Resolution SAR Images Via Selective Pyramid Dilated Network

The building extraction from synthetic aperture radar (SAR) images has always been a challenging research topic. Recently, the deep convolution neural network brings excellent improvements in SAR segmentation. The fully convolutional network and other variants are widely transferred to the SAR studi...

Full description

Bibliographic Details
Main Authors: Hao Jing, Xian Sun, Zhirui Wang, Kaiqiang Chen, Wenhui Diao, Kun Fu
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9416747/
id doaj-f7c6d9d35a72458abf4eaa5863513442
record_format Article
spelling doaj-f7c6d9d35a72458abf4eaa58635134422021-07-13T23:00:13ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352021-01-01146608662310.1109/JSTARS.2021.30760859416747Fine Building Segmentation in High-Resolution SAR Images Via Selective Pyramid Dilated NetworkHao Jing0https://orcid.org/0000-0003-2354-0846Xian Sun1https://orcid.org/0000-0002-0038-9816Zhirui Wang2https://orcid.org/0000-0003-2877-0384Kaiqiang Chen3https://orcid.org/0000-0002-8314-2375Wenhui Diao4https://orcid.org/0000-0002-3931-3974Kun Fu5https://orcid.org/0000-0002-0450-6469Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaKey Laboratory of Network Information System Technology (NIST), Institute of Electronics, Chinese Academy of Sciences, Beijing, ChinaKey Laboratory of Network Information System Technology (NIST), Institute of Electronics, Chinese Academy of Sciences, Beijing, ChinaKey Laboratory of Network Information System Technology (NIST), Institute of Electronics, Chinese Academy of Sciences, Beijing, ChinaAerospace Information Research Institute, Chinese Academy of Sciences, Beijing, ChinaThe building extraction from synthetic aperture radar (SAR) images has always been a challenging research topic. Recently, the deep convolution neural network brings excellent improvements in SAR segmentation. The fully convolutional network and other variants are widely transferred to the SAR studies because of their high precision in optical images. They are still limited by their processing in terms of the geometric distortion of buildings, the variability of building structures, and scattering interference between adjacent targets in the SAR images. In this article, a unified framework called selective spatial pyramid dilated (SSPD) network is proposed for the fine building segmentation in SAR images. First, we propose a novel encoder–decoder structure for the fine building feature reconstruction. The enhanced encoder and the dual-stage decoder, composed of the CBM and the SSPD module, extract and recover the crucial multiscale information better. Second, we design the multilayer SSPD module based on the selective spatial attention. The multiscale building information with different attention on multiple branches is combined, optimized, and adaptively selected for adaptive filtering and extracting features of complex multiscale building targets in SAR images. Third, according to the building features and SAR imaging mechanism, a new loss function called L-shape weighting loss (LWloss) is proposed to heighten the attention on the L-shape footprint characteristics of the buildings and reduce the missing detection of line buildings. Besides, LWloss can also alleviate the class imbalance problem in the optimization stage. Finally, the experiments on a large-scene SAR image dataset demonstrate the effectiveness of the proposed method and verify its superiority over other approaches, such as the region-based Markov random field, U-net, and DeepLabv3+.https://ieeexplore.ieee.org/document/9416747/Automatic fine segmentation of buildingsL-shape weighting loss (LWloss)selective spatial pyramid dilated (SSPD) networksynthetic aperture radar (SAR)
collection DOAJ
language English
format Article
sources DOAJ
author Hao Jing
Xian Sun
Zhirui Wang
Kaiqiang Chen
Wenhui Diao
Kun Fu
spellingShingle Hao Jing
Xian Sun
Zhirui Wang
Kaiqiang Chen
Wenhui Diao
Kun Fu
Fine Building Segmentation in High-Resolution SAR Images Via Selective Pyramid Dilated Network
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Automatic fine segmentation of buildings
L-shape weighting loss (LWloss)
selective spatial pyramid dilated (SSPD) network
synthetic aperture radar (SAR)
author_facet Hao Jing
Xian Sun
Zhirui Wang
Kaiqiang Chen
Wenhui Diao
Kun Fu
author_sort Hao Jing
title Fine Building Segmentation in High-Resolution SAR Images Via Selective Pyramid Dilated Network
title_short Fine Building Segmentation in High-Resolution SAR Images Via Selective Pyramid Dilated Network
title_full Fine Building Segmentation in High-Resolution SAR Images Via Selective Pyramid Dilated Network
title_fullStr Fine Building Segmentation in High-Resolution SAR Images Via Selective Pyramid Dilated Network
title_full_unstemmed Fine Building Segmentation in High-Resolution SAR Images Via Selective Pyramid Dilated Network
title_sort fine building segmentation in high-resolution sar images via selective pyramid dilated network
publisher IEEE
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
issn 2151-1535
publishDate 2021-01-01
description The building extraction from synthetic aperture radar (SAR) images has always been a challenging research topic. Recently, the deep convolution neural network brings excellent improvements in SAR segmentation. The fully convolutional network and other variants are widely transferred to the SAR studies because of their high precision in optical images. They are still limited by their processing in terms of the geometric distortion of buildings, the variability of building structures, and scattering interference between adjacent targets in the SAR images. In this article, a unified framework called selective spatial pyramid dilated (SSPD) network is proposed for the fine building segmentation in SAR images. First, we propose a novel encoder–decoder structure for the fine building feature reconstruction. The enhanced encoder and the dual-stage decoder, composed of the CBM and the SSPD module, extract and recover the crucial multiscale information better. Second, we design the multilayer SSPD module based on the selective spatial attention. The multiscale building information with different attention on multiple branches is combined, optimized, and adaptively selected for adaptive filtering and extracting features of complex multiscale building targets in SAR images. Third, according to the building features and SAR imaging mechanism, a new loss function called L-shape weighting loss (LWloss) is proposed to heighten the attention on the L-shape footprint characteristics of the buildings and reduce the missing detection of line buildings. Besides, LWloss can also alleviate the class imbalance problem in the optimization stage. Finally, the experiments on a large-scene SAR image dataset demonstrate the effectiveness of the proposed method and verify its superiority over other approaches, such as the region-based Markov random field, U-net, and DeepLabv3+.
topic Automatic fine segmentation of buildings
L-shape weighting loss (LWloss)
selective spatial pyramid dilated (SSPD) network
synthetic aperture radar (SAR)
url https://ieeexplore.ieee.org/document/9416747/
work_keys_str_mv AT haojing finebuildingsegmentationinhighresolutionsarimagesviaselectivepyramiddilatednetwork
AT xiansun finebuildingsegmentationinhighresolutionsarimagesviaselectivepyramiddilatednetwork
AT zhiruiwang finebuildingsegmentationinhighresolutionsarimagesviaselectivepyramiddilatednetwork
AT kaiqiangchen finebuildingsegmentationinhighresolutionsarimagesviaselectivepyramiddilatednetwork
AT wenhuidiao finebuildingsegmentationinhighresolutionsarimagesviaselectivepyramiddilatednetwork
AT kunfu finebuildingsegmentationinhighresolutionsarimagesviaselectivepyramiddilatednetwork
_version_ 1721304783952805888