DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images

As a basic research topic in the field of remote sensing, semantic segmentation of high-resolution aerial images has broad application prospects. However, most existing semantic segmentation methods usually extract multiscale features of images in a hierarchical manner and fail to make full use of t...

Full description

Bibliographic Details
Main Authors: Yun-Cheng Li, Heng-Chao Li, Wen-Shuai Hu, Hui-Ling Yu
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9507255/
id doaj-86826f6952864be58fae4adde0fb4a6b
record_format Article
spelling doaj-86826f6952864be58fae4adde0fb4a6b2021-09-09T23:00:15ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352021-01-01148552856510.1109/JSTARS.2021.31021379507255DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial ImagesYun-Cheng Li0Heng-Chao Li1https://orcid.org/0000-0002-9735-570XWen-Shuai Hu2https://orcid.org/0000-0002-4757-2765Hui-Ling Yu3College of Information and Computer Engineering, Northeast Forestry University, Harbin, ChinaSchool of Information Science and Technology, Southwest Jiaotong University, Chengdu, ChinaSchool of Information Science and Technology, Southwest Jiaotong University, Chengdu, ChinaCollege of Information and Computer Engineering, Northeast Forestry University, Harbin, ChinaAs a basic research topic in the field of remote sensing, semantic segmentation of high-resolution aerial images has broad application prospects. However, most existing semantic segmentation methods usually extract multiscale features of images in a hierarchical manner and fail to make full use of the surface information from high-resolution remote sensing images. To address the above problem, we propose a novel dual-channel scale-aware segmentation network with position and channel attentions (DSPCANet) for high-resolution aerial images, which contains an Xception branch and a digital surface model-based position and channel attention fusion (DSMPCF) branch to process the near-infrared, red, and green (IRRG) spectral images and DSM images, respectively. First, the inner residual block (R2_Block) module represents the multiscale features at the granularity level and increases the range of the receptive field of each network layer. Furthermore, channel attention module (CAM) and improved position attention module (IPAM) are developed to embed into the DSMPCF branch to learn the geographic feature representation from the DSM images, while the Xception branch is applied to process the IRRG spectral images. Finally, in the fusion part of the proposed model, IPAM and CAM are further utilized to effectively model the fusion features from the spatial and channel dimensions, obtain the class-based correlation, and recalibrate the class-level information. The proposed DSPCANet model is evaluated on the ISPRS Vaihingen and Potsdam datasets, and the extensive experiments demonstrate that it is more accurate and efficient than other state-of-the-art methods.https://ieeexplore.ieee.org/document/9507255/Attention mechanismdeep learninghigh-resolution aerial imagesmultiscale learningsemantic segmentation
collection DOAJ
language English
format Article
sources DOAJ
author Yun-Cheng Li
Heng-Chao Li
Wen-Shuai Hu
Hui-Ling Yu
spellingShingle Yun-Cheng Li
Heng-Chao Li
Wen-Shuai Hu
Hui-Ling Yu
DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Attention mechanism
deep learning
high-resolution aerial images
multiscale learning
semantic segmentation
author_facet Yun-Cheng Li
Heng-Chao Li
Wen-Shuai Hu
Hui-Ling Yu
author_sort Yun-Cheng Li
title DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images
title_short DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images
title_full DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images
title_fullStr DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images
title_full_unstemmed DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images
title_sort dspcanet: dual-channel scale-aware segmentation network with position and channel attentions for high-resolution aerial images
publisher IEEE
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
issn 2151-1535
publishDate 2021-01-01
description As a basic research topic in the field of remote sensing, semantic segmentation of high-resolution aerial images has broad application prospects. However, most existing semantic segmentation methods usually extract multiscale features of images in a hierarchical manner and fail to make full use of the surface information from high-resolution remote sensing images. To address the above problem, we propose a novel dual-channel scale-aware segmentation network with position and channel attentions (DSPCANet) for high-resolution aerial images, which contains an Xception branch and a digital surface model-based position and channel attention fusion (DSMPCF) branch to process the near-infrared, red, and green (IRRG) spectral images and DSM images, respectively. First, the inner residual block (R2_Block) module represents the multiscale features at the granularity level and increases the range of the receptive field of each network layer. Furthermore, channel attention module (CAM) and improved position attention module (IPAM) are developed to embed into the DSMPCF branch to learn the geographic feature representation from the DSM images, while the Xception branch is applied to process the IRRG spectral images. Finally, in the fusion part of the proposed model, IPAM and CAM are further utilized to effectively model the fusion features from the spatial and channel dimensions, obtain the class-based correlation, and recalibrate the class-level information. The proposed DSPCANet model is evaluated on the ISPRS Vaihingen and Potsdam datasets, and the extensive experiments demonstrate that it is more accurate and efficient than other state-of-the-art methods.
topic Attention mechanism
deep learning
high-resolution aerial images
multiscale learning
semantic segmentation
url https://ieeexplore.ieee.org/document/9507255/
work_keys_str_mv AT yunchengli dspcanetdualchannelscaleawaresegmentationnetworkwithpositionandchannelattentionsforhighresolutionaerialimages
AT hengchaoli dspcanetdualchannelscaleawaresegmentationnetworkwithpositionandchannelattentionsforhighresolutionaerialimages
AT wenshuaihu dspcanetdualchannelscaleawaresegmentationnetworkwithpositionandchannelattentionsforhighresolutionaerialimages
AT huilingyu dspcanetdualchannelscaleawaresegmentationnetworkwithpositionandchannelattentionsforhighresolutionaerialimages
_version_ 1717758853161943040