DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images
As a basic research topic in the field of remote sensing, semantic segmentation of high-resolution aerial images has broad application prospects. However, most existing semantic segmentation methods usually extract multiscale features of images in a hierarchical manner and fail to make full use of t...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9507255/ |
id |
doaj-86826f6952864be58fae4adde0fb4a6b |
---|---|
record_format |
Article |
spelling |
doaj-86826f6952864be58fae4adde0fb4a6b2021-09-09T23:00:15ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352021-01-01148552856510.1109/JSTARS.2021.31021379507255DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial ImagesYun-Cheng Li0Heng-Chao Li1https://orcid.org/0000-0002-9735-570XWen-Shuai Hu2https://orcid.org/0000-0002-4757-2765Hui-Ling Yu3College of Information and Computer Engineering, Northeast Forestry University, Harbin, ChinaSchool of Information Science and Technology, Southwest Jiaotong University, Chengdu, ChinaSchool of Information Science and Technology, Southwest Jiaotong University, Chengdu, ChinaCollege of Information and Computer Engineering, Northeast Forestry University, Harbin, ChinaAs a basic research topic in the field of remote sensing, semantic segmentation of high-resolution aerial images has broad application prospects. However, most existing semantic segmentation methods usually extract multiscale features of images in a hierarchical manner and fail to make full use of the surface information from high-resolution remote sensing images. To address the above problem, we propose a novel dual-channel scale-aware segmentation network with position and channel attentions (DSPCANet) for high-resolution aerial images, which contains an Xception branch and a digital surface model-based position and channel attention fusion (DSMPCF) branch to process the near-infrared, red, and green (IRRG) spectral images and DSM images, respectively. First, the inner residual block (R2_Block) module represents the multiscale features at the granularity level and increases the range of the receptive field of each network layer. Furthermore, channel attention module (CAM) and improved position attention module (IPAM) are developed to embed into the DSMPCF branch to learn the geographic feature representation from the DSM images, while the Xception branch is applied to process the IRRG spectral images. Finally, in the fusion part of the proposed model, IPAM and CAM are further utilized to effectively model the fusion features from the spatial and channel dimensions, obtain the class-based correlation, and recalibrate the class-level information. The proposed DSPCANet model is evaluated on the ISPRS Vaihingen and Potsdam datasets, and the extensive experiments demonstrate that it is more accurate and efficient than other state-of-the-art methods.https://ieeexplore.ieee.org/document/9507255/Attention mechanismdeep learninghigh-resolution aerial imagesmultiscale learningsemantic segmentation |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yun-Cheng Li Heng-Chao Li Wen-Shuai Hu Hui-Ling Yu |
spellingShingle |
Yun-Cheng Li Heng-Chao Li Wen-Shuai Hu Hui-Ling Yu DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Attention mechanism deep learning high-resolution aerial images multiscale learning semantic segmentation |
author_facet |
Yun-Cheng Li Heng-Chao Li Wen-Shuai Hu Hui-Ling Yu |
author_sort |
Yun-Cheng Li |
title |
DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images |
title_short |
DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images |
title_full |
DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images |
title_fullStr |
DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images |
title_full_unstemmed |
DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images |
title_sort |
dspcanet: dual-channel scale-aware segmentation network with position and channel attentions for high-resolution aerial images |
publisher |
IEEE |
series |
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
issn |
2151-1535 |
publishDate |
2021-01-01 |
description |
As a basic research topic in the field of remote sensing, semantic segmentation of high-resolution aerial images has broad application prospects. However, most existing semantic segmentation methods usually extract multiscale features of images in a hierarchical manner and fail to make full use of the surface information from high-resolution remote sensing images. To address the above problem, we propose a novel dual-channel scale-aware segmentation network with position and channel attentions (DSPCANet) for high-resolution aerial images, which contains an Xception branch and a digital surface model-based position and channel attention fusion (DSMPCF) branch to process the near-infrared, red, and green (IRRG) spectral images and DSM images, respectively. First, the inner residual block (R2_Block) module represents the multiscale features at the granularity level and increases the range of the receptive field of each network layer. Furthermore, channel attention module (CAM) and improved position attention module (IPAM) are developed to embed into the DSMPCF branch to learn the geographic feature representation from the DSM images, while the Xception branch is applied to process the IRRG spectral images. Finally, in the fusion part of the proposed model, IPAM and CAM are further utilized to effectively model the fusion features from the spatial and channel dimensions, obtain the class-based correlation, and recalibrate the class-level information. The proposed DSPCANet model is evaluated on the ISPRS Vaihingen and Potsdam datasets, and the extensive experiments demonstrate that it is more accurate and efficient than other state-of-the-art methods. |
topic |
Attention mechanism deep learning high-resolution aerial images multiscale learning semantic segmentation |
url |
https://ieeexplore.ieee.org/document/9507255/ |
work_keys_str_mv |
AT yunchengli dspcanetdualchannelscaleawaresegmentationnetworkwithpositionandchannelattentionsforhighresolutionaerialimages AT hengchaoli dspcanetdualchannelscaleawaresegmentationnetworkwithpositionandchannelattentionsforhighresolutionaerialimages AT wenshuaihu dspcanetdualchannelscaleawaresegmentationnetworkwithpositionandchannelattentionsforhighresolutionaerialimages AT huilingyu dspcanetdualchannelscaleawaresegmentationnetworkwithpositionandchannelattentionsforhighresolutionaerialimages |
_version_ |
1717758853161943040 |