DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images

As a basic research topic in the field of remote sensing, semantic segmentation of high-resolution aerial images has broad application prospects. However, most existing semantic segmentation methods usually extract multiscale features of images in a hierarchical manner and fail to make full use of t...

Full description

Bibliographic Details
Main Authors:	Yun-Cheng Li, Heng-Chao Li, Wen-Shuai Hu, Hui-Ling Yu
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:	Attention mechanism deep learning high-resolution aerial images multiscale learning semantic segmentation
Online Access:	https://ieeexplore.ieee.org/document/9507255/

id	doaj-86826f6952864be58fae4adde0fb4a6b
record_format	Article
spelling	doaj-86826f6952864be58fae4adde0fb4a6b2021-09-09T23:00:15ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352021-01-01148552856510.1109/JSTARS.2021.31021379507255DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial ImagesYun-Cheng Li0Heng-Chao Li1https://orcid.org/0000-0002-9735-570XWen-Shuai Hu2https://orcid.org/0000-0002-4757-2765Hui-Ling Yu3College of Information and Computer Engineering, Northeast Forestry University, Harbin, ChinaSchool of Information Science and Technology, Southwest Jiaotong University, Chengdu, ChinaSchool of Information Science and Technology, Southwest Jiaotong University, Chengdu, ChinaCollege of Information and Computer Engineering, Northeast Forestry University, Harbin, ChinaAs a basic research topic in the field of remote sensing, semantic segmentation of high-resolution aerial images has broad application prospects. However, most existing semantic segmentation methods usually extract multiscale features of images in a hierarchical manner and fail to make full use of the surface information from high-resolution remote sensing images. To address the above problem, we propose a novel dual-channel scale-aware segmentation network with position and channel attentions (DSPCANet) for high-resolution aerial images, which contains an Xception branch and a digital surface model-based position and channel attention fusion (DSMPCF) branch to process the near-infrared, red, and green (IRRG) spectral images and DSM images, respectively. First, the inner residual block (R2_Block) module represents the multiscale features at the granularity level and increases the range of the receptive field of each network layer. Furthermore, channel attention module (CAM) and improved position attention module (IPAM) are developed to embed into the DSMPCF branch to learn the geographic feature representation from the DSM images, while the Xception branch is applied to process the IRRG spectral images. Finally, in the fusion part of the proposed model, IPAM and CAM are further utilized to effectively model the fusion features from the spatial and channel dimensions, obtain the class-based correlation, and recalibrate the class-level information. The proposed DSPCANet model is evaluated on the ISPRS Vaihingen and Potsdam datasets, and the extensive experiments demonstrate that it is more accurate and efficient than other state-of-the-art methods.https://ieeexplore.ieee.org/document/9507255/Attention mechanismdeep learninghigh-resolution aerial imagesmultiscale learningsemantic segmentation
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Yun-Cheng Li Heng-Chao Li Wen-Shuai Hu Hui-Ling Yu
spellingShingle	Yun-Cheng Li Heng-Chao Li Wen-Shuai Hu Hui-Ling Yu DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Attention mechanism deep learning high-resolution aerial images multiscale learning semantic segmentation
author_facet	Yun-Cheng Li Heng-Chao Li Wen-Shuai Hu Hui-Ling Yu
author_sort	Yun-Cheng Li
title	DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images
title_short	DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images
title_full	DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images
title_fullStr	DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images
title_full_unstemmed	DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images
title_sort	dspcanet: dual-channel scale-aware segmentation network with position and channel attentions for high-resolution aerial images
publisher	IEEE
series	IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
issn	2151-1535
publishDate	2021-01-01
description	As a basic research topic in the field of remote sensing, semantic segmentation of high-resolution aerial images has broad application prospects. However, most existing semantic segmentation methods usually extract multiscale features of images in a hierarchical manner and fail to make full use of the surface information from high-resolution remote sensing images. To address the above problem, we propose a novel dual-channel scale-aware segmentation network with position and channel attentions (DSPCANet) for high-resolution aerial images, which contains an Xception branch and a digital surface model-based position and channel attention fusion (DSMPCF) branch to process the near-infrared, red, and green (IRRG) spectral images and DSM images, respectively. First, the inner residual block (R2_Block) module represents the multiscale features at the granularity level and increases the range of the receptive field of each network layer. Furthermore, channel attention module (CAM) and improved position attention module (IPAM) are developed to embed into the DSMPCF branch to learn the geographic feature representation from the DSM images, while the Xception branch is applied to process the IRRG spectral images. Finally, in the fusion part of the proposed model, IPAM and CAM are further utilized to effectively model the fusion features from the spatial and channel dimensions, obtain the class-based correlation, and recalibrate the class-level information. The proposed DSPCANet model is evaluated on the ISPRS Vaihingen and Potsdam datasets, and the extensive experiments demonstrate that it is more accurate and efficient than other state-of-the-art methods.
topic	Attention mechanism deep learning high-resolution aerial images multiscale learning semantic segmentation
url	https://ieeexplore.ieee.org/document/9507255/
work_keys_str_mv	AT yunchengli dspcanetdualchannelscaleawaresegmentationnetworkwithpositionandchannelattentionsforhighresolutionaerialimages AT hengchaoli dspcanetdualchannelscaleawaresegmentationnetworkwithpositionandchannelattentionsforhighresolutionaerialimages AT wenshuaihu dspcanetdualchannelscaleawaresegmentationnetworkwithpositionandchannelattentionsforhighresolutionaerialimages AT huilingyu dspcanetdualchannelscaleawaresegmentationnetworkwithpositionandchannelattentionsforhighresolutionaerialimages
_version_	1717758853161943040

DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images

Similar Items