An Adaptive Multiscale Fusion Network Based on Regional Attention for Remote Sensing Images

With the widespread application of semantic segmentation in remote sensing images with high-resolution, how to improve the accuracy of segmentation becomes a research goal in the remote sensing field. An innovative Fully Convolutional Network (FCN) is proposed based on regional attention for improvi...

Full description

Bibliographic Details
Main Authors: Wanzhen Lu, Longxue Liang, Xiaosuo Wu, Xiaoyu Wang, Jiali Cai
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9109552/
id doaj-4ffc8216802542b883f28c301cb8a163
record_format Article
spelling doaj-4ffc8216802542b883f28c301cb8a1632021-03-30T02:55:48ZengIEEEIEEE Access2169-35362020-01-01810780210781310.1109/ACCESS.2020.30004259109552An Adaptive Multiscale Fusion Network Based on Regional Attention for Remote Sensing ImagesWanzhen Lu0https://orcid.org/0000-0003-3473-4670Longxue Liang1https://orcid.org/0000-0002-3938-7359Xiaosuo Wu2https://orcid.org/0000-0002-2707-2042Xiaoyu Wang3https://orcid.org/0000-0002-0213-5342Jiali Cai4https://orcid.org/0000-0003-2632-5084Electronic and Information Engineering Department, Lanzhou Jiaotong University, Lanzhou, ChinaElectronic and Information Engineering Department, Lanzhou Jiaotong University, Lanzhou, ChinaElectronic and Information Engineering Department, Lanzhou Jiaotong University, Lanzhou, ChinaElectronic and Information Engineering Department, Lanzhou Jiaotong University, Lanzhou, ChinaElectronic and Information Engineering Department, Lanzhou Jiaotong University, Lanzhou, ChinaWith the widespread application of semantic segmentation in remote sensing images with high-resolution, how to improve the accuracy of segmentation becomes a research goal in the remote sensing field. An innovative Fully Convolutional Network (FCN) is proposed based on regional attention for improving the performance of the semantic segmentation framework for remote sensing images. The proposed network follows the encoder-decoder architecture of semantic segmentation and includes the following three strategies to improve segmentation accuracy. The enhanced GCN module is applied to capture the semantic features of remote sensing images. MGFM is proposed to capture different contexts by sampling at different densities. Furthermore, RAM is offered to assign large weights to high-value information in different regions of the feature map. Our method is assessed on two datasets: ISPRS Potsdam dataset and CCF dataset. The results indicate that our model with those strategies outperforms baseline models (DCED50) concerning F1, mean IoU and PA, 10.81%,19.11%, and 11.36% on the Potsdam dataset and 29.26%, 27.64% and 13.57% on the CCF dataset.https://ieeexplore.ieee.org/document/9109552/Remote sensingfully convolutional networkssemantic segmentationencoder-decoder architectureregional attentionPotsdam dataset
collection DOAJ
language English
format Article
sources DOAJ
author Wanzhen Lu
Longxue Liang
Xiaosuo Wu
Xiaoyu Wang
Jiali Cai
spellingShingle Wanzhen Lu
Longxue Liang
Xiaosuo Wu
Xiaoyu Wang
Jiali Cai
An Adaptive Multiscale Fusion Network Based on Regional Attention for Remote Sensing Images
IEEE Access
Remote sensing
fully convolutional networks
semantic segmentation
encoder-decoder architecture
regional attention
Potsdam dataset
author_facet Wanzhen Lu
Longxue Liang
Xiaosuo Wu
Xiaoyu Wang
Jiali Cai
author_sort Wanzhen Lu
title An Adaptive Multiscale Fusion Network Based on Regional Attention for Remote Sensing Images
title_short An Adaptive Multiscale Fusion Network Based on Regional Attention for Remote Sensing Images
title_full An Adaptive Multiscale Fusion Network Based on Regional Attention for Remote Sensing Images
title_fullStr An Adaptive Multiscale Fusion Network Based on Regional Attention for Remote Sensing Images
title_full_unstemmed An Adaptive Multiscale Fusion Network Based on Regional Attention for Remote Sensing Images
title_sort adaptive multiscale fusion network based on regional attention for remote sensing images
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description With the widespread application of semantic segmentation in remote sensing images with high-resolution, how to improve the accuracy of segmentation becomes a research goal in the remote sensing field. An innovative Fully Convolutional Network (FCN) is proposed based on regional attention for improving the performance of the semantic segmentation framework for remote sensing images. The proposed network follows the encoder-decoder architecture of semantic segmentation and includes the following three strategies to improve segmentation accuracy. The enhanced GCN module is applied to capture the semantic features of remote sensing images. MGFM is proposed to capture different contexts by sampling at different densities. Furthermore, RAM is offered to assign large weights to high-value information in different regions of the feature map. Our method is assessed on two datasets: ISPRS Potsdam dataset and CCF dataset. The results indicate that our model with those strategies outperforms baseline models (DCED50) concerning F1, mean IoU and PA, 10.81%,19.11%, and 11.36% on the Potsdam dataset and 29.26%, 27.64% and 13.57% on the CCF dataset.
topic Remote sensing
fully convolutional networks
semantic segmentation
encoder-decoder architecture
regional attention
Potsdam dataset
url https://ieeexplore.ieee.org/document/9109552/
work_keys_str_mv AT wanzhenlu anadaptivemultiscalefusionnetworkbasedonregionalattentionforremotesensingimages
AT longxueliang anadaptivemultiscalefusionnetworkbasedonregionalattentionforremotesensingimages
AT xiaosuowu anadaptivemultiscalefusionnetworkbasedonregionalattentionforremotesensingimages
AT xiaoyuwang anadaptivemultiscalefusionnetworkbasedonregionalattentionforremotesensingimages
AT jialicai anadaptivemultiscalefusionnetworkbasedonregionalattentionforremotesensingimages
AT wanzhenlu adaptivemultiscalefusionnetworkbasedonregionalattentionforremotesensingimages
AT longxueliang adaptivemultiscalefusionnetworkbasedonregionalattentionforremotesensingimages
AT xiaosuowu adaptivemultiscalefusionnetworkbasedonregionalattentionforremotesensingimages
AT xiaoyuwang adaptivemultiscalefusionnetworkbasedonregionalattentionforremotesensingimages
AT jialicai adaptivemultiscalefusionnetworkbasedonregionalattentionforremotesensingimages
_version_ 1724184295924301824