Hierarchical Attention and Bilinear Fusion for Remote Sensing Image Scene Classification

Remote sensing image scene classification is an important means for the understanding of remote sensing images. Convolutional neural networks (CNNs) have been successfully applied to remote sensing image scene classification and have demonstrated remarkable performance. However, with improvements in...

Full description

Bibliographic Details
Main Authors: Donghang Yu, Haitao Guo, Qing Xu, Jun Lu, Chuan Zhao, Yuzhun Lin
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9222574/
id doaj-2a1ae314dd7f40a0b80353b3d4247051
record_format Article
spelling doaj-2a1ae314dd7f40a0b80353b3d42470512021-06-03T23:05:26ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352020-01-01136372638310.1109/JSTARS.2020.30302579222574Hierarchical Attention and Bilinear Fusion for Remote Sensing Image Scene ClassificationDonghang Yu0https://orcid.org/0000-0002-5858-7671Haitao Guo1https://orcid.org/0000-0001-5769-2798Qing Xu2https://orcid.org/0000-0003-2505-7188Jun Lu3Chuan Zhao4https://orcid.org/0000-0001-9197-9768Yuzhun Lin5PLA Strategic Support Force Information Engineering University, Zheng Zhou, ChinaPLA Strategic Support Force Information Engineering University, Zheng Zhou, ChinaPLA Strategic Support Force Information Engineering University, Zheng Zhou, ChinaPLA Strategic Support Force Information Engineering University, Zheng Zhou, ChinaPLA Strategic Support Force Information Engineering University, Zheng Zhou, ChinaPLA Strategic Support Force Information Engineering University, Zheng Zhou, ChinaRemote sensing image scene classification is an important means for the understanding of remote sensing images. Convolutional neural networks (CNNs) have been successfully applied to remote sensing image scene classification and have demonstrated remarkable performance. However, with improvements in image resolution, remote sensing image categories are becoming increasingly diverse, and problems such as high intraclass diversity and high interclass similarity have arisen. The performance of ordinary CNNs at distinguishing increasingly complex remote sensing images is still limited. Therefore, we propose a feature fusion framework based on hierarchical attention and bilinear pooling called HABFNet for the scene classification of remote sensing images. First, the deep CNN ResNet50 is used to extract the deep features from different layers of the image, and these features are fused to boost their robustness and effectiveness. Second, we design an improved channel attention scheme to enhance the features from different layers. Finally, the enhanced features are cross-layer bilinearly pooled and fused, and the fused features are used for classification. Extensive experiments were conducted on three publicly available remote sensing image benchmarks. Comparisons with the state-of-the-art methods demonstrated that the proposed HABFNet achieved competitive classification performance.https://ieeexplore.ieee.org/document/9222574/Bilinear poolingchannel attentionhierarchical feature fusionremote sensing imagescene classification
collection DOAJ
language English
format Article
sources DOAJ
author Donghang Yu
Haitao Guo
Qing Xu
Jun Lu
Chuan Zhao
Yuzhun Lin
spellingShingle Donghang Yu
Haitao Guo
Qing Xu
Jun Lu
Chuan Zhao
Yuzhun Lin
Hierarchical Attention and Bilinear Fusion for Remote Sensing Image Scene Classification
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Bilinear pooling
channel attention
hierarchical feature fusion
remote sensing image
scene classification
author_facet Donghang Yu
Haitao Guo
Qing Xu
Jun Lu
Chuan Zhao
Yuzhun Lin
author_sort Donghang Yu
title Hierarchical Attention and Bilinear Fusion for Remote Sensing Image Scene Classification
title_short Hierarchical Attention and Bilinear Fusion for Remote Sensing Image Scene Classification
title_full Hierarchical Attention and Bilinear Fusion for Remote Sensing Image Scene Classification
title_fullStr Hierarchical Attention and Bilinear Fusion for Remote Sensing Image Scene Classification
title_full_unstemmed Hierarchical Attention and Bilinear Fusion for Remote Sensing Image Scene Classification
title_sort hierarchical attention and bilinear fusion for remote sensing image scene classification
publisher IEEE
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
issn 2151-1535
publishDate 2020-01-01
description Remote sensing image scene classification is an important means for the understanding of remote sensing images. Convolutional neural networks (CNNs) have been successfully applied to remote sensing image scene classification and have demonstrated remarkable performance. However, with improvements in image resolution, remote sensing image categories are becoming increasingly diverse, and problems such as high intraclass diversity and high interclass similarity have arisen. The performance of ordinary CNNs at distinguishing increasingly complex remote sensing images is still limited. Therefore, we propose a feature fusion framework based on hierarchical attention and bilinear pooling called HABFNet for the scene classification of remote sensing images. First, the deep CNN ResNet50 is used to extract the deep features from different layers of the image, and these features are fused to boost their robustness and effectiveness. Second, we design an improved channel attention scheme to enhance the features from different layers. Finally, the enhanced features are cross-layer bilinearly pooled and fused, and the fused features are used for classification. Extensive experiments were conducted on three publicly available remote sensing image benchmarks. Comparisons with the state-of-the-art methods demonstrated that the proposed HABFNet achieved competitive classification performance.
topic Bilinear pooling
channel attention
hierarchical feature fusion
remote sensing image
scene classification
url https://ieeexplore.ieee.org/document/9222574/
work_keys_str_mv AT donghangyu hierarchicalattentionandbilinearfusionforremotesensingimagesceneclassification
AT haitaoguo hierarchicalattentionandbilinearfusionforremotesensingimagesceneclassification
AT qingxu hierarchicalattentionandbilinearfusionforremotesensingimagesceneclassification
AT junlu hierarchicalattentionandbilinearfusionforremotesensingimagesceneclassification
AT chuanzhao hierarchicalattentionandbilinearfusionforremotesensingimagesceneclassification
AT yuzhunlin hierarchicalattentionandbilinearfusionforremotesensingimagesceneclassification
_version_ 1721398650898219008