Hierarchical Attention and Bilinear Fusion for Remote Sensing Image Scene Classification
Remote sensing image scene classification is an important means for the understanding of remote sensing images. Convolutional neural networks (CNNs) have been successfully applied to remote sensing image scene classification and have demonstrated remarkable performance. However, with improvements in...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9222574/ |
id |
doaj-2a1ae314dd7f40a0b80353b3d4247051 |
---|---|
record_format |
Article |
spelling |
doaj-2a1ae314dd7f40a0b80353b3d42470512021-06-03T23:05:26ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352020-01-01136372638310.1109/JSTARS.2020.30302579222574Hierarchical Attention and Bilinear Fusion for Remote Sensing Image Scene ClassificationDonghang Yu0https://orcid.org/0000-0002-5858-7671Haitao Guo1https://orcid.org/0000-0001-5769-2798Qing Xu2https://orcid.org/0000-0003-2505-7188Jun Lu3Chuan Zhao4https://orcid.org/0000-0001-9197-9768Yuzhun Lin5PLA Strategic Support Force Information Engineering University, Zheng Zhou, ChinaPLA Strategic Support Force Information Engineering University, Zheng Zhou, ChinaPLA Strategic Support Force Information Engineering University, Zheng Zhou, ChinaPLA Strategic Support Force Information Engineering University, Zheng Zhou, ChinaPLA Strategic Support Force Information Engineering University, Zheng Zhou, ChinaPLA Strategic Support Force Information Engineering University, Zheng Zhou, ChinaRemote sensing image scene classification is an important means for the understanding of remote sensing images. Convolutional neural networks (CNNs) have been successfully applied to remote sensing image scene classification and have demonstrated remarkable performance. However, with improvements in image resolution, remote sensing image categories are becoming increasingly diverse, and problems such as high intraclass diversity and high interclass similarity have arisen. The performance of ordinary CNNs at distinguishing increasingly complex remote sensing images is still limited. Therefore, we propose a feature fusion framework based on hierarchical attention and bilinear pooling called HABFNet for the scene classification of remote sensing images. First, the deep CNN ResNet50 is used to extract the deep features from different layers of the image, and these features are fused to boost their robustness and effectiveness. Second, we design an improved channel attention scheme to enhance the features from different layers. Finally, the enhanced features are cross-layer bilinearly pooled and fused, and the fused features are used for classification. Extensive experiments were conducted on three publicly available remote sensing image benchmarks. Comparisons with the state-of-the-art methods demonstrated that the proposed HABFNet achieved competitive classification performance.https://ieeexplore.ieee.org/document/9222574/Bilinear poolingchannel attentionhierarchical feature fusionremote sensing imagescene classification |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Donghang Yu Haitao Guo Qing Xu Jun Lu Chuan Zhao Yuzhun Lin |
spellingShingle |
Donghang Yu Haitao Guo Qing Xu Jun Lu Chuan Zhao Yuzhun Lin Hierarchical Attention and Bilinear Fusion for Remote Sensing Image Scene Classification IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Bilinear pooling channel attention hierarchical feature fusion remote sensing image scene classification |
author_facet |
Donghang Yu Haitao Guo Qing Xu Jun Lu Chuan Zhao Yuzhun Lin |
author_sort |
Donghang Yu |
title |
Hierarchical Attention and Bilinear Fusion for Remote Sensing Image Scene Classification |
title_short |
Hierarchical Attention and Bilinear Fusion for Remote Sensing Image Scene Classification |
title_full |
Hierarchical Attention and Bilinear Fusion for Remote Sensing Image Scene Classification |
title_fullStr |
Hierarchical Attention and Bilinear Fusion for Remote Sensing Image Scene Classification |
title_full_unstemmed |
Hierarchical Attention and Bilinear Fusion for Remote Sensing Image Scene Classification |
title_sort |
hierarchical attention and bilinear fusion for remote sensing image scene classification |
publisher |
IEEE |
series |
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
issn |
2151-1535 |
publishDate |
2020-01-01 |
description |
Remote sensing image scene classification is an important means for the understanding of remote sensing images. Convolutional neural networks (CNNs) have been successfully applied to remote sensing image scene classification and have demonstrated remarkable performance. However, with improvements in image resolution, remote sensing image categories are becoming increasingly diverse, and problems such as high intraclass diversity and high interclass similarity have arisen. The performance of ordinary CNNs at distinguishing increasingly complex remote sensing images is still limited. Therefore, we propose a feature fusion framework based on hierarchical attention and bilinear pooling called HABFNet for the scene classification of remote sensing images. First, the deep CNN ResNet50 is used to extract the deep features from different layers of the image, and these features are fused to boost their robustness and effectiveness. Second, we design an improved channel attention scheme to enhance the features from different layers. Finally, the enhanced features are cross-layer bilinearly pooled and fused, and the fused features are used for classification. Extensive experiments were conducted on three publicly available remote sensing image benchmarks. Comparisons with the state-of-the-art methods demonstrated that the proposed HABFNet achieved competitive classification performance. |
topic |
Bilinear pooling channel attention hierarchical feature fusion remote sensing image scene classification |
url |
https://ieeexplore.ieee.org/document/9222574/ |
work_keys_str_mv |
AT donghangyu hierarchicalattentionandbilinearfusionforremotesensingimagesceneclassification AT haitaoguo hierarchicalattentionandbilinearfusionforremotesensingimagesceneclassification AT qingxu hierarchicalattentionandbilinearfusionforremotesensingimagesceneclassification AT junlu hierarchicalattentionandbilinearfusionforremotesensingimagesceneclassification AT chuanzhao hierarchicalattentionandbilinearfusionforremotesensingimagesceneclassification AT yuzhunlin hierarchicalattentionandbilinearfusionforremotesensingimagesceneclassification |
_version_ |
1721398650898219008 |