Aerial Scene Classification through Fine-Tuning with Adaptive Learning Rates and Label Smoothing

Remote Sensing (RS) image classification has recently attracted great attention for its application in different tasks, including environmental monitoring, battlefield surveillance, and geospatial object detection. The best practices for these tasks often involve transfer learning from pre-trained C...

Full description

Bibliographic Details
Main Authors: Biserka Petrovska, Tatjana Atanasova-Pacemska, Roberto Corizzo, Paolo Mignone, Petre Lameski, Eftim Zdravevski
Format: Article
Language:English
Published: MDPI AG 2020-08-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/10/17/5792
id doaj-c6625f321a544cae9fd190b817db3cb1
record_format Article
spelling doaj-c6625f321a544cae9fd190b817db3cb12020-11-25T02:58:46ZengMDPI AGApplied Sciences2076-34172020-08-01105792579210.3390/app10175792Aerial Scene Classification through Fine-Tuning with Adaptive Learning Rates and Label SmoothingBiserka Petrovska0Tatjana Atanasova-Pacemska1Roberto Corizzo2Paolo Mignone3Petre Lameski4Eftim Zdravevski5Ministry of Defence, 1000 Skopje, North MacedoniaFaculty of Computer Science, University Goce Delcev, 2000 Stip, North MacedoniaDepartment of Computer Science, American University, 4400 Massachusetts Ave NW, Washington, DC 20016, USADepartment of Computer Science, University of Bari Aldo Moro, Via E. Orabona, 4, 70125 Bari, Italy, <email>paolo.mignone@uniba.it</email>Faculty of Computer Science and Engineering, Ss. Cyril and Methodius University in Skopje, Rugjer Boshkovik 16, 1000 Skopje, North MacedoniaFaculty of Computer Science and Engineering, Ss. Cyril and Methodius University in Skopje, Rugjer Boshkovik 16, 1000 Skopje, North MacedoniaRemote Sensing (RS) image classification has recently attracted great attention for its application in different tasks, including environmental monitoring, battlefield surveillance, and geospatial object detection. The best practices for these tasks often involve transfer learning from pre-trained Convolutional Neural Networks (CNNs). A common approach in the literature is employing CNNs for feature extraction, and subsequently train classifiers exploiting such features. In this paper, we propose the adoption of transfer learning by fine-tuning pre-trained CNNs for end-to-end aerial image classification. Our approach performs feature extraction from the fine-tuned neural networks and remote sensing image classification with a Support Vector Machine (SVM) model with linear and Radial Basis Function (RBF) kernels. To tune the learning rate hyperparameter, we employ a linear decay learning rate scheduler as well as cyclical learning rates. Moreover, in order to mitigate the overfitting problem of pre-trained models, we apply label smoothing regularization. For the fine-tuning and feature extraction process, we adopt the Inception-v3 and Xception inception-based CNNs, as well the residual-based networks ResNet50 and DenseNet121. We present extensive experiments on two real-world remote sensing image datasets: AID and NWPU-RESISC45. The results show that the proposed method exhibits classification accuracy of up to 98%, outperforming other state-of-the-art methods.https://www.mdpi.com/2076-3417/10/17/5792remote sensingconvolutional neural networkfine-tuninglearning rate schedulercyclical learning rateslabel smoothing
collection DOAJ
language English
format Article
sources DOAJ
author Biserka Petrovska
Tatjana Atanasova-Pacemska
Roberto Corizzo
Paolo Mignone
Petre Lameski
Eftim Zdravevski
spellingShingle Biserka Petrovska
Tatjana Atanasova-Pacemska
Roberto Corizzo
Paolo Mignone
Petre Lameski
Eftim Zdravevski
Aerial Scene Classification through Fine-Tuning with Adaptive Learning Rates and Label Smoothing
Applied Sciences
remote sensing
convolutional neural network
fine-tuning
learning rate scheduler
cyclical learning rates
label smoothing
author_facet Biserka Petrovska
Tatjana Atanasova-Pacemska
Roberto Corizzo
Paolo Mignone
Petre Lameski
Eftim Zdravevski
author_sort Biserka Petrovska
title Aerial Scene Classification through Fine-Tuning with Adaptive Learning Rates and Label Smoothing
title_short Aerial Scene Classification through Fine-Tuning with Adaptive Learning Rates and Label Smoothing
title_full Aerial Scene Classification through Fine-Tuning with Adaptive Learning Rates and Label Smoothing
title_fullStr Aerial Scene Classification through Fine-Tuning with Adaptive Learning Rates and Label Smoothing
title_full_unstemmed Aerial Scene Classification through Fine-Tuning with Adaptive Learning Rates and Label Smoothing
title_sort aerial scene classification through fine-tuning with adaptive learning rates and label smoothing
publisher MDPI AG
series Applied Sciences
issn 2076-3417
publishDate 2020-08-01
description Remote Sensing (RS) image classification has recently attracted great attention for its application in different tasks, including environmental monitoring, battlefield surveillance, and geospatial object detection. The best practices for these tasks often involve transfer learning from pre-trained Convolutional Neural Networks (CNNs). A common approach in the literature is employing CNNs for feature extraction, and subsequently train classifiers exploiting such features. In this paper, we propose the adoption of transfer learning by fine-tuning pre-trained CNNs for end-to-end aerial image classification. Our approach performs feature extraction from the fine-tuned neural networks and remote sensing image classification with a Support Vector Machine (SVM) model with linear and Radial Basis Function (RBF) kernels. To tune the learning rate hyperparameter, we employ a linear decay learning rate scheduler as well as cyclical learning rates. Moreover, in order to mitigate the overfitting problem of pre-trained models, we apply label smoothing regularization. For the fine-tuning and feature extraction process, we adopt the Inception-v3 and Xception inception-based CNNs, as well the residual-based networks ResNet50 and DenseNet121. We present extensive experiments on two real-world remote sensing image datasets: AID and NWPU-RESISC45. The results show that the proposed method exhibits classification accuracy of up to 98%, outperforming other state-of-the-art methods.
topic remote sensing
convolutional neural network
fine-tuning
learning rate scheduler
cyclical learning rates
label smoothing
url https://www.mdpi.com/2076-3417/10/17/5792
work_keys_str_mv AT biserkapetrovska aerialsceneclassificationthroughfinetuningwithadaptivelearningratesandlabelsmoothing
AT tatjanaatanasovapacemska aerialsceneclassificationthroughfinetuningwithadaptivelearningratesandlabelsmoothing
AT robertocorizzo aerialsceneclassificationthroughfinetuningwithadaptivelearningratesandlabelsmoothing
AT paolomignone aerialsceneclassificationthroughfinetuningwithadaptivelearningratesandlabelsmoothing
AT petrelameski aerialsceneclassificationthroughfinetuningwithadaptivelearningratesandlabelsmoothing
AT eftimzdravevski aerialsceneclassificationthroughfinetuningwithadaptivelearningratesandlabelsmoothing
_version_ 1724705235901874176