Lithological Mapping Research Based on Feature Selection Model of ReliefF-RF

Multispectral satellite data allow experts to discriminate rock units based on their spectral signature characteristics. Here, Sentinel-2, ASTER and the Landsat-8 Operational Land Imager (OLI) were assessed for lithological mapping by using a random forest (RF) classifier for a study area located in...

Full description

Bibliographic Details
Published in:Applied Sciences
Main Authors: Jing Xi, Qigang Jiang, Huaxin Liu, Xin Gao
Format: Article
Language:English
Published: MDPI AG 2023-10-01
Subjects:
Online Access:https://www.mdpi.com/2076-3417/13/20/11225
_version_ 1850133129159770112
author Jing Xi
Qigang Jiang
Huaxin Liu
Xin Gao
author_facet Jing Xi
Qigang Jiang
Huaxin Liu
Xin Gao
author_sort Jing Xi
collection DOAJ
container_title Applied Sciences
description Multispectral satellite data allow experts to discriminate rock units based on their spectral signature characteristics. Here, Sentinel-2, ASTER and the Landsat-8 Operational Land Imager (OLI) were assessed for lithological mapping by using a random forest (RF) classifier for a study area located in Xitieshan, Northwest China. The classification accuracy of Sentinel-2 was 60.71%, which was 5.24% and 4.77% higher than the accuracies for ASTER and the Landsat-8 OLI, respectively. Three image enhancement techniques, namely, principal component analysis (PCA), independent component analysis (ICA) and minimum noise fraction (MNF), were used with grey-level cooccurrence matrices (GLCMs) to increase the quality of the input datasets. The ICA could discriminate between rock unit datasets better than the other approaches. In contrast, GLCM performed poorly when used independently. The overall classification accuracies were 60.71%, 62.63%, 64.34%, 65.21% and 58.87% for the 10 bands of Sentinel-2, PCA, MNF, ICA and GLCM, respectively. Then, five datasets were combined as a single group and applied in RF classification. Sentinel-2 obtained an overall accuracy of 73.96% and performed better than the other single-dataset approaches used in this study. Furthermore, the classification result of RF was achieved better performance than that of the support vector machine algorithm (SVM). During feature selection processing, ReliefF, the most successful pre-processing algorithm, was employed to preliminarily perform feature screening. Then, the optimal dataset was selected on the basis of the importance ranking of RF. A total of 20 more important predictors were selected from 114 original features using the ReliefF-RF model. These predictors were used in the lithological mapping, and an overall accuracy of 77.63% was reached.
format Article
id doaj-art-41df631432bb45fcbc587c8e09f6c950
institution Directory of Open Access Journals
issn 2076-3417
language English
publishDate 2023-10-01
publisher MDPI AG
record_format Article
spelling doaj-art-41df631432bb45fcbc587c8e09f6c9502025-08-19T23:52:04ZengMDPI AGApplied Sciences2076-34172023-10-0113201122510.3390/app132011225Lithological Mapping Research Based on Feature Selection Model of ReliefF-RFJing Xi0Qigang Jiang1Huaxin Liu2Xin Gao3College of Geo-Exploration Science and Technology, Jilin University, Changchun 130026, ChinaCollege of Geo-Exploration Science and Technology, Jilin University, Changchun 130026, ChinaCollege of Geo-Exploration Science and Technology, Jilin University, Changchun 130026, ChinaCollege of Geo-Exploration Science and Technology, Jilin University, Changchun 130026, ChinaMultispectral satellite data allow experts to discriminate rock units based on their spectral signature characteristics. Here, Sentinel-2, ASTER and the Landsat-8 Operational Land Imager (OLI) were assessed for lithological mapping by using a random forest (RF) classifier for a study area located in Xitieshan, Northwest China. The classification accuracy of Sentinel-2 was 60.71%, which was 5.24% and 4.77% higher than the accuracies for ASTER and the Landsat-8 OLI, respectively. Three image enhancement techniques, namely, principal component analysis (PCA), independent component analysis (ICA) and minimum noise fraction (MNF), were used with grey-level cooccurrence matrices (GLCMs) to increase the quality of the input datasets. The ICA could discriminate between rock unit datasets better than the other approaches. In contrast, GLCM performed poorly when used independently. The overall classification accuracies were 60.71%, 62.63%, 64.34%, 65.21% and 58.87% for the 10 bands of Sentinel-2, PCA, MNF, ICA and GLCM, respectively. Then, five datasets were combined as a single group and applied in RF classification. Sentinel-2 obtained an overall accuracy of 73.96% and performed better than the other single-dataset approaches used in this study. Furthermore, the classification result of RF was achieved better performance than that of the support vector machine algorithm (SVM). During feature selection processing, ReliefF, the most successful pre-processing algorithm, was employed to preliminarily perform feature screening. Then, the optimal dataset was selected on the basis of the importance ranking of RF. A total of 20 more important predictors were selected from 114 original features using the ReliefF-RF model. These predictors were used in the lithological mapping, and an overall accuracy of 77.63% was reached.https://www.mdpi.com/2076-3417/13/20/11225lithological mappingSentinel-2feature selection modelmachine learningtextural analysis
spellingShingle Jing Xi
Qigang Jiang
Huaxin Liu
Xin Gao
Lithological Mapping Research Based on Feature Selection Model of ReliefF-RF
lithological mapping
Sentinel-2
feature selection model
machine learning
textural analysis
title Lithological Mapping Research Based on Feature Selection Model of ReliefF-RF
title_full Lithological Mapping Research Based on Feature Selection Model of ReliefF-RF
title_fullStr Lithological Mapping Research Based on Feature Selection Model of ReliefF-RF
title_full_unstemmed Lithological Mapping Research Based on Feature Selection Model of ReliefF-RF
title_short Lithological Mapping Research Based on Feature Selection Model of ReliefF-RF
title_sort lithological mapping research based on feature selection model of relieff rf
topic lithological mapping
Sentinel-2
feature selection model
machine learning
textural analysis
url https://www.mdpi.com/2076-3417/13/20/11225
work_keys_str_mv AT jingxi lithologicalmappingresearchbasedonfeatureselectionmodelofrelieffrf
AT qigangjiang lithologicalmappingresearchbasedonfeatureselectionmodelofrelieffrf
AT huaxinliu lithologicalmappingresearchbasedonfeatureselectionmodelofrelieffrf
AT xingao lithologicalmappingresearchbasedonfeatureselectionmodelofrelieffrf