Soil Mapping Based on the Integration of the Similarity-Based Approach and Random Forests

Digital soil mapping (DSM) is currently the primary framework for predicting the spatial variation of soil information (soil type or soil properties). Random forests and similarity-based methods have been used widely in DSM. However, the accuracy of the similarity-based approach is limited, and the...

Full description

Bibliographic Details
Main Authors: Desheng Wang, A-Xing Zhu
Format: Article
Language:English
Published: MDPI AG 2020-05-01
Series:Land
Subjects:
Online Access:https://www.mdpi.com/2073-445X/9/6/174
id doaj-92c9a6b6318a45a785805e74cc602852
record_format Article
spelling doaj-92c9a6b6318a45a785805e74cc6028522020-11-25T02:33:30ZengMDPI AGLand2073-445X2020-05-01917417410.3390/land9060174Soil Mapping Based on the Integration of the Similarity-Based Approach and Random ForestsDesheng Wang0A-Xing Zhu1Key Laboratory of Virtual Geographic Environment, Nanjing Normal University, Ministry of Education, Nanjing 210023, ChinaKey Laboratory of Virtual Geographic Environment, Nanjing Normal University, Ministry of Education, Nanjing 210023, ChinaDigital soil mapping (DSM) is currently the primary framework for predicting the spatial variation of soil information (soil type or soil properties). Random forests and similarity-based methods have been used widely in DSM. However, the accuracy of the similarity-based approach is limited, and the performance of random forests is affected by the quality of the feature set. The objective of this study was to present a method for soil mapping by integrating the similarity-based approach and the random forests method. The Heshan area (Heilongjiang province, China) was selected as the case study for mapping soil subgroups. The results of the regular validation samples showed that the overall accuracy of the integrated method (71.79%) is higher than that of a similarity-based approach (58.97%) and random forests (66.67%). The results of the 5-fold cross-validation showed that the overall accuracy of the integrated method, similarity-based approach, and random forests range from 55% to 72.73%, 43.48% to 69.57%, and 54.17% to 70.83%, with an average accuracy of 66.61%, 57.39%, and 59.62%, respectively. These results suggest that the proposed method can produce a high-quality covariate set and achieve a better performance than either the random forests or similarity-based approach alone.https://www.mdpi.com/2073-445X/9/6/174Digital soil mappingsimilarity-based approachrandom forestsmethod integration
collection DOAJ
language English
format Article
sources DOAJ
author Desheng Wang
A-Xing Zhu
spellingShingle Desheng Wang
A-Xing Zhu
Soil Mapping Based on the Integration of the Similarity-Based Approach and Random Forests
Land
Digital soil mapping
similarity-based approach
random forests
method integration
author_facet Desheng Wang
A-Xing Zhu
author_sort Desheng Wang
title Soil Mapping Based on the Integration of the Similarity-Based Approach and Random Forests
title_short Soil Mapping Based on the Integration of the Similarity-Based Approach and Random Forests
title_full Soil Mapping Based on the Integration of the Similarity-Based Approach and Random Forests
title_fullStr Soil Mapping Based on the Integration of the Similarity-Based Approach and Random Forests
title_full_unstemmed Soil Mapping Based on the Integration of the Similarity-Based Approach and Random Forests
title_sort soil mapping based on the integration of the similarity-based approach and random forests
publisher MDPI AG
series Land
issn 2073-445X
publishDate 2020-05-01
description Digital soil mapping (DSM) is currently the primary framework for predicting the spatial variation of soil information (soil type or soil properties). Random forests and similarity-based methods have been used widely in DSM. However, the accuracy of the similarity-based approach is limited, and the performance of random forests is affected by the quality of the feature set. The objective of this study was to present a method for soil mapping by integrating the similarity-based approach and the random forests method. The Heshan area (Heilongjiang province, China) was selected as the case study for mapping soil subgroups. The results of the regular validation samples showed that the overall accuracy of the integrated method (71.79%) is higher than that of a similarity-based approach (58.97%) and random forests (66.67%). The results of the 5-fold cross-validation showed that the overall accuracy of the integrated method, similarity-based approach, and random forests range from 55% to 72.73%, 43.48% to 69.57%, and 54.17% to 70.83%, with an average accuracy of 66.61%, 57.39%, and 59.62%, respectively. These results suggest that the proposed method can produce a high-quality covariate set and achieve a better performance than either the random forests or similarity-based approach alone.
topic Digital soil mapping
similarity-based approach
random forests
method integration
url https://www.mdpi.com/2073-445X/9/6/174
work_keys_str_mv AT deshengwang soilmappingbasedontheintegrationofthesimilaritybasedapproachandrandomforests
AT axingzhu soilmappingbasedontheintegrationofthesimilaritybasedapproachandrandomforests
_version_ 1724813581787070464