Evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areas
The purpose of the present study was to predict the areas affected by flood hazard in the Talar watershed, Mazandaran province, Iran, using Adaptive Boosting (AdaBoost), Boosted Generalized Linear Models (BGLM), Extreme Gradient Boosting (XGB) ensemble models, and the novel ensemble framework of dee...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Taylor & Francis Group
2021-01-01
|
Series: | Geomatics, Natural Hazards & Risk |
Subjects: | |
Online Access: | http://dx.doi.org/10.1080/19475705.2021.1968510 |
id |
doaj-e67a004be22040c88893ad566c36a10e |
---|---|
record_format |
Article |
spelling |
doaj-e67a004be22040c88893ad566c36a10e2021-09-06T14:06:25ZengTaylor & Francis GroupGeomatics, Natural Hazards & Risk1947-57051947-57132021-01-011212607262810.1080/19475705.2021.19685101968510Evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areasQuoc Bao Pham0Subodh Chandra Pal1Rabin Chakrabortty2Akbar Norouzi3Mohammad Golshan4Akinwale T. Ogunrinde5Saeid Janizadeh6Khaled Mohamed Khedher7Duong Tran Anh8Institute of Applied Technology, Thu Dau Mot UniversityDepartment of Geography, The University of BurdwanDepartment of Geography, The University of BurdwanDepartment of Natural Engineering, Faculty of Natural Resources and Earth Science, Shahrekord UnversityNatural Resources and Watershed Management OfficeDepartment of Agricultural and Environmental Engineering, Federal University of TechnologyDepartment of Watershed Management Engineering and Sciences, Faculty in Natural Resources and Marine Science, Tarbiat Modares UniversityDepartment of Civil Engineering, College of Engineering, King Khalid UniversityHo Chi Minh City University of Technology (HUTECH)The purpose of the present study was to predict the areas affected by flood hazard in the Talar watershed, Mazandaran province, Iran, using Adaptive Boosting (AdaBoost), Boosted Generalized Linear Models (BGLM), Extreme Gradient Boosting (XGB) ensemble models, and the novel ensemble framework of deep decision trees include the Deep Boosting (DB) model. For this purpose, 14 flood conditioning variables were used as independent variables in flood hazard modeling. In addition, 130 flood points in the region were identified by field visits and available flood information, which were used as the dependent variable in modeling. The results showed that all used models have a good efficiency in predicting flood hazard. The area under curve (AUC) of BGLM, XGB, AdaBoost and DB models were 0.88, 0.87, 0.89 and 0.91, respectively, which indicated the highest efficiency of the DB model in flood hazard modeling in the study area. Relative importance of the variables showed that they have different effects in each model. Altitude and distance from the river are more important than other variables. However, these two variables have been selected as the most important variables based on machine learning models, but other variables may be influential in flood hazards.http://dx.doi.org/10.1080/19475705.2021.1968510boosting ensemble modeldeep boosting (db)flood hazarddeep decision treetalar watershed |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Quoc Bao Pham Subodh Chandra Pal Rabin Chakrabortty Akbar Norouzi Mohammad Golshan Akinwale T. Ogunrinde Saeid Janizadeh Khaled Mohamed Khedher Duong Tran Anh |
spellingShingle |
Quoc Bao Pham Subodh Chandra Pal Rabin Chakrabortty Akbar Norouzi Mohammad Golshan Akinwale T. Ogunrinde Saeid Janizadeh Khaled Mohamed Khedher Duong Tran Anh Evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areas Geomatics, Natural Hazards & Risk boosting ensemble model deep boosting (db) flood hazard deep decision tree talar watershed |
author_facet |
Quoc Bao Pham Subodh Chandra Pal Rabin Chakrabortty Akbar Norouzi Mohammad Golshan Akinwale T. Ogunrinde Saeid Janizadeh Khaled Mohamed Khedher Duong Tran Anh |
author_sort |
Quoc Bao Pham |
title |
Evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areas |
title_short |
Evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areas |
title_full |
Evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areas |
title_fullStr |
Evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areas |
title_full_unstemmed |
Evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areas |
title_sort |
evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areas |
publisher |
Taylor & Francis Group |
series |
Geomatics, Natural Hazards & Risk |
issn |
1947-5705 1947-5713 |
publishDate |
2021-01-01 |
description |
The purpose of the present study was to predict the areas affected by flood hazard in the Talar watershed, Mazandaran province, Iran, using Adaptive Boosting (AdaBoost), Boosted Generalized Linear Models (BGLM), Extreme Gradient Boosting (XGB) ensemble models, and the novel ensemble framework of deep decision trees include the Deep Boosting (DB) model. For this purpose, 14 flood conditioning variables were used as independent variables in flood hazard modeling. In addition, 130 flood points in the region were identified by field visits and available flood information, which were used as the dependent variable in modeling. The results showed that all used models have a good efficiency in predicting flood hazard. The area under curve (AUC) of BGLM, XGB, AdaBoost and DB models were 0.88, 0.87, 0.89 and 0.91, respectively, which indicated the highest efficiency of the DB model in flood hazard modeling in the study area. Relative importance of the variables showed that they have different effects in each model. Altitude and distance from the river are more important than other variables. However, these two variables have been selected as the most important variables based on machine learning models, but other variables may be influential in flood hazards. |
topic |
boosting ensemble model deep boosting (db) flood hazard deep decision tree talar watershed |
url |
http://dx.doi.org/10.1080/19475705.2021.1968510 |
work_keys_str_mv |
AT quocbaopham evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas AT subodhchandrapal evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas AT rabinchakrabortty evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas AT akbarnorouzi evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas AT mohammadgolshan evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas AT akinwaletogunrinde evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas AT saeidjanizadeh evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas AT khaledmohamedkhedher evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas AT duongtrananh evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas |
_version_ |
1717779336905359360 |