Evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areas

The purpose of the present study was to predict the areas affected by flood hazard in the Talar watershed, Mazandaran province, Iran, using Adaptive Boosting (AdaBoost), Boosted Generalized Linear Models (BGLM), Extreme Gradient Boosting (XGB) ensemble models, and the novel ensemble framework of dee...

Full description

Bibliographic Details
Main Authors: Quoc Bao Pham, Subodh Chandra Pal, Rabin Chakrabortty, Akbar Norouzi, Mohammad Golshan, Akinwale T. Ogunrinde, Saeid Janizadeh, Khaled Mohamed Khedher, Duong Tran Anh
Format: Article
Language:English
Published: Taylor & Francis Group 2021-01-01
Series:Geomatics, Natural Hazards & Risk
Subjects:
Online Access:http://dx.doi.org/10.1080/19475705.2021.1968510
id doaj-e67a004be22040c88893ad566c36a10e
record_format Article
spelling doaj-e67a004be22040c88893ad566c36a10e2021-09-06T14:06:25ZengTaylor & Francis GroupGeomatics, Natural Hazards & Risk1947-57051947-57132021-01-011212607262810.1080/19475705.2021.19685101968510Evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areasQuoc Bao Pham0Subodh Chandra Pal1Rabin Chakrabortty2Akbar Norouzi3Mohammad Golshan4Akinwale T. Ogunrinde5Saeid Janizadeh6Khaled Mohamed Khedher7Duong Tran Anh8Institute of Applied Technology, Thu Dau Mot UniversityDepartment of Geography, The University of BurdwanDepartment of Geography, The University of BurdwanDepartment of Natural Engineering, Faculty of Natural Resources and Earth Science, Shahrekord UnversityNatural Resources and Watershed Management OfficeDepartment of Agricultural and Environmental Engineering, Federal University of TechnologyDepartment of Watershed Management Engineering and Sciences, Faculty in Natural Resources and Marine Science, Tarbiat Modares UniversityDepartment of Civil Engineering, College of Engineering, King Khalid UniversityHo Chi Minh City University of Technology (HUTECH)The purpose of the present study was to predict the areas affected by flood hazard in the Talar watershed, Mazandaran province, Iran, using Adaptive Boosting (AdaBoost), Boosted Generalized Linear Models (BGLM), Extreme Gradient Boosting (XGB) ensemble models, and the novel ensemble framework of deep decision trees include the Deep Boosting (DB) model. For this purpose, 14 flood conditioning variables were used as independent variables in flood hazard modeling. In addition, 130 flood points in the region were identified by field visits and available flood information, which were used as the dependent variable in modeling. The results showed that all used models have a good efficiency in predicting flood hazard. The area under curve (AUC) of BGLM, XGB, AdaBoost and DB models were 0.88, 0.87, 0.89 and 0.91, respectively, which indicated the highest efficiency of the DB model in flood hazard modeling in the study area. Relative importance of the variables showed that they have different effects in each model. Altitude and distance from the river are more important than other variables. However, these two variables have been selected as the most important variables based on machine learning models, but other variables may be influential in flood hazards.http://dx.doi.org/10.1080/19475705.2021.1968510boosting ensemble modeldeep boosting (db)flood hazarddeep decision treetalar watershed
collection DOAJ
language English
format Article
sources DOAJ
author Quoc Bao Pham
Subodh Chandra Pal
Rabin Chakrabortty
Akbar Norouzi
Mohammad Golshan
Akinwale T. Ogunrinde
Saeid Janizadeh
Khaled Mohamed Khedher
Duong Tran Anh
spellingShingle Quoc Bao Pham
Subodh Chandra Pal
Rabin Chakrabortty
Akbar Norouzi
Mohammad Golshan
Akinwale T. Ogunrinde
Saeid Janizadeh
Khaled Mohamed Khedher
Duong Tran Anh
Evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areas
Geomatics, Natural Hazards & Risk
boosting ensemble model
deep boosting (db)
flood hazard
deep decision tree
talar watershed
author_facet Quoc Bao Pham
Subodh Chandra Pal
Rabin Chakrabortty
Akbar Norouzi
Mohammad Golshan
Akinwale T. Ogunrinde
Saeid Janizadeh
Khaled Mohamed Khedher
Duong Tran Anh
author_sort Quoc Bao Pham
title Evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areas
title_short Evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areas
title_full Evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areas
title_fullStr Evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areas
title_full_unstemmed Evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areas
title_sort evaluation of various boosting ensemble algorithms for predicting flood hazard susceptibility areas
publisher Taylor & Francis Group
series Geomatics, Natural Hazards & Risk
issn 1947-5705
1947-5713
publishDate 2021-01-01
description The purpose of the present study was to predict the areas affected by flood hazard in the Talar watershed, Mazandaran province, Iran, using Adaptive Boosting (AdaBoost), Boosted Generalized Linear Models (BGLM), Extreme Gradient Boosting (XGB) ensemble models, and the novel ensemble framework of deep decision trees include the Deep Boosting (DB) model. For this purpose, 14 flood conditioning variables were used as independent variables in flood hazard modeling. In addition, 130 flood points in the region were identified by field visits and available flood information, which were used as the dependent variable in modeling. The results showed that all used models have a good efficiency in predicting flood hazard. The area under curve (AUC) of BGLM, XGB, AdaBoost and DB models were 0.88, 0.87, 0.89 and 0.91, respectively, which indicated the highest efficiency of the DB model in flood hazard modeling in the study area. Relative importance of the variables showed that they have different effects in each model. Altitude and distance from the river are more important than other variables. However, these two variables have been selected as the most important variables based on machine learning models, but other variables may be influential in flood hazards.
topic boosting ensemble model
deep boosting (db)
flood hazard
deep decision tree
talar watershed
url http://dx.doi.org/10.1080/19475705.2021.1968510
work_keys_str_mv AT quocbaopham evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas
AT subodhchandrapal evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas
AT rabinchakrabortty evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas
AT akbarnorouzi evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas
AT mohammadgolshan evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas
AT akinwaletogunrinde evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas
AT saeidjanizadeh evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas
AT khaledmohamedkhedher evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas
AT duongtrananh evaluationofvariousboostingensemblealgorithmsforpredictingfloodhazardsusceptibilityareas
_version_ 1717779336905359360