Distribution of Groundwater Arsenic in Uruguay Using Hybrid Machine Learning and Expert System Approaches
Groundwater arsenic in Uruguay is an important environmental hazard, hence, predicting its distribution is important to inform stakeholders. Furthermore, occurrences in Uruguay are known to variably show dependence on depth and geology, arguably reflecting different processes controlling groundwater...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-02-01
|
Series: | Water |
Subjects: | |
Online Access: | https://www.mdpi.com/2073-4441/13/4/527 |
id |
doaj-0030633b53ca4d21a4c263dda8929901 |
---|---|
record_format |
Article |
spelling |
doaj-0030633b53ca4d21a4c263dda89299012021-02-19T00:03:49ZengMDPI AGWater2073-44412021-02-011352752710.3390/w13040527Distribution of Groundwater Arsenic in Uruguay Using Hybrid Machine Learning and Expert System ApproachesRuohan Wu0Elena M. Alvareda1David A. Polya2Gonzalo Blanco3Pablo Gamazo4Department of Earth and Environmental Sciences, School of Natural Sciences and Williamson Research Centre for Molecular Environmental Sciences, University of Manchester, Manchester M13 9PL, UKDepartamento del Agua, Centro Universitario Regional Litoral Norte, Universidad de la República, Gral. Rivera 1350, Salto 50000, UruguayDepartment of Earth and Environmental Sciences, School of Natural Sciences and Williamson Research Centre for Molecular Environmental Sciences, University of Manchester, Manchester M13 9PL, UKPDU Geología y Recursos Minerales, Centro Universitario Regional Este, Universidad de la República, Ruta 8 km 282, Treinta y Tres 33000, UruguayDepartamento del Agua, Centro Universitario Regional Litoral Norte, Universidad de la República, Gral. Rivera 1350, Salto 50000, UruguayGroundwater arsenic in Uruguay is an important environmental hazard, hence, predicting its distribution is important to inform stakeholders. Furthermore, occurrences in Uruguay are known to variably show dependence on depth and geology, arguably reflecting different processes controlling groundwater arsenic concentrations. Here, we present the distribution of groundwater arsenic in Uruguay modelled by a variety of machine learning, basic expert systems, and hybrid approaches. A pure random forest approach, using 26 potential predictor variables, gave rise to a groundwater arsenic distribution model with a very high degree of accuracy (AUC = 0.92), which is consistent with known high groundwater arsenic hazard areas. These areas are mainly in southwest Uruguay, including the Paysandú, Río Negro, Soriano, Colonia, Flores, San José, Florida, Montevideo, and Canelones departments, where the Mercedes, Cuaternario Oeste, Raigón, and Cretácico main aquifers occur. A hybrid approach separating the country into sedimentary and crystalline aquifer domains resulted in slight material improvement in a high arsenic hazard distribution. However, a further hybrid approach separately modelling shallow (<50 m) and deep aquifers (>50 m) resulted in the identification of more high hazard areas in Flores, Durazno, and the northwest corner of Florida departments in shallow aquifers than the pure model. Both hybrid models considering depth (AUC = 0.95) and geology (AUC = 0.97) produced improved accuracy. Hybrid machine learning models with expert selection of important environmental parameters may sometimes be a better choice than pure machine learning models, particularly where there are incomplete datasets, but perhaps, counterintuitively, this is not always the case.https://www.mdpi.com/2073-4441/13/4/527arsenicgroundwaterUruguaygeostatisticsdepthgeology |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ruohan Wu Elena M. Alvareda David A. Polya Gonzalo Blanco Pablo Gamazo |
spellingShingle |
Ruohan Wu Elena M. Alvareda David A. Polya Gonzalo Blanco Pablo Gamazo Distribution of Groundwater Arsenic in Uruguay Using Hybrid Machine Learning and Expert System Approaches Water arsenic groundwater Uruguay geostatistics depth geology |
author_facet |
Ruohan Wu Elena M. Alvareda David A. Polya Gonzalo Blanco Pablo Gamazo |
author_sort |
Ruohan Wu |
title |
Distribution of Groundwater Arsenic in Uruguay Using Hybrid Machine Learning and Expert System Approaches |
title_short |
Distribution of Groundwater Arsenic in Uruguay Using Hybrid Machine Learning and Expert System Approaches |
title_full |
Distribution of Groundwater Arsenic in Uruguay Using Hybrid Machine Learning and Expert System Approaches |
title_fullStr |
Distribution of Groundwater Arsenic in Uruguay Using Hybrid Machine Learning and Expert System Approaches |
title_full_unstemmed |
Distribution of Groundwater Arsenic in Uruguay Using Hybrid Machine Learning and Expert System Approaches |
title_sort |
distribution of groundwater arsenic in uruguay using hybrid machine learning and expert system approaches |
publisher |
MDPI AG |
series |
Water |
issn |
2073-4441 |
publishDate |
2021-02-01 |
description |
Groundwater arsenic in Uruguay is an important environmental hazard, hence, predicting its distribution is important to inform stakeholders. Furthermore, occurrences in Uruguay are known to variably show dependence on depth and geology, arguably reflecting different processes controlling groundwater arsenic concentrations. Here, we present the distribution of groundwater arsenic in Uruguay modelled by a variety of machine learning, basic expert systems, and hybrid approaches. A pure random forest approach, using 26 potential predictor variables, gave rise to a groundwater arsenic distribution model with a very high degree of accuracy (AUC = 0.92), which is consistent with known high groundwater arsenic hazard areas. These areas are mainly in southwest Uruguay, including the Paysandú, Río Negro, Soriano, Colonia, Flores, San José, Florida, Montevideo, and Canelones departments, where the Mercedes, Cuaternario Oeste, Raigón, and Cretácico main aquifers occur. A hybrid approach separating the country into sedimentary and crystalline aquifer domains resulted in slight material improvement in a high arsenic hazard distribution. However, a further hybrid approach separately modelling shallow (<50 m) and deep aquifers (>50 m) resulted in the identification of more high hazard areas in Flores, Durazno, and the northwest corner of Florida departments in shallow aquifers than the pure model. Both hybrid models considering depth (AUC = 0.95) and geology (AUC = 0.97) produced improved accuracy. Hybrid machine learning models with expert selection of important environmental parameters may sometimes be a better choice than pure machine learning models, particularly where there are incomplete datasets, but perhaps, counterintuitively, this is not always the case. |
topic |
arsenic groundwater Uruguay geostatistics depth geology |
url |
https://www.mdpi.com/2073-4441/13/4/527 |
work_keys_str_mv |
AT ruohanwu distributionofgroundwaterarsenicinuruguayusinghybridmachinelearningandexpertsystemapproaches AT elenamalvareda distributionofgroundwaterarsenicinuruguayusinghybridmachinelearningandexpertsystemapproaches AT davidapolya distributionofgroundwaterarsenicinuruguayusinghybridmachinelearningandexpertsystemapproaches AT gonzaloblanco distributionofgroundwaterarsenicinuruguayusinghybridmachinelearningandexpertsystemapproaches AT pablogamazo distributionofgroundwaterarsenicinuruguayusinghybridmachinelearningandexpertsystemapproaches |
_version_ |
1724261917424353280 |