Estimation of missing Ellenberg Indicator Values for tree species in South-eastern Europe: a comparison of methods

Ellenberg indicator values (EIV) are widely used in vegetation ecology, but the values for many species in Southeastern Europe are not available due to incomplete knowledge of their ecology: it is therefore of paramount importance to estimate missing values in existing databases. The entire EIV set...

وصف كامل

التفاصيل البيبلوغرافية
الحاوية / القاعدة:Ecological Indicators
المؤلفون الرئيسيون: Letizia Leccese, Giuliano Fanelli, Vito Emanuele Cambria, Marco Massimi, Fabio Attorre, Marco Alfò, Svetlana Aćić, Erwin Bergmeier, Andraž Čarni, Mirjana Cuk, Renata Custerevska, Panayotis Dimopoulos, Petrit Hoda, Alfred Mullaj, Urban Šilc, Zeljko Skvorc, Zvjezdana Stancic, Zora Dajic Stevanovic, Rossen Tzonev, Kiril Vassilev, Luca Malatesta, Michele De Sanctis
التنسيق: مقال
اللغة:الإنجليزية
منشور في: Elsevier 2024-03-01
الموضوعات:
الوصول للمادة أونلاين:http://www.sciencedirect.com/science/article/pii/S1470160X2400308X
_version_ 1850414399281430528
author Letizia Leccese
Giuliano Fanelli
Vito Emanuele Cambria
Marco Massimi
Fabio Attorre
Marco Alfò
Svetlana Aćić
Erwin Bergmeier
Andraž Čarni
Mirjana Cuk
Renata Custerevska
Panayotis Dimopoulos
Petrit Hoda
Alfred Mullaj
Urban Šilc
Zeljko Skvorc
Zvjezdana Stancic
Zora Dajic Stevanovic
Rossen Tzonev
Kiril Vassilev
Luca Malatesta
Michele De Sanctis
author_facet Letizia Leccese
Giuliano Fanelli
Vito Emanuele Cambria
Marco Massimi
Fabio Attorre
Marco Alfò
Svetlana Aćić
Erwin Bergmeier
Andraž Čarni
Mirjana Cuk
Renata Custerevska
Panayotis Dimopoulos
Petrit Hoda
Alfred Mullaj
Urban Šilc
Zeljko Skvorc
Zvjezdana Stancic
Zora Dajic Stevanovic
Rossen Tzonev
Kiril Vassilev
Luca Malatesta
Michele De Sanctis
author_sort Letizia Leccese
collection DOAJ
container_title Ecological Indicators
description Ellenberg indicator values (EIV) are widely used in vegetation ecology, but the values for many species in Southeastern Europe are not available due to incomplete knowledge of their ecology: it is therefore of paramount importance to estimate missing values in existing databases. The entire EIV set for a single species can be missing or a single EIV can be missing for species for which other indicator values are available. Our aim here is to provide a simple method to impute missing values for species who have missing data in a single or multiple EIV. For this purpose, we adopt a multiple imputation procedure and compare a number of imputation methods on the basis of two datasets: i) “indices”, the set of 9 Ellenberg indicators taken from literature, available for 10,824 species and ii) “vegetation”, a set describing the physical and climatic characteristics (Light, Temperature, Continentality, Soil moisture, Nitrogen, Soil pH, Hemeroby index, Humidity, Organic_matter) of 29,935 relevés from Southeastern Europe where at least one tree species is present. The imputation methods we considered are: k-Nearest Neighbour, multiple linear regression (with or without collinearity correction), Reprediction Algorithm, Weighted Averaging (WA) and Weighted Averaging Partial Least Squares (WAPLS) regression. The different methods of imputation were compared by looking at the output produced and its deviation from the “true” observed values for a set of species with known EIVs. We have considered a set of species with known EIVs and proceeded to multiple imputation using the methods above; as a measure of performance we adopted the mean squared error (MSE) estimate, and expert judgement of ecological consistency. Models based on Regression and k-Nearest Neighbour seem to outperform the others. On the contrary, Reprediction algorithm in its different forms: produced less satisfactory results.Imputation of missing values is generally based on expert knowledge or on some variant of weighted averaging (also known as Hill’s method). Here we show that other methods may be more effective and should be appropriately considered by vegetation scientists, since those may allow the application of EIVs in other biogeographic regions.
format Article
id doaj-art-d66ace2cb539466c8a809b670db4faf7
institution Directory of Open Access Journals
issn 1470-160X
language English
publishDate 2024-03-01
publisher Elsevier
record_format Article
spelling doaj-art-d66ace2cb539466c8a809b670db4faf72025-08-19T22:45:36ZengElsevierEcological Indicators1470-160X2024-03-0116011185110.1016/j.ecolind.2024.111851Estimation of missing Ellenberg Indicator Values for tree species in South-eastern Europe: a comparison of methodsLetizia Leccese0Giuliano Fanelli1Vito Emanuele Cambria2Marco Massimi3Fabio Attorre4Marco Alfò5Svetlana Aćić6Erwin Bergmeier7Andraž Čarni8Mirjana Cuk9Renata Custerevska10Panayotis Dimopoulos11Petrit Hoda12Alfred Mullaj13Urban Šilc14Zeljko Skvorc15Zvjezdana Stancic16Zora Dajic Stevanovic17Rossen Tzonev18Kiril Vassilev19Luca Malatesta20Michele De Sanctis21Department of Statistical Science, Sapienza University of Rome, ItalyDepartment of Environmental Biology, Sapienza University of Rome, ItalyDepartment of Environmental Biology, Sapienza University of Rome, ItalyDepartment of Environmental Biology, Sapienza University of Rome, ItalyDepartment of Environmental Biology, Sapienza University of Rome, ItalyDepartment of Statistical Science, Sapienza University of Rome, ItalyFaculty of Agriculture, University of Belgrade, SerbiaDepartment of Vegetation Analysis & Phytodiversity, University of Göttingen, GermanyResearch Centre of the Slovenian Academy of Science and Arts, SloveniaFaculty of Sciences, University of Novi Sad, SerbiaInstitute of Biology, Faculty of Sciences, Ss. Cyril and Methodius University in Skopje, MacedoniaLaboratory of Botany, Department of Biology, University of Patras University Campus, 26504 Rio, GreeceUniversity of Tirana, Faculty of Natural Sciences, AlbaniaUniversity of Tirana, Faculty of Natural Sciences, AlbaniaResearch Centre of the Slovenian Academy of Science and Arts, SloveniaFaculty of Forestry, University of Zagreb, CroatiaFaculty of Geotechnical Engineering, University of Zagreb, CroatiaFaculty of Agriculture, University of Belgrade, SerbiaDepartment of Ecology and Environmental Protection, Sofia University “St. Kliment Ohridski”, BulgariaInstitute of Biodiversity and Ecosystem Research, Bulgarian Academy of Sciences, BulgariaDepartment of Environmental Biology, Sapienza University of Rome, Italy; Corresponding author.Department of Environmental Biology, Sapienza University of Rome, ItalyEllenberg indicator values (EIV) are widely used in vegetation ecology, but the values for many species in Southeastern Europe are not available due to incomplete knowledge of their ecology: it is therefore of paramount importance to estimate missing values in existing databases. The entire EIV set for a single species can be missing or a single EIV can be missing for species for which other indicator values are available. Our aim here is to provide a simple method to impute missing values for species who have missing data in a single or multiple EIV. For this purpose, we adopt a multiple imputation procedure and compare a number of imputation methods on the basis of two datasets: i) “indices”, the set of 9 Ellenberg indicators taken from literature, available for 10,824 species and ii) “vegetation”, a set describing the physical and climatic characteristics (Light, Temperature, Continentality, Soil moisture, Nitrogen, Soil pH, Hemeroby index, Humidity, Organic_matter) of 29,935 relevés from Southeastern Europe where at least one tree species is present. The imputation methods we considered are: k-Nearest Neighbour, multiple linear regression (with or without collinearity correction), Reprediction Algorithm, Weighted Averaging (WA) and Weighted Averaging Partial Least Squares (WAPLS) regression. The different methods of imputation were compared by looking at the output produced and its deviation from the “true” observed values for a set of species with known EIVs. We have considered a set of species with known EIVs and proceeded to multiple imputation using the methods above; as a measure of performance we adopted the mean squared error (MSE) estimate, and expert judgement of ecological consistency. Models based on Regression and k-Nearest Neighbour seem to outperform the others. On the contrary, Reprediction algorithm in its different forms: produced less satisfactory results.Imputation of missing values is generally based on expert knowledge or on some variant of weighted averaging (also known as Hill’s method). Here we show that other methods may be more effective and should be appropriately considered by vegetation scientists, since those may allow the application of EIVs in other biogeographic regions.http://www.sciencedirect.com/science/article/pii/S1470160X2400308XVegetation ecologyPlant indicatorsVegetation databasesBiodiversity informaticsBioindicationMissing values
spellingShingle Letizia Leccese
Giuliano Fanelli
Vito Emanuele Cambria
Marco Massimi
Fabio Attorre
Marco Alfò
Svetlana Aćić
Erwin Bergmeier
Andraž Čarni
Mirjana Cuk
Renata Custerevska
Panayotis Dimopoulos
Petrit Hoda
Alfred Mullaj
Urban Šilc
Zeljko Skvorc
Zvjezdana Stancic
Zora Dajic Stevanovic
Rossen Tzonev
Kiril Vassilev
Luca Malatesta
Michele De Sanctis
Estimation of missing Ellenberg Indicator Values for tree species in South-eastern Europe: a comparison of methods
Vegetation ecology
Plant indicators
Vegetation databases
Biodiversity informatics
Bioindication
Missing values
title Estimation of missing Ellenberg Indicator Values for tree species in South-eastern Europe: a comparison of methods
title_full Estimation of missing Ellenberg Indicator Values for tree species in South-eastern Europe: a comparison of methods
title_fullStr Estimation of missing Ellenberg Indicator Values for tree species in South-eastern Europe: a comparison of methods
title_full_unstemmed Estimation of missing Ellenberg Indicator Values for tree species in South-eastern Europe: a comparison of methods
title_short Estimation of missing Ellenberg Indicator Values for tree species in South-eastern Europe: a comparison of methods
title_sort estimation of missing ellenberg indicator values for tree species in south eastern europe a comparison of methods
topic Vegetation ecology
Plant indicators
Vegetation databases
Biodiversity informatics
Bioindication
Missing values
url http://www.sciencedirect.com/science/article/pii/S1470160X2400308X
work_keys_str_mv AT letizialeccese estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT giulianofanelli estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT vitoemanuelecambria estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT marcomassimi estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT fabioattorre estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT marcoalfo estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT svetlanaacic estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT erwinbergmeier estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT andrazcarni estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT mirjanacuk estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT renatacusterevska estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT panayotisdimopoulos estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT petrithoda estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT alfredmullaj estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT urbansilc estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT zeljkoskvorc estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT zvjezdanastancic estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT zoradajicstevanovic estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT rossentzonev estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT kirilvassilev estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT lucamalatesta estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods
AT micheledesanctis estimationofmissingellenbergindicatorvaluesfortreespeciesinsoutheasterneuropeacomparisonofmethods