A Compendium of Chemical Class and Use Type Open Access Databases

With an ever-increasing production and registration of chemical substances, obtaining reliable and up to date information on their use types (UT) and chemical class (CC) is of crucial importance. We evaluated the current status of open access chemical substance databases (DBs) regarding UT and CC in...

وصف كامل

التفاصيل البيبلوغرافية
الحاوية / القاعدة:Data
المؤلفون الرئيسيون: Niklas Heinemann, Sascha Bub, Jakob Wolfram, Sebastian Stehle, Lara L. Petschick, Ralf Schulz
التنسيق: مقال
اللغة:الإنجليزية
منشور في: MDPI AG 2020-12-01
الموضوعات:
الوصول للمادة أونلاين:https://www.mdpi.com/2306-5729/5/4/114
_version_ 1850546511116500992
author Niklas Heinemann
Sascha Bub
Jakob Wolfram
Sebastian Stehle
Lara L. Petschick
Ralf Schulz
author_facet Niklas Heinemann
Sascha Bub
Jakob Wolfram
Sebastian Stehle
Lara L. Petschick
Ralf Schulz
author_sort Niklas Heinemann
collection DOAJ
container_title Data
description With an ever-increasing production and registration of chemical substances, obtaining reliable and up to date information on their use types (UT) and chemical class (CC) is of crucial importance. We evaluated the current status of open access chemical substance databases (DBs) regarding UT and CC information using the “Meta-analysis of the Global Impact of Chemicals” (MAGIC) graph as a benchmark. A decision tree-based selection process was used to choose the most suitable out of 96 databases. To compare the DB content for 100 weighted, randomly selected chemical substances, an extensive quantitative and qualitative analysis was performed. It was found that four DBs yielded more qualitative and quantitative UT and CC results than the current MAGIC graph: The European Bioinformatics Institute DB, ChemSpider, the English Wikipedia page, and the National Center for Biotechnology Information (NCBI). The NCBI, along with its subsidiary DBs PubChem and Medical Subject Headings (MeSH), showed the best performance according to the defined criteria. To analyse large datasets, harmonisation of the available information might be beneficial, as the available DBs mostly aggregate information without harmonising them.
format Article
id doaj-art-e6da05bf33ab4f858b35b0dfefd261f5
institution Directory of Open Access Journals
issn 2306-5729
language English
publishDate 2020-12-01
publisher MDPI AG
record_format Article
spelling doaj-art-e6da05bf33ab4f858b35b0dfefd261f52025-08-19T22:37:22ZengMDPI AGData2306-57292020-12-015411410.3390/data5040114A Compendium of Chemical Class and Use Type Open Access DatabasesNiklas Heinemann0Sascha Bub1Jakob Wolfram2Sebastian Stehle3Lara L. Petschick4Ralf Schulz5iES Landau, Institute for Environmental Sciences, University of Koblenz-Landau, D-76829 Landau, GermanyiES Landau, Institute for Environmental Sciences, University of Koblenz-Landau, D-76829 Landau, GermanyiES Landau, Institute for Environmental Sciences, University of Koblenz-Landau, D-76829 Landau, GermanyiES Landau, Institute for Environmental Sciences, University of Koblenz-Landau, D-76829 Landau, GermanyiES Landau, Institute for Environmental Sciences, University of Koblenz-Landau, D-76829 Landau, GermanyiES Landau, Institute for Environmental Sciences, University of Koblenz-Landau, D-76829 Landau, GermanyWith an ever-increasing production and registration of chemical substances, obtaining reliable and up to date information on their use types (UT) and chemical class (CC) is of crucial importance. We evaluated the current status of open access chemical substance databases (DBs) regarding UT and CC information using the “Meta-analysis of the Global Impact of Chemicals” (MAGIC) graph as a benchmark. A decision tree-based selection process was used to choose the most suitable out of 96 databases. To compare the DB content for 100 weighted, randomly selected chemical substances, an extensive quantitative and qualitative analysis was performed. It was found that four DBs yielded more qualitative and quantitative UT and CC results than the current MAGIC graph: The European Bioinformatics Institute DB, ChemSpider, the English Wikipedia page, and the National Center for Biotechnology Information (NCBI). The NCBI, along with its subsidiary DBs PubChem and Medical Subject Headings (MeSH), showed the best performance according to the defined criteria. To analyse large datasets, harmonisation of the available information might be beneficial, as the available DBs mostly aggregate information without harmonising them.https://www.mdpi.com/2306-5729/5/4/114ecotoxicologygraph databaseenvironmental datadata harmonisationchemical use typeschemical class
spellingShingle Niklas Heinemann
Sascha Bub
Jakob Wolfram
Sebastian Stehle
Lara L. Petschick
Ralf Schulz
A Compendium of Chemical Class and Use Type Open Access Databases
ecotoxicology
graph database
environmental data
data harmonisation
chemical use types
chemical class
title A Compendium of Chemical Class and Use Type Open Access Databases
title_full A Compendium of Chemical Class and Use Type Open Access Databases
title_fullStr A Compendium of Chemical Class and Use Type Open Access Databases
title_full_unstemmed A Compendium of Chemical Class and Use Type Open Access Databases
title_short A Compendium of Chemical Class and Use Type Open Access Databases
title_sort compendium of chemical class and use type open access databases
topic ecotoxicology
graph database
environmental data
data harmonisation
chemical use types
chemical class
url https://www.mdpi.com/2306-5729/5/4/114
work_keys_str_mv AT niklasheinemann acompendiumofchemicalclassandusetypeopenaccessdatabases
AT saschabub acompendiumofchemicalclassandusetypeopenaccessdatabases
AT jakobwolfram acompendiumofchemicalclassandusetypeopenaccessdatabases
AT sebastianstehle acompendiumofchemicalclassandusetypeopenaccessdatabases
AT laralpetschick acompendiumofchemicalclassandusetypeopenaccessdatabases
AT ralfschulz acompendiumofchemicalclassandusetypeopenaccessdatabases
AT niklasheinemann compendiumofchemicalclassandusetypeopenaccessdatabases
AT saschabub compendiumofchemicalclassandusetypeopenaccessdatabases
AT jakobwolfram compendiumofchemicalclassandusetypeopenaccessdatabases
AT sebastianstehle compendiumofchemicalclassandusetypeopenaccessdatabases
AT laralpetschick compendiumofchemicalclassandusetypeopenaccessdatabases
AT ralfschulz compendiumofchemicalclassandusetypeopenaccessdatabases