Mind the Gap: Mapping Mass Spectral Databases in Genome-Scale Metabolic Networks Reveals Poorly Covered Areas

The use of mass spectrometry-based metabolomics to study human, plant and microbial biochemistry and their interactions with the environment largely depends on the ability to annotate metabolite structures by matching mass spectral features of the measured metabolites to curated spectra of reference...

Full description

Bibliographic Details
Main Authors: Clément Frainay, Emma L. Schymanski, Steffen Neumann, Benjamin Merlet, Reza M. Salek, Fabien Jourdan, Oscar Yanes
Format: Article
Language:English
Published: MDPI AG 2018-09-01
Series:Metabolites
Subjects:
Online Access:http://www.mdpi.com/2218-1989/8/3/51
id doaj-dba76be0876447b78ddc043bb6451b67
record_format Article
spelling doaj-dba76be0876447b78ddc043bb6451b672020-11-25T00:02:40ZengMDPI AGMetabolites2218-19892018-09-01835110.3390/metabo8030051metabo8030051Mind the Gap: Mapping Mass Spectral Databases in Genome-Scale Metabolic Networks Reveals Poorly Covered AreasClément Frainay0Emma L. Schymanski1Steffen Neumann2Benjamin Merlet3Reza M. Salek4Fabien Jourdan5Oscar Yanes6Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRA, ENVT, INP-Purpan, UPS, 31555 Toulouse, FranceEawag: Swiss Federal Institute for Aquatic Science and Technology, Überlandstrasse 133, 8600 Dübendorf, SwitzerlandLeibniz Institute of Plant Biochemistry, Department of Stress and Developmental Biology, Weinberg 3, 06120 Halle, GermanyToxalim (Research Centre in Food Toxicology), Université de Toulouse, INRA, ENVT, INP-Purpan, UPS, 31555 Toulouse, FranceThe International Agency for Research on Cancer (IARC), 150 Cours Albert Thomas, 69372 Lyon CEDEX 08, FranceToxalim (Research Centre in Food Toxicology), Université de Toulouse, INRA, ENVT, INP-Purpan, UPS, 31555 Toulouse, FranceMetabolomics Platform, IISPV, Department of Electronic Engineering, Universitat Rovira i Virgili, Avinguda Paisos Catalans 26, 43007 Tarragona, SpainThe use of mass spectrometry-based metabolomics to study human, plant and microbial biochemistry and their interactions with the environment largely depends on the ability to annotate metabolite structures by matching mass spectral features of the measured metabolites to curated spectra of reference standards. While reference databases for metabolomics now provide information for hundreds of thousands of compounds, barely 5% of these known small molecules have experimental data from pure standards. Remarkably, it is still unknown how well existing mass spectral libraries cover the biochemical landscape of prokaryotic and eukaryotic organisms. To address this issue, we have investigated the coverage of 38 genome-scale metabolic networks by public and commercial mass spectral databases, and found that on average only 40% of nodes in metabolic networks could be mapped by mass spectral information from standards. Next, we deciphered computationally which parts of the human metabolic network are poorly covered by mass spectral libraries, revealing gaps in the eicosanoids, vitamins and bile acid metabolism. Finally, our network topology analysis based on the betweenness centrality of metabolites revealed the top 20 most important metabolites that, if added to MS databases, may facilitate human metabolome characterization in the future.http://www.mdpi.com/2218-1989/8/3/51metabolic networksmass spectral librariesmetabolite annotationmetabolomics data mapping
collection DOAJ
language English
format Article
sources DOAJ
author Clément Frainay
Emma L. Schymanski
Steffen Neumann
Benjamin Merlet
Reza M. Salek
Fabien Jourdan
Oscar Yanes
spellingShingle Clément Frainay
Emma L. Schymanski
Steffen Neumann
Benjamin Merlet
Reza M. Salek
Fabien Jourdan
Oscar Yanes
Mind the Gap: Mapping Mass Spectral Databases in Genome-Scale Metabolic Networks Reveals Poorly Covered Areas
Metabolites
metabolic networks
mass spectral libraries
metabolite annotation
metabolomics data mapping
author_facet Clément Frainay
Emma L. Schymanski
Steffen Neumann
Benjamin Merlet
Reza M. Salek
Fabien Jourdan
Oscar Yanes
author_sort Clément Frainay
title Mind the Gap: Mapping Mass Spectral Databases in Genome-Scale Metabolic Networks Reveals Poorly Covered Areas
title_short Mind the Gap: Mapping Mass Spectral Databases in Genome-Scale Metabolic Networks Reveals Poorly Covered Areas
title_full Mind the Gap: Mapping Mass Spectral Databases in Genome-Scale Metabolic Networks Reveals Poorly Covered Areas
title_fullStr Mind the Gap: Mapping Mass Spectral Databases in Genome-Scale Metabolic Networks Reveals Poorly Covered Areas
title_full_unstemmed Mind the Gap: Mapping Mass Spectral Databases in Genome-Scale Metabolic Networks Reveals Poorly Covered Areas
title_sort mind the gap: mapping mass spectral databases in genome-scale metabolic networks reveals poorly covered areas
publisher MDPI AG
series Metabolites
issn 2218-1989
publishDate 2018-09-01
description The use of mass spectrometry-based metabolomics to study human, plant and microbial biochemistry and their interactions with the environment largely depends on the ability to annotate metabolite structures by matching mass spectral features of the measured metabolites to curated spectra of reference standards. While reference databases for metabolomics now provide information for hundreds of thousands of compounds, barely 5% of these known small molecules have experimental data from pure standards. Remarkably, it is still unknown how well existing mass spectral libraries cover the biochemical landscape of prokaryotic and eukaryotic organisms. To address this issue, we have investigated the coverage of 38 genome-scale metabolic networks by public and commercial mass spectral databases, and found that on average only 40% of nodes in metabolic networks could be mapped by mass spectral information from standards. Next, we deciphered computationally which parts of the human metabolic network are poorly covered by mass spectral libraries, revealing gaps in the eicosanoids, vitamins and bile acid metabolism. Finally, our network topology analysis based on the betweenness centrality of metabolites revealed the top 20 most important metabolites that, if added to MS databases, may facilitate human metabolome characterization in the future.
topic metabolic networks
mass spectral libraries
metabolite annotation
metabolomics data mapping
url http://www.mdpi.com/2218-1989/8/3/51
work_keys_str_mv AT clementfrainay mindthegapmappingmassspectraldatabasesingenomescalemetabolicnetworksrevealspoorlycoveredareas
AT emmalschymanski mindthegapmappingmassspectraldatabasesingenomescalemetabolicnetworksrevealspoorlycoveredareas
AT steffenneumann mindthegapmappingmassspectraldatabasesingenomescalemetabolicnetworksrevealspoorlycoveredareas
AT benjaminmerlet mindthegapmappingmassspectraldatabasesingenomescalemetabolicnetworksrevealspoorlycoveredareas
AT rezamsalek mindthegapmappingmassspectraldatabasesingenomescalemetabolicnetworksrevealspoorlycoveredareas
AT fabienjourdan mindthegapmappingmassspectraldatabasesingenomescalemetabolicnetworksrevealspoorlycoveredareas
AT oscaryanes mindthegapmappingmassspectraldatabasesingenomescalemetabolicnetworksrevealspoorlycoveredareas
_version_ 1725437254737854464