Atom Identifiers Generated by a Neighborhood-Specific Graph Coloring Method Enable Compound Harmonization across Metabolic Databases

Metabolic flux analysis requires both a reliable metabolic model and reliable metabolic profiles in characterizing metabolic reprogramming. Advances in analytic methodologies enable production of high-quality metabolomics datasets capturing isotopic flux. However, useful metabolic models can be diff...

Full description

Bibliographic Details
Published in:Metabolites
Main Authors: Huan Jin, Joshua M. Mitchell, Hunter N. B. Moseley
Format: Article
Language:English
Published: MDPI AG 2020-09-01
Subjects:
Online Access:https://www.mdpi.com/2218-1989/10/9/368
_version_ 1850423564724862976
author Huan Jin
Joshua M. Mitchell
Hunter N. B. Moseley
author_facet Huan Jin
Joshua M. Mitchell
Hunter N. B. Moseley
author_sort Huan Jin
collection DOAJ
container_title Metabolites
description Metabolic flux analysis requires both a reliable metabolic model and reliable metabolic profiles in characterizing metabolic reprogramming. Advances in analytic methodologies enable production of high-quality metabolomics datasets capturing isotopic flux. However, useful metabolic models can be difficult to derive due to the lack of relatively complete atom-resolved metabolic networks for a variety of organisms, including human. Here, we developed a neighborhood-specific graph coloring method that creates unique identifiers for each atom in a compound facilitating construction of an atom-resolved metabolic network. What is more, this method is guaranteed to generate the same identifier for symmetric atoms, enabling automatic identification of possible additional mappings caused by molecular symmetry. Furthermore, a compound coloring identifier derived from the corresponding atom coloring identifiers can be used for compound harmonization across various metabolic network databases, which is an essential first step in network integration. With the compound coloring identifiers, 8865 correspondences between KEGG (Kyoto Encyclopedia of Genes and Genomes) and MetaCyc compounds are detected, with 5451 of them confirmed by other identifiers provided by the two databases. In addition, we found that the Enzyme Commission numbers (EC) of reactions can be used to validate possible correspondence pairs, with 1848 unconfirmed pairs validated by commonality in reaction ECs. Moreover, we were able to detect various issues and errors with compound representation in KEGG and MetaCyc databases by compound coloring identifiers, demonstrating the usefulness of this methodology for database curation.
format Article
id doaj-art-2421cab7bb274e059ef8a0c8e8d4f8a1
institution Directory of Open Access Journals
issn 2218-1989
language English
publishDate 2020-09-01
publisher MDPI AG
record_format Article
spelling doaj-art-2421cab7bb274e059ef8a0c8e8d4f8a12025-08-19T22:41:35ZengMDPI AGMetabolites2218-19892020-09-0110936810.3390/metabo10090368Atom Identifiers Generated by a Neighborhood-Specific Graph Coloring Method Enable Compound Harmonization across Metabolic DatabasesHuan Jin0Joshua M. Mitchell1Hunter N. B. Moseley2Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, KY 40536, USADepartment of Molecular & Cellular Biochemistry, University of Kentucky, Lexington, KY 40536, USADepartment of Molecular & Cellular Biochemistry, University of Kentucky, Lexington, KY 40536, USAMetabolic flux analysis requires both a reliable metabolic model and reliable metabolic profiles in characterizing metabolic reprogramming. Advances in analytic methodologies enable production of high-quality metabolomics datasets capturing isotopic flux. However, useful metabolic models can be difficult to derive due to the lack of relatively complete atom-resolved metabolic networks for a variety of organisms, including human. Here, we developed a neighborhood-specific graph coloring method that creates unique identifiers for each atom in a compound facilitating construction of an atom-resolved metabolic network. What is more, this method is guaranteed to generate the same identifier for symmetric atoms, enabling automatic identification of possible additional mappings caused by molecular symmetry. Furthermore, a compound coloring identifier derived from the corresponding atom coloring identifiers can be used for compound harmonization across various metabolic network databases, which is an essential first step in network integration. With the compound coloring identifiers, 8865 correspondences between KEGG (Kyoto Encyclopedia of Genes and Genomes) and MetaCyc compounds are detected, with 5451 of them confirmed by other identifiers provided by the two databases. In addition, we found that the Enzyme Commission numbers (EC) of reactions can be used to validate possible correspondence pairs, with 1848 unconfirmed pairs validated by commonality in reaction ECs. Moreover, we were able to detect various issues and errors with compound representation in KEGG and MetaCyc databases by compound coloring identifiers, demonstrating the usefulness of this methodology for database curation.https://www.mdpi.com/2218-1989/10/9/368metabolomicsatom-resolved metabolic networkatom identifiercompound identifierdatabase harmonizationgraph theory
spellingShingle Huan Jin
Joshua M. Mitchell
Hunter N. B. Moseley
Atom Identifiers Generated by a Neighborhood-Specific Graph Coloring Method Enable Compound Harmonization across Metabolic Databases
metabolomics
atom-resolved metabolic network
atom identifier
compound identifier
database harmonization
graph theory
title Atom Identifiers Generated by a Neighborhood-Specific Graph Coloring Method Enable Compound Harmonization across Metabolic Databases
title_full Atom Identifiers Generated by a Neighborhood-Specific Graph Coloring Method Enable Compound Harmonization across Metabolic Databases
title_fullStr Atom Identifiers Generated by a Neighborhood-Specific Graph Coloring Method Enable Compound Harmonization across Metabolic Databases
title_full_unstemmed Atom Identifiers Generated by a Neighborhood-Specific Graph Coloring Method Enable Compound Harmonization across Metabolic Databases
title_short Atom Identifiers Generated by a Neighborhood-Specific Graph Coloring Method Enable Compound Harmonization across Metabolic Databases
title_sort atom identifiers generated by a neighborhood specific graph coloring method enable compound harmonization across metabolic databases
topic metabolomics
atom-resolved metabolic network
atom identifier
compound identifier
database harmonization
graph theory
url https://www.mdpi.com/2218-1989/10/9/368
work_keys_str_mv AT huanjin atomidentifiersgeneratedbyaneighborhoodspecificgraphcoloringmethodenablecompoundharmonizationacrossmetabolicdatabases
AT joshuammitchell atomidentifiersgeneratedbyaneighborhoodspecificgraphcoloringmethodenablecompoundharmonizationacrossmetabolicdatabases
AT hunternbmoseley atomidentifiersgeneratedbyaneighborhoodspecificgraphcoloringmethodenablecompoundharmonizationacrossmetabolicdatabases