MLGL-MP: a Multi-Label Graph Learning framework enhanced by pathway interdependence for Metabolic Pathway prediction

MOTIVATION: During lead compound optimization, it is crucial to identify pathways where a drug-like compound is metabolized. Recently, machine learning-based methods have achieved inspiring progress to predict potential metabolic pathways for drug-like compounds. However, they neglect the knowledge...

Full description

Bibliographic Details
Main Authors: Du, B.-X (Author), Nyamabo, A.K (Author), Shi, J.-Y (Author), Yiu, S.-M (Author), Yu, H. (Author), Zhao, P.-C (Author), Zhu, B. (Author)
Format: Article
Language:English
Published: NLM (Medline) 2022
Online Access:View Fulltext in Publisher
LEADER 02714nam a2200205Ia 4500
001 10.1093-bioinformatics-btac222
008 220706s2022 CNT 000 0 und d
020 |a 13674811 (ISSN) 
245 1 0 |a MLGL-MP: a Multi-Label Graph Learning framework enhanced by pathway interdependence for Metabolic Pathway prediction 
260 0 |b NLM (Medline)  |c 2022 
856 |z View Fulltext in Publisher  |u https://doi.org/10.1093/bioinformatics/btac222 
520 3 |a MOTIVATION: During lead compound optimization, it is crucial to identify pathways where a drug-like compound is metabolized. Recently, machine learning-based methods have achieved inspiring progress to predict potential metabolic pathways for drug-like compounds. However, they neglect the knowledge that metabolic pathways are dependent on each other. Moreover, they are inadequate to elucidate why compounds participate in specific pathways. RESULTS: To address these issues, we propose a novel Multi-Label Graph Learning framework of Metabolic Pathway prediction boosted by pathway interdependence, called MLGL-MP, which contains a compound encoder, a pathway encoder and a multi-label predictor. The compound encoder learns compound embedding representations by graph neural networks. After constructing a pathway dependence graph by re-trained word embeddings and pathway co-occurrences, the pathway encoder learns pathway embeddings by graph convolutional networks. Moreover, after adapting the compound embedding space into the pathway embedding space, the multi-label predictor measures the proximity of two spaces to discriminate which pathways a compound participates in. The comparison with state-of-the-art methods on KEGG pathways demonstrates the superiority of our MLGL-MP. Also, the ablation studies reveal how its three components contribute to the model, including the pathway dependence, the adapter between compound embeddings and pathway embeddings, as well as the pre-training strategy. Furthermore, a case study illustrates the interpretability of MLGL-MP by indicating crucial substructures in a compound, which are significantly associated with the attending metabolic pathways. It is anticipated that this work can boost metabolic pathway predictions in drug discovery. AVAILABILITY AND IMPLEMENTATION: The code and data underlying this article are freely available at https://github.com/dubingxue/MLGL-MP. © The Author(s) 2022. Published by Oxford University Press. 
700 1 |a Du, B.-X.  |e author 
700 1 |a Nyamabo, A.K.  |e author 
700 1 |a Shi, J.-Y.  |e author 
700 1 |a Yiu, S.-M.  |e author 
700 1 |a Yu, H.  |e author 
700 1 |a Zhao, P.-C.  |e author 
700 1 |a Zhu, B.  |e author 
773 |t Bioinformatics (Oxford, England)