SMILE: systems metabolomics using interpretable learning and evolution

Abstract Background Direct link between metabolism and cell and organism phenotype in health and disease makes metabolomics, a high throughput study of small molecular metabolites, an essential methodology for understanding and diagnosing disease development and progression. Machine learning methods...

Full description

Bibliographic Details
Main Authors: Chengyuan Sha, Miroslava Cuperlovic-Culf, Ting Hu
Format: Article
Language:English
Published: BMC 2021-05-01
Series:BMC Bioinformatics
Subjects:
Online Access:https://doi.org/10.1186/s12859-021-04209-1
id doaj-0d068470168c430eb89d0f55abad981a
record_format Article
spelling doaj-0d068470168c430eb89d0f55abad981a2021-05-30T11:52:50ZengBMCBMC Bioinformatics1471-21052021-05-0122111710.1186/s12859-021-04209-1SMILE: systems metabolomics using interpretable learning and evolutionChengyuan Sha0Miroslava Cuperlovic-Culf1Ting Hu2School of Computing, Queen’s UniversityDigital Technologies Research Center, National Research Council CanadaSchool of Computing, Queen’s UniversityAbstract Background Direct link between metabolism and cell and organism phenotype in health and disease makes metabolomics, a high throughput study of small molecular metabolites, an essential methodology for understanding and diagnosing disease development and progression. Machine learning methods have seen increasing adoptions in metabolomics thanks to their powerful prediction abilities. However, the “black-box” nature of many machine learning models remains a major challenge for wide acceptance and utility as it makes the interpretation of decision process difficult. This challenge is particularly predominant in biomedical research where understanding of the underlying decision making mechanism is essential for insuring safety and gaining new knowledge. Results In this article, we proposed a novel computational framework, Systems Metabolomics using Interpretable Learning and Evolution (SMILE), for supervised metabolomics data analysis. Our methodology uses an evolutionary algorithm to learn interpretable predictive models and to identify the most influential metabolites and their interactions in association with disease. Moreover, we have developed a web application with a graphical user interface that can be used for easy analysis, interpretation and visualization of the results. Performance of the method and utilization of the web interface is shown using metabolomics data for Alzheimer’s disease. Conclusions SMILE was able to identify several influential metabolites on AD and to provide interpretable predictive models that can be further used for a better understanding of the metabolic background of AD. SMILE addresses the emerging issue of interpretability and explainability in machine learning, and contributes to more transparent and powerful applications of machine learning in bioinformatics.https://doi.org/10.1186/s12859-021-04209-1MetabolomicsAlzheimer’s diseaseInterpretable machine learningFeature interactionEvolutionary algorithm
collection DOAJ
language English
format Article
sources DOAJ
author Chengyuan Sha
Miroslava Cuperlovic-Culf
Ting Hu
spellingShingle Chengyuan Sha
Miroslava Cuperlovic-Culf
Ting Hu
SMILE: systems metabolomics using interpretable learning and evolution
BMC Bioinformatics
Metabolomics
Alzheimer’s disease
Interpretable machine learning
Feature interaction
Evolutionary algorithm
author_facet Chengyuan Sha
Miroslava Cuperlovic-Culf
Ting Hu
author_sort Chengyuan Sha
title SMILE: systems metabolomics using interpretable learning and evolution
title_short SMILE: systems metabolomics using interpretable learning and evolution
title_full SMILE: systems metabolomics using interpretable learning and evolution
title_fullStr SMILE: systems metabolomics using interpretable learning and evolution
title_full_unstemmed SMILE: systems metabolomics using interpretable learning and evolution
title_sort smile: systems metabolomics using interpretable learning and evolution
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2021-05-01
description Abstract Background Direct link between metabolism and cell and organism phenotype in health and disease makes metabolomics, a high throughput study of small molecular metabolites, an essential methodology for understanding and diagnosing disease development and progression. Machine learning methods have seen increasing adoptions in metabolomics thanks to their powerful prediction abilities. However, the “black-box” nature of many machine learning models remains a major challenge for wide acceptance and utility as it makes the interpretation of decision process difficult. This challenge is particularly predominant in biomedical research where understanding of the underlying decision making mechanism is essential for insuring safety and gaining new knowledge. Results In this article, we proposed a novel computational framework, Systems Metabolomics using Interpretable Learning and Evolution (SMILE), for supervised metabolomics data analysis. Our methodology uses an evolutionary algorithm to learn interpretable predictive models and to identify the most influential metabolites and their interactions in association with disease. Moreover, we have developed a web application with a graphical user interface that can be used for easy analysis, interpretation and visualization of the results. Performance of the method and utilization of the web interface is shown using metabolomics data for Alzheimer’s disease. Conclusions SMILE was able to identify several influential metabolites on AD and to provide interpretable predictive models that can be further used for a better understanding of the metabolic background of AD. SMILE addresses the emerging issue of interpretability and explainability in machine learning, and contributes to more transparent and powerful applications of machine learning in bioinformatics.
topic Metabolomics
Alzheimer’s disease
Interpretable machine learning
Feature interaction
Evolutionary algorithm
url https://doi.org/10.1186/s12859-021-04209-1
work_keys_str_mv AT chengyuansha smilesystemsmetabolomicsusinginterpretablelearningandevolution
AT miroslavacuperlovicculf smilesystemsmetabolomicsusinginterpretablelearningandevolution
AT tinghu smilesystemsmetabolomicsusinginterpretablelearningandevolution
_version_ 1721419894668394496