SMILE: systems metabolomics using interpretable learning and evolution

Background: Direct link between metabolism and cell and organism phenotype in health and disease makes metabolomics, a high throughput study of small molecular metabolites, an essential methodology for understanding and diagnosing disease development and progression. Machine learning methods have se...

Full description

Bibliographic Details
Main Authors: Cuperlovic-Culf, M. (Author), Hu, T. (Author), Sha, C. (Author)
Format: Article
Language:English
Published: BioMed Central Ltd 2021
Subjects:
Online Access:View Fulltext in Publisher
LEADER 03457nam a2200529Ia 4500
001 10.1186-s12859-021-04209-1
008 220427s2021 CNT 000 0 und d
020 |a 14712105 (ISSN) 
245 1 0 |a SMILE: systems metabolomics using interpretable learning and evolution 
260 0 |b BioMed Central Ltd  |c 2021 
856 |z View Fulltext in Publisher  |u https://doi.org/10.1186/s12859-021-04209-1 
520 3 |a Background: Direct link between metabolism and cell and organism phenotype in health and disease makes metabolomics, a high throughput study of small molecular metabolites, an essential methodology for understanding and diagnosing disease development and progression. Machine learning methods have seen increasing adoptions in metabolomics thanks to their powerful prediction abilities. However, the “black-box” nature of many machine learning models remains a major challenge for wide acceptance and utility as it makes the interpretation of decision process difficult. This challenge is particularly predominant in biomedical research where understanding of the underlying decision making mechanism is essential for insuring safety and gaining new knowledge. Results: In this article, we proposed a novel computational framework, Systems Metabolomics using Interpretable Learning and Evolution (SMILE), for supervised metabolomics data analysis. Our methodology uses an evolutionary algorithm to learn interpretable predictive models and to identify the most influential metabolites and their interactions in association with disease. Moreover, we have developed a web application with a graphical user interface that can be used for easy analysis, interpretation and visualization of the results. Performance of the method and utilization of the web interface is shown using metabolomics data for Alzheimer’s disease. Conclusions: SMILE was able to identify several influential metabolites on AD and to provide interpretable predictive models that can be further used for a better understanding of the metabolic background of AD. SMILE addresses the emerging issue of interpretability and explainability in machine learning, and contributes to more transparent and powerful applications of machine learning in bioinformatics. © 2021, The Author(s). 
650 0 4 |a algorithm 
650 0 4 |a Algorithms 
650 0 4 |a Alzheimer’s disease 
650 0 4 |a Association reactions 
650 0 4 |a biology 
650 0 4 |a Biomedical research 
650 0 4 |a Biomolecules 
650 0 4 |a Computational Biology 
650 0 4 |a Computational framework 
650 0 4 |a Decision making 
650 0 4 |a Decision-making mechanisms 
650 0 4 |a Diagnosis 
650 0 4 |a Disease development 
650 0 4 |a Evolutionary algorithm 
650 0 4 |a Evolutionary algorithms 
650 0 4 |a Feature interaction 
650 0 4 |a Graphical user interfaces 
650 0 4 |a High throughput studies 
650 0 4 |a Interpretable machine learning 
650 0 4 |a Learning and evolution 
650 0 4 |a machine learning 
650 0 4 |a Machine learning 
650 0 4 |a Machine Learning 
650 0 4 |a Machine learning methods 
650 0 4 |a Machine learning models 
650 0 4 |a Metabolism 
650 0 4 |a Metabolites 
650 0 4 |a metabolomics 
650 0 4 |a Metabolomics 
650 0 4 |a Metabolomics 
650 0 4 |a Predictive analytics 
700 1 |a Cuperlovic-Culf, M.  |e author 
700 1 |a Hu, T.  |e author 
700 1 |a Sha, C.  |e author 
773 |t BMC Bioinformatics