Machine learning approaches for the prediction of bone mineral density by using genomic and phenotypic data of 5130 older men

Abstract The study aimed to utilize machine learning (ML) approaches and genomic data to develop a prediction model for bone mineral density (BMD) and identify the best modeling approach for BMD prediction. The genomic and phenotypic data of Osteoporotic Fractures in Men Study (n = 5130) was analyze...

Full description

Bibliographic Details
Main Authors: Qing Wu, Fatma Nasoz, Jongyun Jung, Bibek Bhattarai, Mira V. Han, Robert A. Greenes, Kenneth G. Saag
Format: Article
Language:English
Published: Nature Publishing Group 2021-02-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-021-83828-3
id doaj-d62fca4849fd4679a89037fdc1c465b7
record_format Article
spelling doaj-d62fca4849fd4679a89037fdc1c465b72021-03-11T12:14:16ZengNature Publishing GroupScientific Reports2045-23222021-02-0111111010.1038/s41598-021-83828-3Machine learning approaches for the prediction of bone mineral density by using genomic and phenotypic data of 5130 older menQing Wu0Fatma Nasoz1Jongyun Jung2Bibek Bhattarai3Mira V. Han4Robert A. Greenes5Kenneth G. Saag6Nevada Institute of Personalized Medicine, University of Nevada Las VegasDepartment of Computer Science, University of NevadaNevada Institute of Personalized Medicine, University of Nevada Las VegasDepartment of Computer Science, University of NevadaNevada Institute of Personalized Medicine, University of Nevada Las VegasCollege of Health Solutions, Arizona State UniversityDepartment of Medicine, Division of Clinical Immunology and Rheumatology, the University of Alabama at BirminghamAbstract The study aimed to utilize machine learning (ML) approaches and genomic data to develop a prediction model for bone mineral density (BMD) and identify the best modeling approach for BMD prediction. The genomic and phenotypic data of Osteoporotic Fractures in Men Study (n = 5130) was analyzed. Genetic risk score (GRS) was calculated from 1103 associated SNPs for each participant after a comprehensive genotype imputation. Data were normalized and divided into a training set (80%) and a validation set (20%) for analysis. Random forest, gradient boosting, neural network, and linear regression were used to develop BMD prediction models separately. Ten-fold cross-validation was used for hyper-parameters optimization. Mean square error and mean absolute error were used to assess model performance. When using GRS and phenotypic covariates as the predictors, all ML models’ performance and linear regression in BMD prediction were similar. However, when replacing GRS with the 1103 individual SNPs in the model, ML models performed significantly better than linear regression (with lasso regularization), and the gradient boosting model performed the best. Our study suggested that ML models, especially gradient boosting, can improve BMD prediction in genomic data.https://doi.org/10.1038/s41598-021-83828-3
collection DOAJ
language English
format Article
sources DOAJ
author Qing Wu
Fatma Nasoz
Jongyun Jung
Bibek Bhattarai
Mira V. Han
Robert A. Greenes
Kenneth G. Saag
spellingShingle Qing Wu
Fatma Nasoz
Jongyun Jung
Bibek Bhattarai
Mira V. Han
Robert A. Greenes
Kenneth G. Saag
Machine learning approaches for the prediction of bone mineral density by using genomic and phenotypic data of 5130 older men
Scientific Reports
author_facet Qing Wu
Fatma Nasoz
Jongyun Jung
Bibek Bhattarai
Mira V. Han
Robert A. Greenes
Kenneth G. Saag
author_sort Qing Wu
title Machine learning approaches for the prediction of bone mineral density by using genomic and phenotypic data of 5130 older men
title_short Machine learning approaches for the prediction of bone mineral density by using genomic and phenotypic data of 5130 older men
title_full Machine learning approaches for the prediction of bone mineral density by using genomic and phenotypic data of 5130 older men
title_fullStr Machine learning approaches for the prediction of bone mineral density by using genomic and phenotypic data of 5130 older men
title_full_unstemmed Machine learning approaches for the prediction of bone mineral density by using genomic and phenotypic data of 5130 older men
title_sort machine learning approaches for the prediction of bone mineral density by using genomic and phenotypic data of 5130 older men
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2021-02-01
description Abstract The study aimed to utilize machine learning (ML) approaches and genomic data to develop a prediction model for bone mineral density (BMD) and identify the best modeling approach for BMD prediction. The genomic and phenotypic data of Osteoporotic Fractures in Men Study (n = 5130) was analyzed. Genetic risk score (GRS) was calculated from 1103 associated SNPs for each participant after a comprehensive genotype imputation. Data were normalized and divided into a training set (80%) and a validation set (20%) for analysis. Random forest, gradient boosting, neural network, and linear regression were used to develop BMD prediction models separately. Ten-fold cross-validation was used for hyper-parameters optimization. Mean square error and mean absolute error were used to assess model performance. When using GRS and phenotypic covariates as the predictors, all ML models’ performance and linear regression in BMD prediction were similar. However, when replacing GRS with the 1103 individual SNPs in the model, ML models performed significantly better than linear regression (with lasso regularization), and the gradient boosting model performed the best. Our study suggested that ML models, especially gradient boosting, can improve BMD prediction in genomic data.
url https://doi.org/10.1038/s41598-021-83828-3
work_keys_str_mv AT qingwu machinelearningapproachesforthepredictionofbonemineraldensitybyusinggenomicandphenotypicdataof5130oldermen
AT fatmanasoz machinelearningapproachesforthepredictionofbonemineraldensitybyusinggenomicandphenotypicdataof5130oldermen
AT jongyunjung machinelearningapproachesforthepredictionofbonemineraldensitybyusinggenomicandphenotypicdataof5130oldermen
AT bibekbhattarai machinelearningapproachesforthepredictionofbonemineraldensitybyusinggenomicandphenotypicdataof5130oldermen
AT miravhan machinelearningapproachesforthepredictionofbonemineraldensitybyusinggenomicandphenotypicdataof5130oldermen
AT robertagreenes machinelearningapproachesforthepredictionofbonemineraldensitybyusinggenomicandphenotypicdataof5130oldermen
AT kennethgsaag machinelearningapproachesforthepredictionofbonemineraldensitybyusinggenomicandphenotypicdataof5130oldermen
_version_ 1724224640217251840