Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy Population
Background: Coronary artery calcium score (CACS) is a reliable predictor for future cardiovascular disease risk. Although deep learning studies using computed tomography (CT) images to predict CACS have been reported, no study has assessed the feasibility of machine learning (ML) algorithms to predi...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-08-01
|
Series: | Journal of Personalized Medicine |
Subjects: | |
Online Access: | https://www.mdpi.com/2075-4426/10/3/96 |
id |
doaj-56e92e0b5ab1447991ebdc682915cae7 |
---|---|
record_format |
Article |
spelling |
doaj-56e92e0b5ab1447991ebdc682915cae72020-11-25T03:14:48ZengMDPI AGJournal of Personalized Medicine2075-44262020-08-0110969610.3390/jpm10030096Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy PopulationJongseok Lee0Jae-Sung Lim1Younggi Chu2Chang Hee Lee3Ohk-Hyun Ryu4Hyun Hee Choi5Yong Soon Park6Chulho Kim7School of Business Administration, Hallym University, Chuncheon 24252, KoreaDepartment of Neurology, Hallym University Sacred Heart Hospital, Anyang 14068, KoreaIndustry-University Cooperation Group, Hallym University, Chuncheon 24252, KoreaSchool of Business Administration, Hallym University, Chuncheon 24252, KoreaDepartment of Endocrinology, Chuncheon Sacred Heart Hospital, Chuncheon 24253, KoreaDepartment of Cardiology, Chuncheon Sacred Heart Hospital, Chuncheon 24253, KoreaDepartment of Family Medicine, Chuncheon Sacred Heart Hospital, Chuncheon 24253, KoreaDepartment of Neurology, Chuncheon Sacred Heart Hospital, Chuncheon 24253, KoreaBackground: Coronary artery calcium score (CACS) is a reliable predictor for future cardiovascular disease risk. Although deep learning studies using computed tomography (CT) images to predict CACS have been reported, no study has assessed the feasibility of machine learning (ML) algorithms to predict the CACS using clinical variables in a healthy general population. Therefore, we aimed to assess whether ML algorithms other than binary logistic regression (BLR) could predict high CACS in a healthy population with general health examination data. Methods: This retrospective observational study included participants who had regular health screening including coronary CT angiography. High CACS was defined by the Agatston score ≥ 100. Univariable and multivariable BLR was performed to assess predictors for high CACS in the entire dataset. When performing ML prediction for high CACS, the dataset was randomly divided into a training and test dataset with a 7:3 ratio. BLR, catboost, and xgboost algorithms with 5-fold cross-validation and grid search technique were used to find the best performing classifier. Performance comparison of each ML algorithm was evaluated with the area under the receiver operating characteristic (AUROC) curve. Results: A total of 2133 participants were included in the final analysis. Mean age and proportion of male sex were 55.4 ± 11.3 years and 1483 (69.5%), respectively. In multivariable BLR analysis, age (odds ratio [OR], 1.12; 95% confidence interval [CI], 1.10–1.15, <i>p</i> < 0.001), male sex (OR, 2.91; 95% CI, 1.57–5.38, <i>p</i> < 0.001), systolic blood pressure (OR, 1.02; 95% CI, 1.00–1.03, <i>p</i> = 0.019), and low-density lipoprotein cholesterol (OR, 1.00; 95% CI, 0.99–1.00, <i>p</i> = 0.047) were significant predictors for high CACS. Performance in predicting high CACS of xgboost was AUROC of 0.823, followed by catboost (0.750) and BLR (0.585). The comparison of AUROC between xgboost and BLR was significant (<i>p</i> for AUROC comparison < 0.001). Conclusions: Xgboost ML algorithm was found to be a more reliable predictor of CACS in healthy participants compared to the BLR algorithm. ML algorithms may be useful for predicting CACS with only laboratory data in healthy participants.https://www.mdpi.com/2075-4426/10/3/96cardiovascular diseasemachine learningcoronary artery calcium scorecoronary computed tomography angiography |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jongseok Lee Jae-Sung Lim Younggi Chu Chang Hee Lee Ohk-Hyun Ryu Hyun Hee Choi Yong Soon Park Chulho Kim |
spellingShingle |
Jongseok Lee Jae-Sung Lim Younggi Chu Chang Hee Lee Ohk-Hyun Ryu Hyun Hee Choi Yong Soon Park Chulho Kim Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy Population Journal of Personalized Medicine cardiovascular disease machine learning coronary artery calcium score coronary computed tomography angiography |
author_facet |
Jongseok Lee Jae-Sung Lim Younggi Chu Chang Hee Lee Ohk-Hyun Ryu Hyun Hee Choi Yong Soon Park Chulho Kim |
author_sort |
Jongseok Lee |
title |
Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy Population |
title_short |
Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy Population |
title_full |
Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy Population |
title_fullStr |
Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy Population |
title_full_unstemmed |
Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy Population |
title_sort |
prediction of coronary artery calcium score using machine learning in a healthy population |
publisher |
MDPI AG |
series |
Journal of Personalized Medicine |
issn |
2075-4426 |
publishDate |
2020-08-01 |
description |
Background: Coronary artery calcium score (CACS) is a reliable predictor for future cardiovascular disease risk. Although deep learning studies using computed tomography (CT) images to predict CACS have been reported, no study has assessed the feasibility of machine learning (ML) algorithms to predict the CACS using clinical variables in a healthy general population. Therefore, we aimed to assess whether ML algorithms other than binary logistic regression (BLR) could predict high CACS in a healthy population with general health examination data. Methods: This retrospective observational study included participants who had regular health screening including coronary CT angiography. High CACS was defined by the Agatston score ≥ 100. Univariable and multivariable BLR was performed to assess predictors for high CACS in the entire dataset. When performing ML prediction for high CACS, the dataset was randomly divided into a training and test dataset with a 7:3 ratio. BLR, catboost, and xgboost algorithms with 5-fold cross-validation and grid search technique were used to find the best performing classifier. Performance comparison of each ML algorithm was evaluated with the area under the receiver operating characteristic (AUROC) curve. Results: A total of 2133 participants were included in the final analysis. Mean age and proportion of male sex were 55.4 ± 11.3 years and 1483 (69.5%), respectively. In multivariable BLR analysis, age (odds ratio [OR], 1.12; 95% confidence interval [CI], 1.10–1.15, <i>p</i> < 0.001), male sex (OR, 2.91; 95% CI, 1.57–5.38, <i>p</i> < 0.001), systolic blood pressure (OR, 1.02; 95% CI, 1.00–1.03, <i>p</i> = 0.019), and low-density lipoprotein cholesterol (OR, 1.00; 95% CI, 0.99–1.00, <i>p</i> = 0.047) were significant predictors for high CACS. Performance in predicting high CACS of xgboost was AUROC of 0.823, followed by catboost (0.750) and BLR (0.585). The comparison of AUROC between xgboost and BLR was significant (<i>p</i> for AUROC comparison < 0.001). Conclusions: Xgboost ML algorithm was found to be a more reliable predictor of CACS in healthy participants compared to the BLR algorithm. ML algorithms may be useful for predicting CACS with only laboratory data in healthy participants. |
topic |
cardiovascular disease machine learning coronary artery calcium score coronary computed tomography angiography |
url |
https://www.mdpi.com/2075-4426/10/3/96 |
work_keys_str_mv |
AT jongseoklee predictionofcoronaryarterycalciumscoreusingmachinelearninginahealthypopulation AT jaesunglim predictionofcoronaryarterycalciumscoreusingmachinelearninginahealthypopulation AT younggichu predictionofcoronaryarterycalciumscoreusingmachinelearninginahealthypopulation AT changheelee predictionofcoronaryarterycalciumscoreusingmachinelearninginahealthypopulation AT ohkhyunryu predictionofcoronaryarterycalciumscoreusingmachinelearninginahealthypopulation AT hyunheechoi predictionofcoronaryarterycalciumscoreusingmachinelearninginahealthypopulation AT yongsoonpark predictionofcoronaryarterycalciumscoreusingmachinelearninginahealthypopulation AT chulhokim predictionofcoronaryarterycalciumscoreusingmachinelearninginahealthypopulation |
_version_ |
1724642271454822400 |