Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy Population

Background: Coronary artery calcium score (CACS) is a reliable predictor for future cardiovascular disease risk. Although deep learning studies using computed tomography (CT) images to predict CACS have been reported, no study has assessed the feasibility of machine learning (ML) algorithms to predi...

Full description

Bibliographic Details
Main Authors: Jongseok Lee, Jae-Sung Lim, Younggi Chu, Chang Hee Lee, Ohk-Hyun Ryu, Hyun Hee Choi, Yong Soon Park, Chulho Kim
Format: Article
Language:English
Published: MDPI AG 2020-08-01
Series:Journal of Personalized Medicine
Subjects:
Online Access:https://www.mdpi.com/2075-4426/10/3/96
id doaj-56e92e0b5ab1447991ebdc682915cae7
record_format Article
spelling doaj-56e92e0b5ab1447991ebdc682915cae72020-11-25T03:14:48ZengMDPI AGJournal of Personalized Medicine2075-44262020-08-0110969610.3390/jpm10030096Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy PopulationJongseok Lee0Jae-Sung Lim1Younggi Chu2Chang Hee Lee3Ohk-Hyun Ryu4Hyun Hee Choi5Yong Soon Park6Chulho Kim7School of Business Administration, Hallym University, Chuncheon 24252, KoreaDepartment of Neurology, Hallym University Sacred Heart Hospital, Anyang 14068, KoreaIndustry-University Cooperation Group, Hallym University, Chuncheon 24252, KoreaSchool of Business Administration, Hallym University, Chuncheon 24252, KoreaDepartment of Endocrinology, Chuncheon Sacred Heart Hospital, Chuncheon 24253, KoreaDepartment of Cardiology, Chuncheon Sacred Heart Hospital, Chuncheon 24253, KoreaDepartment of Family Medicine, Chuncheon Sacred Heart Hospital, Chuncheon 24253, KoreaDepartment of Neurology, Chuncheon Sacred Heart Hospital, Chuncheon 24253, KoreaBackground: Coronary artery calcium score (CACS) is a reliable predictor for future cardiovascular disease risk. Although deep learning studies using computed tomography (CT) images to predict CACS have been reported, no study has assessed the feasibility of machine learning (ML) algorithms to predict the CACS using clinical variables in a healthy general population. Therefore, we aimed to assess whether ML algorithms other than binary logistic regression (BLR) could predict high CACS in a healthy population with general health examination data. Methods: This retrospective observational study included participants who had regular health screening including coronary CT angiography. High CACS was defined by the Agatston score ≥ 100. Univariable and multivariable BLR was performed to assess predictors for high CACS in the entire dataset. When performing ML prediction for high CACS, the dataset was randomly divided into a training and test dataset with a 7:3 ratio. BLR, catboost, and xgboost algorithms with 5-fold cross-validation and grid search technique were used to find the best performing classifier. Performance comparison of each ML algorithm was evaluated with the area under the receiver operating characteristic (AUROC) curve. Results: A total of 2133 participants were included in the final analysis. Mean age and proportion of male sex were 55.4 ± 11.3 years and 1483 (69.5%), respectively. In multivariable BLR analysis, age (odds ratio [OR], 1.12; 95% confidence interval [CI], 1.10–1.15, <i>p</i> < 0.001), male sex (OR, 2.91; 95% CI, 1.57–5.38, <i>p</i> < 0.001), systolic blood pressure (OR, 1.02; 95% CI, 1.00–1.03, <i>p</i> = 0.019), and low-density lipoprotein cholesterol (OR, 1.00; 95% CI, 0.99–1.00, <i>p</i> = 0.047) were significant predictors for high CACS. Performance in predicting high CACS of xgboost was AUROC of 0.823, followed by catboost (0.750) and BLR (0.585). The comparison of AUROC between xgboost and BLR was significant (<i>p</i> for AUROC comparison < 0.001). Conclusions: Xgboost ML algorithm was found to be a more reliable predictor of CACS in healthy participants compared to the BLR algorithm. ML algorithms may be useful for predicting CACS with only laboratory data in healthy participants.https://www.mdpi.com/2075-4426/10/3/96cardiovascular diseasemachine learningcoronary artery calcium scorecoronary computed tomography angiography
collection DOAJ
language English
format Article
sources DOAJ
author Jongseok Lee
Jae-Sung Lim
Younggi Chu
Chang Hee Lee
Ohk-Hyun Ryu
Hyun Hee Choi
Yong Soon Park
Chulho Kim
spellingShingle Jongseok Lee
Jae-Sung Lim
Younggi Chu
Chang Hee Lee
Ohk-Hyun Ryu
Hyun Hee Choi
Yong Soon Park
Chulho Kim
Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy Population
Journal of Personalized Medicine
cardiovascular disease
machine learning
coronary artery calcium score
coronary computed tomography angiography
author_facet Jongseok Lee
Jae-Sung Lim
Younggi Chu
Chang Hee Lee
Ohk-Hyun Ryu
Hyun Hee Choi
Yong Soon Park
Chulho Kim
author_sort Jongseok Lee
title Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy Population
title_short Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy Population
title_full Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy Population
title_fullStr Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy Population
title_full_unstemmed Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy Population
title_sort prediction of coronary artery calcium score using machine learning in a healthy population
publisher MDPI AG
series Journal of Personalized Medicine
issn 2075-4426
publishDate 2020-08-01
description Background: Coronary artery calcium score (CACS) is a reliable predictor for future cardiovascular disease risk. Although deep learning studies using computed tomography (CT) images to predict CACS have been reported, no study has assessed the feasibility of machine learning (ML) algorithms to predict the CACS using clinical variables in a healthy general population. Therefore, we aimed to assess whether ML algorithms other than binary logistic regression (BLR) could predict high CACS in a healthy population with general health examination data. Methods: This retrospective observational study included participants who had regular health screening including coronary CT angiography. High CACS was defined by the Agatston score ≥ 100. Univariable and multivariable BLR was performed to assess predictors for high CACS in the entire dataset. When performing ML prediction for high CACS, the dataset was randomly divided into a training and test dataset with a 7:3 ratio. BLR, catboost, and xgboost algorithms with 5-fold cross-validation and grid search technique were used to find the best performing classifier. Performance comparison of each ML algorithm was evaluated with the area under the receiver operating characteristic (AUROC) curve. Results: A total of 2133 participants were included in the final analysis. Mean age and proportion of male sex were 55.4 ± 11.3 years and 1483 (69.5%), respectively. In multivariable BLR analysis, age (odds ratio [OR], 1.12; 95% confidence interval [CI], 1.10–1.15, <i>p</i> < 0.001), male sex (OR, 2.91; 95% CI, 1.57–5.38, <i>p</i> < 0.001), systolic blood pressure (OR, 1.02; 95% CI, 1.00–1.03, <i>p</i> = 0.019), and low-density lipoprotein cholesterol (OR, 1.00; 95% CI, 0.99–1.00, <i>p</i> = 0.047) were significant predictors for high CACS. Performance in predicting high CACS of xgboost was AUROC of 0.823, followed by catboost (0.750) and BLR (0.585). The comparison of AUROC between xgboost and BLR was significant (<i>p</i> for AUROC comparison < 0.001). Conclusions: Xgboost ML algorithm was found to be a more reliable predictor of CACS in healthy participants compared to the BLR algorithm. ML algorithms may be useful for predicting CACS with only laboratory data in healthy participants.
topic cardiovascular disease
machine learning
coronary artery calcium score
coronary computed tomography angiography
url https://www.mdpi.com/2075-4426/10/3/96
work_keys_str_mv AT jongseoklee predictionofcoronaryarterycalciumscoreusingmachinelearninginahealthypopulation
AT jaesunglim predictionofcoronaryarterycalciumscoreusingmachinelearninginahealthypopulation
AT younggichu predictionofcoronaryarterycalciumscoreusingmachinelearninginahealthypopulation
AT changheelee predictionofcoronaryarterycalciumscoreusingmachinelearninginahealthypopulation
AT ohkhyunryu predictionofcoronaryarterycalciumscoreusingmachinelearninginahealthypopulation
AT hyunheechoi predictionofcoronaryarterycalciumscoreusingmachinelearninginahealthypopulation
AT yongsoonpark predictionofcoronaryarterycalciumscoreusingmachinelearninginahealthypopulation
AT chulhokim predictionofcoronaryarterycalciumscoreusingmachinelearninginahealthypopulation
_version_ 1724642271454822400