A biomarker discovery of acute myocardial infarction using feature selection and machine learning

Acute myocardial infarction (AMI) or heart attack is a significant global health threat and one of the leading causes of death. The evolution of machine learning has greatly revamped the risk stratification and death prediction of AMI. In this study, an integrated feature selection and machine learn...

Full description

Bibliographic Details
Main Authors: Chang, S.-W (Author), Hon, W.Y (Author), Khor, S.M (Author), Mohd Faizal, A.S (Author), Thevarajah, T.M (Author)
Format: Article
Language:English
Published: Springer Science and Business Media Deutschland GmbH 2023
Subjects:
Online Access:View Fulltext in Publisher
View in Scopus
LEADER 03436nam a2200457Ia 4500
001 10.1007-s11517-023-02841-y
008 230529s2023 CNT 000 0 und d
020 |a 01400118 (ISSN) 
245 1 0 |a A biomarker discovery of acute myocardial infarction using feature selection and machine learning 
260 0 |b Springer Science and Business Media Deutschland GmbH  |c 2023 
856 |z View Fulltext in Publisher  |u https://doi.org/10.1007/s11517-023-02841-y 
856 |z View in Scopus  |u https://www.scopus.com/inward/record.uri?eid=2-s2.0-85159691386&doi=10.1007%2fs11517-023-02841-y&partnerID=40&md5=d53b55be6b76b8c9a0af0b6036b36297 
520 3 |a Acute myocardial infarction (AMI) or heart attack is a significant global health threat and one of the leading causes of death. The evolution of machine learning has greatly revamped the risk stratification and death prediction of AMI. In this study, an integrated feature selection and machine learning approach was used to identify potential biomarkers for early detection and treatment of AMI. First, feature selection was conducted and evaluated before all classification tasks with machine learning. Full classification models (using all 62 features) and reduced classification models (using various feature selection methods ranging from 5 to 30 features) were built and evaluated using six machine learning classification algorithms. The results showed that the reduced models performed generally better (mean AUPRC via random forest (RF) algorithm for recursive feature elimination (RFE) method ranges from 0.8048 to 0.8260, while for random forest importance (RFI) method, it ranges from 0.8301 to 0.8505) than the full models (mean AUPRC via RF: 0.8044). The most notable finding of this study was the identification of a five-feature model that included cardiac troponin I, HDL cholesterol, HbA1c, anion gap, and albumin, which had achieved comparable results (mean AUPRC via RF: 0.8462) as to the models that containing more features. These five features were proven by the previous studies as significant risk factors for AMI or cardiovascular disease and could be used as potential biomarkers to predict the prognosis of AMI patients. From the medical point of view, fewer features for diagnosis or prognosis could reduce the cost and time of a patient as lesser clinical and pathological tests are needed. Graphical Abstract: [Figure not available: see fulltext.]. © 2023, International Federation for Medical and Biological Engineering. 
650 0 4 |a Acute myocardial infarction 
650 0 4 |a Biomarker 
650 0 4 |a Bio-marker discovery 
650 0 4 |a Biomarkers 
650 0 4 |a Cardiology 
650 0 4 |a Causes of death 
650 0 4 |a Classification 
650 0 4 |a Classification (of information) 
650 0 4 |a Classification models 
650 0 4 |a Diagnosis 
650 0 4 |a Feature selection 
650 0 4 |a Feature Selection 
650 0 4 |a Features selection 
650 0 4 |a Forestry 
650 0 4 |a Global health 
650 0 4 |a Health risks 
650 0 4 |a Heart 
650 0 4 |a Heart attack 
650 0 4 |a Machine learning 
650 0 4 |a Machine-learning 
650 0 4 |a Random forests 
650 0 4 |a Risk stratification 
700 1 0 |a Chang, S.-W.  |e author 
700 1 0 |a Hon, W.Y.  |e author 
700 1 0 |a Khor, S.M.  |e author 
700 1 0 |a Mohd Faizal, A.S.  |e author 
700 1 0 |a Thevarajah, T.M.  |e author 
773 |t Medical and Biological Engineering and Computing