Statistical and machine learning methods for analysis of multiplex protein data from a novel proximity extension assay in patients with ST-elevation myocardial infarction

Abstract Using data from patients with ST-elevation myocardial infarction (STEMI), we explored how machine learning methods can be used for analysing multiplex protein data obtained from proximity extension assays. Blood samples were obtained from 48 STEMI-patients at admission and after three month...

Full description

Bibliographic Details
Main Authors: Emil Maag, Archana Kulasingam, Erik Lerkevang Grove, Kamilla Sofie Pedersen, Steen Dalby Kristensen, Anne-Mette Hvas
Format: Article
Language:English
Published: Nature Publishing Group 2021-07-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-021-93162-3
id doaj-78ceba10a74a4ba4906cb33c355737e9
record_format Article
spelling doaj-78ceba10a74a4ba4906cb33c355737e92021-07-04T11:28:11ZengNature Publishing GroupScientific Reports2045-23222021-07-0111111010.1038/s41598-021-93162-3Statistical and machine learning methods for analysis of multiplex protein data from a novel proximity extension assay in patients with ST-elevation myocardial infarctionEmil Maag0Archana Kulasingam1Erik Lerkevang Grove2Kamilla Sofie Pedersen3Steen Dalby Kristensen4Anne-Mette Hvas5BioXpedia A/S, Palle Juul-JensensDepartment of Cardiology, Aarhus University HospitalDepartment of Cardiology, Aarhus University HospitalBioXpedia A/S, Palle Juul-JensensDepartment of Cardiology, Aarhus University HospitalDepartment of Clinical Medicine, Faculty of Health, Aarhus UniversityAbstract Using data from patients with ST-elevation myocardial infarction (STEMI), we explored how machine learning methods can be used for analysing multiplex protein data obtained from proximity extension assays. Blood samples were obtained from 48 STEMI-patients at admission and after three months. A subset of patients also had blood samples obtained at four and 12 h after admission. Multiplex protein data were obtained using a proximity extension assay. A random forest model was used to assess the predictive power and importance of biomarkers to distinguish between the acute and the stable phase. The similarity of response profiles was investigated using K-means clustering. Out of 92 proteins, 26 proteins were found to significantly distinguish the acute and the stable phase following STEMI. The five proteins tissue factor pathway inhibitor, azurocidin, spondin-1, myeloperoxidase and myoglobin were found to be highly important for differentiating between the acute and the stable phase. Four of these proteins shared response profiles over the four time-points. Machine learning methods can be used to identify and assess novel predictive biomarkers as showcased in the present study population of patients with STEMI.https://doi.org/10.1038/s41598-021-93162-3
collection DOAJ
language English
format Article
sources DOAJ
author Emil Maag
Archana Kulasingam
Erik Lerkevang Grove
Kamilla Sofie Pedersen
Steen Dalby Kristensen
Anne-Mette Hvas
spellingShingle Emil Maag
Archana Kulasingam
Erik Lerkevang Grove
Kamilla Sofie Pedersen
Steen Dalby Kristensen
Anne-Mette Hvas
Statistical and machine learning methods for analysis of multiplex protein data from a novel proximity extension assay in patients with ST-elevation myocardial infarction
Scientific Reports
author_facet Emil Maag
Archana Kulasingam
Erik Lerkevang Grove
Kamilla Sofie Pedersen
Steen Dalby Kristensen
Anne-Mette Hvas
author_sort Emil Maag
title Statistical and machine learning methods for analysis of multiplex protein data from a novel proximity extension assay in patients with ST-elevation myocardial infarction
title_short Statistical and machine learning methods for analysis of multiplex protein data from a novel proximity extension assay in patients with ST-elevation myocardial infarction
title_full Statistical and machine learning methods for analysis of multiplex protein data from a novel proximity extension assay in patients with ST-elevation myocardial infarction
title_fullStr Statistical and machine learning methods for analysis of multiplex protein data from a novel proximity extension assay in patients with ST-elevation myocardial infarction
title_full_unstemmed Statistical and machine learning methods for analysis of multiplex protein data from a novel proximity extension assay in patients with ST-elevation myocardial infarction
title_sort statistical and machine learning methods for analysis of multiplex protein data from a novel proximity extension assay in patients with st-elevation myocardial infarction
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2021-07-01
description Abstract Using data from patients with ST-elevation myocardial infarction (STEMI), we explored how machine learning methods can be used for analysing multiplex protein data obtained from proximity extension assays. Blood samples were obtained from 48 STEMI-patients at admission and after three months. A subset of patients also had blood samples obtained at four and 12 h after admission. Multiplex protein data were obtained using a proximity extension assay. A random forest model was used to assess the predictive power and importance of biomarkers to distinguish between the acute and the stable phase. The similarity of response profiles was investigated using K-means clustering. Out of 92 proteins, 26 proteins were found to significantly distinguish the acute and the stable phase following STEMI. The five proteins tissue factor pathway inhibitor, azurocidin, spondin-1, myeloperoxidase and myoglobin were found to be highly important for differentiating between the acute and the stable phase. Four of these proteins shared response profiles over the four time-points. Machine learning methods can be used to identify and assess novel predictive biomarkers as showcased in the present study population of patients with STEMI.
url https://doi.org/10.1038/s41598-021-93162-3
work_keys_str_mv AT emilmaag statisticalandmachinelearningmethodsforanalysisofmultiplexproteindatafromanovelproximityextensionassayinpatientswithstelevationmyocardialinfarction
AT archanakulasingam statisticalandmachinelearningmethodsforanalysisofmultiplexproteindatafromanovelproximityextensionassayinpatientswithstelevationmyocardialinfarction
AT eriklerkevanggrove statisticalandmachinelearningmethodsforanalysisofmultiplexproteindatafromanovelproximityextensionassayinpatientswithstelevationmyocardialinfarction
AT kamillasofiepedersen statisticalandmachinelearningmethodsforanalysisofmultiplexproteindatafromanovelproximityextensionassayinpatientswithstelevationmyocardialinfarction
AT steendalbykristensen statisticalandmachinelearningmethodsforanalysisofmultiplexproteindatafromanovelproximityextensionassayinpatientswithstelevationmyocardialinfarction
AT annemettehvas statisticalandmachinelearningmethodsforanalysisofmultiplexproteindatafromanovelproximityextensionassayinpatientswithstelevationmyocardialinfarction
_version_ 1721320314452836352