Prediction model development of late-onset preeclampsia using machine learning-based methods.

Preeclampsia is one of the leading causes of maternal and fetal morbidity and mortality. Due to the lack of effective preventive measures, its prediction is essential to its prompt management. This study aimed to develop models using machine learning to predict late-onset preeclampsia using hospital...

Full description

Bibliographic Details
Main Authors:	Jong Hyun Jhee, SungHee Lee, Yejin Park, Sang Eun Lee, Young Ah Kim, Shin-Wook Kang, Ja-Young Kwon, Jung Tak Park
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2019-01-01
Series:	PLoS ONE
Online Access:	https://doi.org/10.1371/journal.pone.0221202

id	doaj-0b0d918532b84a83baed698cf37e25b7
record_format	Article
spelling	doaj-0b0d918532b84a83baed698cf37e25b72021-03-03T19:50:24ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-01148e022120210.1371/journal.pone.0221202Prediction model development of late-onset preeclampsia using machine learning-based methods.Jong Hyun JheeSungHee LeeYejin ParkSang Eun LeeYoung Ah KimShin-Wook KangJa-Young KwonJung Tak ParkPreeclampsia is one of the leading causes of maternal and fetal morbidity and mortality. Due to the lack of effective preventive measures, its prediction is essential to its prompt management. This study aimed to develop models using machine learning to predict late-onset preeclampsia using hospital electronic medical record data. The performance of the machine learning based models and models using conventional statistical methods were also compared. A total of 11,006 pregnant women who received antenatal care at Yonsei University Hospital were included. Maternal data were retrieved from electronic medical records during the early second trimester to 34 weeks. The prediction outcome was late-onset preeclampsia occurrence after 34 weeks' gestation. Pattern recognition and cluster analysis were used to select the parameters included in the prediction models. Logistic regression, decision tree model, naïve Bayes classification, support vector machine, random forest algorithm, and stochastic gradient boosting method were used to construct the prediction models. C-statistics was used to assess the performance of each model. The overall preeclampsia development rate was 4.7% (474 patients). Systolic blood pressure, serum blood urea nitrogen and creatinine levels, platelet counts, serum potassium level, white blood cell count, serum calcium level, and urinary protein were the most influential variables included in the prediction models. C-statistics for the decision tree model, naïve Bayes classification, support vector machine, random forest algorithm, stochastic gradient boosting method, and logistic regression models were 0.857, 0.776, 0.573, 0.894, 0.924, and 0.806, respectively. The stochastic gradient boosting model had the best prediction performance with an accuracy and false positive rate of 0.973 and 0.009, respectively. The combined use of maternal factors and common antenatal laboratory data of the early second trimester through early third trimester could effectively predict late-onset preeclampsia using machine learning algorithms. Future prospective studies are needed to verify the clinical applicability algorithms.https://doi.org/10.1371/journal.pone.0221202
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Jong Hyun Jhee SungHee Lee Yejin Park Sang Eun Lee Young Ah Kim Shin-Wook Kang Ja-Young Kwon Jung Tak Park
spellingShingle	Jong Hyun Jhee SungHee Lee Yejin Park Sang Eun Lee Young Ah Kim Shin-Wook Kang Ja-Young Kwon Jung Tak Park Prediction model development of late-onset preeclampsia using machine learning-based methods. PLoS ONE
author_facet	Jong Hyun Jhee SungHee Lee Yejin Park Sang Eun Lee Young Ah Kim Shin-Wook Kang Ja-Young Kwon Jung Tak Park
author_sort	Jong Hyun Jhee
title	Prediction model development of late-onset preeclampsia using machine learning-based methods.
title_short	Prediction model development of late-onset preeclampsia using machine learning-based methods.
title_full	Prediction model development of late-onset preeclampsia using machine learning-based methods.
title_fullStr	Prediction model development of late-onset preeclampsia using machine learning-based methods.
title_full_unstemmed	Prediction model development of late-onset preeclampsia using machine learning-based methods.
title_sort	prediction model development of late-onset preeclampsia using machine learning-based methods.
publisher	Public Library of Science (PLoS)
series	PLoS ONE
issn	1932-6203
publishDate	2019-01-01
description	Preeclampsia is one of the leading causes of maternal and fetal morbidity and mortality. Due to the lack of effective preventive measures, its prediction is essential to its prompt management. This study aimed to develop models using machine learning to predict late-onset preeclampsia using hospital electronic medical record data. The performance of the machine learning based models and models using conventional statistical methods were also compared. A total of 11,006 pregnant women who received antenatal care at Yonsei University Hospital were included. Maternal data were retrieved from electronic medical records during the early second trimester to 34 weeks. The prediction outcome was late-onset preeclampsia occurrence after 34 weeks' gestation. Pattern recognition and cluster analysis were used to select the parameters included in the prediction models. Logistic regression, decision tree model, naïve Bayes classification, support vector machine, random forest algorithm, and stochastic gradient boosting method were used to construct the prediction models. C-statistics was used to assess the performance of each model. The overall preeclampsia development rate was 4.7% (474 patients). Systolic blood pressure, serum blood urea nitrogen and creatinine levels, platelet counts, serum potassium level, white blood cell count, serum calcium level, and urinary protein were the most influential variables included in the prediction models. C-statistics for the decision tree model, naïve Bayes classification, support vector machine, random forest algorithm, stochastic gradient boosting method, and logistic regression models were 0.857, 0.776, 0.573, 0.894, 0.924, and 0.806, respectively. The stochastic gradient boosting model had the best prediction performance with an accuracy and false positive rate of 0.973 and 0.009, respectively. The combined use of maternal factors and common antenatal laboratory data of the early second trimester through early third trimester could effectively predict late-onset preeclampsia using machine learning algorithms. Future prospective studies are needed to verify the clinical applicability algorithms.
url	https://doi.org/10.1371/journal.pone.0221202
work_keys_str_mv	AT jonghyunjhee predictionmodeldevelopmentoflateonsetpreeclampsiausingmachinelearningbasedmethods AT sungheelee predictionmodeldevelopmentoflateonsetpreeclampsiausingmachinelearningbasedmethods AT yejinpark predictionmodeldevelopmentoflateonsetpreeclampsiausingmachinelearningbasedmethods AT sangeunlee predictionmodeldevelopmentoflateonsetpreeclampsiausingmachinelearningbasedmethods AT youngahkim predictionmodeldevelopmentoflateonsetpreeclampsiausingmachinelearningbasedmethods AT shinwookkang predictionmodeldevelopmentoflateonsetpreeclampsiausingmachinelearningbasedmethods AT jayoungkwon predictionmodeldevelopmentoflateonsetpreeclampsiausingmachinelearningbasedmethods AT jungtakpark predictionmodeldevelopmentoflateonsetpreeclampsiausingmachinelearningbasedmethods
_version_	1714825383355875328

Prediction model development of late-onset preeclampsia using machine learning-based methods.

Similar Items