Development and verification of prediction models for preventing cardiovascular diseases.

<h4>Objectives</h4>Cardiovascular disease (CVD) is one of the major causes of death worldwide. For improved accuracy of CVD prediction, risk classification was performed using national time-series health examination data. The data offers an opportunity to access deep learning (RNN-LSTM),...

Full description

Bibliographic Details
Main Authors: Ji Min Sung, In-Jeong Cho, David Sung, Sunhee Kim, Hyeon Chang Kim, Myeong-Hun Chae, Maryam Kavousi, Oscar L Rueda-Ochoa, M Arfan Ikram, Oscar H Franco, Hyuk-Jae Chang
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2019-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0222809
id doaj-9514073918c94e0990638e5cf95799a3
record_format Article
spelling doaj-9514073918c94e0990638e5cf95799a32021-03-04T10:24:12ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-01149e022280910.1371/journal.pone.0222809Development and verification of prediction models for preventing cardiovascular diseases.Ji Min SungIn-Jeong ChoDavid SungSunhee KimHyeon Chang KimMyeong-Hun ChaeMaryam KavousiOscar L Rueda-OchoaM Arfan IkramOscar H FrancoHyuk-Jae Chang<h4>Objectives</h4>Cardiovascular disease (CVD) is one of the major causes of death worldwide. For improved accuracy of CVD prediction, risk classification was performed using national time-series health examination data. The data offers an opportunity to access deep learning (RNN-LSTM), which is widely known as an outstanding algorithm for analyzing time-series datasets. The objective of this study was to show the improved accuracy of deep learning by comparing the performance of a Cox hazard regression and RNN-LSTM based on survival analysis.<h4>Methods and findings</h4>We selected 361,239 subjects (age 40 to 79 years) with more than two health examination records from 2002-2006 using the National Health Insurance System-National Health Screening Cohort (NHIS-HEALS). The average number of health screenings (from 2002-2013) used in the analysis was 2.9 ± 1.0. Two CVD prediction models were developed from the NHIS-HEALS data: a Cox hazard regression model and a deep learning model. In an internal validation of the NHIS-HEALS dataset, the Cox regression model showed a highest time-dependent area under the curve (AUC) of 0.79 (95% CI 0.70 to 0.87) for in females and 0.75 (95% CI 0.70 to 0.80) in males at 2 years. The deep learning model showed a highest time-dependent AUC of 0.94 (95% CI 0.91 to 0.97) for in females and 0.96 (95% CI 0.95 to 0.97) in males at 2 years. Layer-wise Relevance Propagation (LRP) revealed that age was the variable that had the greatest effect on CVD, followed by systolic blood pressure (SBP) and diastolic blood pressure (DBP), in that order.<h4>Conclusion</h4>The performance of the deep learning model for predicting CVD occurrences was better than that of the Cox regression model. In addition, it was confirmed that the known risk factors shown to be important by previous clinical studies were extracted from the study results using LRP.https://doi.org/10.1371/journal.pone.0222809
collection DOAJ
language English
format Article
sources DOAJ
author Ji Min Sung
In-Jeong Cho
David Sung
Sunhee Kim
Hyeon Chang Kim
Myeong-Hun Chae
Maryam Kavousi
Oscar L Rueda-Ochoa
M Arfan Ikram
Oscar H Franco
Hyuk-Jae Chang
spellingShingle Ji Min Sung
In-Jeong Cho
David Sung
Sunhee Kim
Hyeon Chang Kim
Myeong-Hun Chae
Maryam Kavousi
Oscar L Rueda-Ochoa
M Arfan Ikram
Oscar H Franco
Hyuk-Jae Chang
Development and verification of prediction models for preventing cardiovascular diseases.
PLoS ONE
author_facet Ji Min Sung
In-Jeong Cho
David Sung
Sunhee Kim
Hyeon Chang Kim
Myeong-Hun Chae
Maryam Kavousi
Oscar L Rueda-Ochoa
M Arfan Ikram
Oscar H Franco
Hyuk-Jae Chang
author_sort Ji Min Sung
title Development and verification of prediction models for preventing cardiovascular diseases.
title_short Development and verification of prediction models for preventing cardiovascular diseases.
title_full Development and verification of prediction models for preventing cardiovascular diseases.
title_fullStr Development and verification of prediction models for preventing cardiovascular diseases.
title_full_unstemmed Development and verification of prediction models for preventing cardiovascular diseases.
title_sort development and verification of prediction models for preventing cardiovascular diseases.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2019-01-01
description <h4>Objectives</h4>Cardiovascular disease (CVD) is one of the major causes of death worldwide. For improved accuracy of CVD prediction, risk classification was performed using national time-series health examination data. The data offers an opportunity to access deep learning (RNN-LSTM), which is widely known as an outstanding algorithm for analyzing time-series datasets. The objective of this study was to show the improved accuracy of deep learning by comparing the performance of a Cox hazard regression and RNN-LSTM based on survival analysis.<h4>Methods and findings</h4>We selected 361,239 subjects (age 40 to 79 years) with more than two health examination records from 2002-2006 using the National Health Insurance System-National Health Screening Cohort (NHIS-HEALS). The average number of health screenings (from 2002-2013) used in the analysis was 2.9 ± 1.0. Two CVD prediction models were developed from the NHIS-HEALS data: a Cox hazard regression model and a deep learning model. In an internal validation of the NHIS-HEALS dataset, the Cox regression model showed a highest time-dependent area under the curve (AUC) of 0.79 (95% CI 0.70 to 0.87) for in females and 0.75 (95% CI 0.70 to 0.80) in males at 2 years. The deep learning model showed a highest time-dependent AUC of 0.94 (95% CI 0.91 to 0.97) for in females and 0.96 (95% CI 0.95 to 0.97) in males at 2 years. Layer-wise Relevance Propagation (LRP) revealed that age was the variable that had the greatest effect on CVD, followed by systolic blood pressure (SBP) and diastolic blood pressure (DBP), in that order.<h4>Conclusion</h4>The performance of the deep learning model for predicting CVD occurrences was better than that of the Cox regression model. In addition, it was confirmed that the known risk factors shown to be important by previous clinical studies were extracted from the study results using LRP.
url https://doi.org/10.1371/journal.pone.0222809
work_keys_str_mv AT jiminsung developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT injeongcho developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT davidsung developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT sunheekim developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT hyeonchangkim developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT myeonghunchae developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT maryamkavousi developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT oscarlruedaochoa developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT marfanikram developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT oscarhfranco developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
AT hyukjaechang developmentandverificationofpredictionmodelsforpreventingcardiovasculardiseases
_version_ 1714806210913370112