Constructing Hypothetical Risk Data from the Area under the ROC Curve: Modelling Distributions of Polygenic Risk.

BACKGROUND:Modeling studies using hypothetical polygenic risk data can be an efficient tool for investigating the effectiveness of downstream applications such as targeting interventions to risk groups to justify whether empirical investigation is warranted. We investigated the assumptions underlyin...

Full description

Bibliographic Details
Main Authors: Suman Kundu, Jannigje G Kers, A Cecile J W Janssens
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2016-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4811433?pdf=render
id doaj-f9152a055b014ebd8ba497c2174b8299
record_format Article
spelling doaj-f9152a055b014ebd8ba497c2174b82992020-11-25T00:05:34ZengPublic Library of Science (PLoS)PLoS ONE1932-62032016-01-01113e015235910.1371/journal.pone.0152359Constructing Hypothetical Risk Data from the Area under the ROC Curve: Modelling Distributions of Polygenic Risk.Suman KunduJannigje G KersA Cecile J W JanssensBACKGROUND:Modeling studies using hypothetical polygenic risk data can be an efficient tool for investigating the effectiveness of downstream applications such as targeting interventions to risk groups to justify whether empirical investigation is warranted. We investigated the assumptions underlying a method that simulates risk data for specific values of the area under the receiver operating characteristic curve (AUC). METHODS:The simulation method constructs risk data for a hypothetical population based on the population disease risk, and the odds ratios and frequencies of genetic variants. By systematically varying the parameters, we investigated under what conditions AUC values represent unique ROC curves with unique risk distributions for patients and nonpatients, and to what extend risk data can be simulated for precise values of the AUC. RESULTS:Using larger number of genetic variants each with a modest effect, we observed that the distributions of estimated risks of patients and nonpatients were similar for various combinations of the odds ratios and frequencies of the risk alleles. Simulated ROC curves overlapped empirical curves with the same AUC. CONCLUSIONS:Polygenic risk data can be effectively and efficiently created using a simulation method. This allows to further investigate the potential applications of stratifying interventions on the basis of polygenic risk.http://europepmc.org/articles/PMC4811433?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Suman Kundu
Jannigje G Kers
A Cecile J W Janssens
spellingShingle Suman Kundu
Jannigje G Kers
A Cecile J W Janssens
Constructing Hypothetical Risk Data from the Area under the ROC Curve: Modelling Distributions of Polygenic Risk.
PLoS ONE
author_facet Suman Kundu
Jannigje G Kers
A Cecile J W Janssens
author_sort Suman Kundu
title Constructing Hypothetical Risk Data from the Area under the ROC Curve: Modelling Distributions of Polygenic Risk.
title_short Constructing Hypothetical Risk Data from the Area under the ROC Curve: Modelling Distributions of Polygenic Risk.
title_full Constructing Hypothetical Risk Data from the Area under the ROC Curve: Modelling Distributions of Polygenic Risk.
title_fullStr Constructing Hypothetical Risk Data from the Area under the ROC Curve: Modelling Distributions of Polygenic Risk.
title_full_unstemmed Constructing Hypothetical Risk Data from the Area under the ROC Curve: Modelling Distributions of Polygenic Risk.
title_sort constructing hypothetical risk data from the area under the roc curve: modelling distributions of polygenic risk.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2016-01-01
description BACKGROUND:Modeling studies using hypothetical polygenic risk data can be an efficient tool for investigating the effectiveness of downstream applications such as targeting interventions to risk groups to justify whether empirical investigation is warranted. We investigated the assumptions underlying a method that simulates risk data for specific values of the area under the receiver operating characteristic curve (AUC). METHODS:The simulation method constructs risk data for a hypothetical population based on the population disease risk, and the odds ratios and frequencies of genetic variants. By systematically varying the parameters, we investigated under what conditions AUC values represent unique ROC curves with unique risk distributions for patients and nonpatients, and to what extend risk data can be simulated for precise values of the AUC. RESULTS:Using larger number of genetic variants each with a modest effect, we observed that the distributions of estimated risks of patients and nonpatients were similar for various combinations of the odds ratios and frequencies of the risk alleles. Simulated ROC curves overlapped empirical curves with the same AUC. CONCLUSIONS:Polygenic risk data can be effectively and efficiently created using a simulation method. This allows to further investigate the potential applications of stratifying interventions on the basis of polygenic risk.
url http://europepmc.org/articles/PMC4811433?pdf=render
work_keys_str_mv AT sumankundu constructinghypotheticalriskdatafromtheareaundertheroccurvemodellingdistributionsofpolygenicrisk
AT jannigjegkers constructinghypotheticalriskdatafromtheareaundertheroccurvemodellingdistributionsofpolygenicrisk
AT acecilejwjanssens constructinghypotheticalriskdatafromtheareaundertheroccurvemodellingdistributionsofpolygenicrisk
_version_ 1725424487189446656