Using genetic prediction from known complex disease Loci to guide the design of next-generation sequencing experiments.

A central focus of complex disease genetics after genome-wide association studies (GWAS) is to identify low frequency and rare risk variants, which may account for an important fraction of disease heritability unexplained by GWAS. A profusion of studies using next-generation sequencing are seeking s...

Full description

Bibliographic Details
Main Authors: Luke Jostins, Adam P Levine, Jeffrey C Barrett
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3799779?pdf=render
id doaj-330ffb3a92ac41d895d6636d2a3009ff
record_format Article
spelling doaj-330ffb3a92ac41d895d6636d2a3009ff2020-11-25T02:06:25ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-01810e7632810.1371/journal.pone.0076328Using genetic prediction from known complex disease Loci to guide the design of next-generation sequencing experiments.Luke JostinsAdam P LevineJeffrey C BarrettA central focus of complex disease genetics after genome-wide association studies (GWAS) is to identify low frequency and rare risk variants, which may account for an important fraction of disease heritability unexplained by GWAS. A profusion of studies using next-generation sequencing are seeking such risk alleles. We describe how already-known complex trait loci (largely from GWAS) can be used to guide the design of these new studies by selecting cases, controls, or families who are most likely to harbor undiscovered risk alleles. We show that genetic risk prediction can select unrelated cases from large cohorts who are enriched for unknown risk factors, or multiply-affected families that are more likely to harbor high-penetrance risk alleles. We derive the frequency of an undiscovered risk allele in selected cases and controls, and show how this relates to the variance explained by the risk score, the disease prevalence and the population frequency of the risk allele. We also describe a new method for informing the design of sequencing studies using genetic risk prediction in large partially-genotyped families using an extension of the Inside-Outside algorithm for inference on trees. We explore several study design scenarios using both simulated and real data, and show that in many cases genetic risk prediction can provide significant increases in power to detect low-frequency and rare risk alleles. The same approach can also be used to aid discovery of non-genetic risk factors, suggesting possible future utility of genetic risk prediction in conventional epidemiology. Software implementing the methods in this paper is available in the R package Mangrove.http://europepmc.org/articles/PMC3799779?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Luke Jostins
Adam P Levine
Jeffrey C Barrett
spellingShingle Luke Jostins
Adam P Levine
Jeffrey C Barrett
Using genetic prediction from known complex disease Loci to guide the design of next-generation sequencing experiments.
PLoS ONE
author_facet Luke Jostins
Adam P Levine
Jeffrey C Barrett
author_sort Luke Jostins
title Using genetic prediction from known complex disease Loci to guide the design of next-generation sequencing experiments.
title_short Using genetic prediction from known complex disease Loci to guide the design of next-generation sequencing experiments.
title_full Using genetic prediction from known complex disease Loci to guide the design of next-generation sequencing experiments.
title_fullStr Using genetic prediction from known complex disease Loci to guide the design of next-generation sequencing experiments.
title_full_unstemmed Using genetic prediction from known complex disease Loci to guide the design of next-generation sequencing experiments.
title_sort using genetic prediction from known complex disease loci to guide the design of next-generation sequencing experiments.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2013-01-01
description A central focus of complex disease genetics after genome-wide association studies (GWAS) is to identify low frequency and rare risk variants, which may account for an important fraction of disease heritability unexplained by GWAS. A profusion of studies using next-generation sequencing are seeking such risk alleles. We describe how already-known complex trait loci (largely from GWAS) can be used to guide the design of these new studies by selecting cases, controls, or families who are most likely to harbor undiscovered risk alleles. We show that genetic risk prediction can select unrelated cases from large cohorts who are enriched for unknown risk factors, or multiply-affected families that are more likely to harbor high-penetrance risk alleles. We derive the frequency of an undiscovered risk allele in selected cases and controls, and show how this relates to the variance explained by the risk score, the disease prevalence and the population frequency of the risk allele. We also describe a new method for informing the design of sequencing studies using genetic risk prediction in large partially-genotyped families using an extension of the Inside-Outside algorithm for inference on trees. We explore several study design scenarios using both simulated and real data, and show that in many cases genetic risk prediction can provide significant increases in power to detect low-frequency and rare risk alleles. The same approach can also be used to aid discovery of non-genetic risk factors, suggesting possible future utility of genetic risk prediction in conventional epidemiology. Software implementing the methods in this paper is available in the R package Mangrove.
url http://europepmc.org/articles/PMC3799779?pdf=render
work_keys_str_mv AT lukejostins usinggeneticpredictionfromknowncomplexdiseaselocitoguidethedesignofnextgenerationsequencingexperiments
AT adamplevine usinggeneticpredictionfromknowncomplexdiseaselocitoguidethedesignofnextgenerationsequencingexperiments
AT jeffreycbarrett usinggeneticpredictionfromknowncomplexdiseaselocitoguidethedesignofnextgenerationsequencingexperiments
_version_ 1724934070700343296