The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease.
Genome and exome sequencing in large cohorts enables characterization of the role of rare variation in complex diseases. Success in this endeavor, however, requires investigators to test a diverse array of genetic hypotheses which differ in the number, frequency and effect sizes of underlying causal...
Main Authors: | , , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2015-04-01
|
Series: | PLoS Genetics |
Online Access: | http://europepmc.org/articles/PMC4407972?pdf=render |
id |
doaj-23e3e7a41c6740f3b005952244d83e35 |
---|---|
record_format |
Article |
spelling |
doaj-23e3e7a41c6740f3b005952244d83e352020-11-25T01:38:40ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042015-04-01114e100516510.1371/journal.pgen.1005165The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease.Loukas MoutsianasVineeta AgarwalaChristian FuchsbergerJason FlannickManuel A RivasKyle J GaultonPatrick K AlbersGoT2D ConsortiumGil McVeanMichael BoehnkeDavid AltshulerMark I McCarthyGenome and exome sequencing in large cohorts enables characterization of the role of rare variation in complex diseases. Success in this endeavor, however, requires investigators to test a diverse array of genetic hypotheses which differ in the number, frequency and effect sizes of underlying causal variants. In this study, we evaluated the power of gene-based association methods to interrogate such hypotheses, and examined the implications for study design. We developed a flexible simulation approach, using 1000 Genomes data, to (a) generate sequence variation at human genes in up to 10K case-control samples, and (b) quantify the statistical power of a panel of widely used gene-based association tests under a variety of allelic architectures, locus effect sizes, and significance thresholds. For loci explaining ~1% of phenotypic variance underlying a common dichotomous trait, we find that all methods have low absolute power to achieve exome-wide significance (~5-20% power at α = 2.5 × 10(-6)) in 3K individuals; even in 10K samples, power is modest (~60%). The combined application of multiple methods increases sensitivity, but does so at the expense of a higher false positive rate. MiST, SKAT-O, and KBAC have the highest individual mean power across simulated datasets, but we observe wide architecture-dependent variability in the individual loci detected by each test, suggesting that inferences about disease architecture from analysis of sequencing studies can differ depending on which methods are used. Our results imply that tens of thousands of individuals, extensive functional annotation, or highly targeted hypothesis testing will be required to confidently detect or exclude rare variant signals at complex disease loci.http://europepmc.org/articles/PMC4407972?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Loukas Moutsianas Vineeta Agarwala Christian Fuchsberger Jason Flannick Manuel A Rivas Kyle J Gaulton Patrick K Albers GoT2D Consortium Gil McVean Michael Boehnke David Altshuler Mark I McCarthy |
spellingShingle |
Loukas Moutsianas Vineeta Agarwala Christian Fuchsberger Jason Flannick Manuel A Rivas Kyle J Gaulton Patrick K Albers GoT2D Consortium Gil McVean Michael Boehnke David Altshuler Mark I McCarthy The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease. PLoS Genetics |
author_facet |
Loukas Moutsianas Vineeta Agarwala Christian Fuchsberger Jason Flannick Manuel A Rivas Kyle J Gaulton Patrick K Albers GoT2D Consortium Gil McVean Michael Boehnke David Altshuler Mark I McCarthy |
author_sort |
Loukas Moutsianas |
title |
The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease. |
title_short |
The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease. |
title_full |
The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease. |
title_fullStr |
The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease. |
title_full_unstemmed |
The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease. |
title_sort |
power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS Genetics |
issn |
1553-7390 1553-7404 |
publishDate |
2015-04-01 |
description |
Genome and exome sequencing in large cohorts enables characterization of the role of rare variation in complex diseases. Success in this endeavor, however, requires investigators to test a diverse array of genetic hypotheses which differ in the number, frequency and effect sizes of underlying causal variants. In this study, we evaluated the power of gene-based association methods to interrogate such hypotheses, and examined the implications for study design. We developed a flexible simulation approach, using 1000 Genomes data, to (a) generate sequence variation at human genes in up to 10K case-control samples, and (b) quantify the statistical power of a panel of widely used gene-based association tests under a variety of allelic architectures, locus effect sizes, and significance thresholds. For loci explaining ~1% of phenotypic variance underlying a common dichotomous trait, we find that all methods have low absolute power to achieve exome-wide significance (~5-20% power at α = 2.5 × 10(-6)) in 3K individuals; even in 10K samples, power is modest (~60%). The combined application of multiple methods increases sensitivity, but does so at the expense of a higher false positive rate. MiST, SKAT-O, and KBAC have the highest individual mean power across simulated datasets, but we observe wide architecture-dependent variability in the individual loci detected by each test, suggesting that inferences about disease architecture from analysis of sequencing studies can differ depending on which methods are used. Our results imply that tens of thousands of individuals, extensive functional annotation, or highly targeted hypothesis testing will be required to confidently detect or exclude rare variant signals at complex disease loci. |
url |
http://europepmc.org/articles/PMC4407972?pdf=render |
work_keys_str_mv |
AT loukasmoutsianas thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT vineetaagarwala thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT christianfuchsberger thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT jasonflannick thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT manuelarivas thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT kylejgaulton thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT patrickkalbers thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT got2dconsortium thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT gilmcvean thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT michaelboehnke thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT davidaltshuler thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT markimccarthy thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT loukasmoutsianas powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT vineetaagarwala powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT christianfuchsberger powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT jasonflannick powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT manuelarivas powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT kylejgaulton powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT patrickkalbers powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT got2dconsortium powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT gilmcvean powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT michaelboehnke powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT davidaltshuler powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease AT markimccarthy powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease |
_version_ |
1725052285757685760 |