The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease.

Genome and exome sequencing in large cohorts enables characterization of the role of rare variation in complex diseases. Success in this endeavor, however, requires investigators to test a diverse array of genetic hypotheses which differ in the number, frequency and effect sizes of underlying causal...

Full description

Bibliographic Details
Main Authors: Loukas Moutsianas, Vineeta Agarwala, Christian Fuchsberger, Jason Flannick, Manuel A Rivas, Kyle J Gaulton, Patrick K Albers, GoT2D Consortium, Gil McVean, Michael Boehnke, David Altshuler, Mark I McCarthy
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2015-04-01
Series:PLoS Genetics
Online Access:http://europepmc.org/articles/PMC4407972?pdf=render
id doaj-23e3e7a41c6740f3b005952244d83e35
record_format Article
spelling doaj-23e3e7a41c6740f3b005952244d83e352020-11-25T01:38:40ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042015-04-01114e100516510.1371/journal.pgen.1005165The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease.Loukas MoutsianasVineeta AgarwalaChristian FuchsbergerJason FlannickManuel A RivasKyle J GaultonPatrick K AlbersGoT2D ConsortiumGil McVeanMichael BoehnkeDavid AltshulerMark I McCarthyGenome and exome sequencing in large cohorts enables characterization of the role of rare variation in complex diseases. Success in this endeavor, however, requires investigators to test a diverse array of genetic hypotheses which differ in the number, frequency and effect sizes of underlying causal variants. In this study, we evaluated the power of gene-based association methods to interrogate such hypotheses, and examined the implications for study design. We developed a flexible simulation approach, using 1000 Genomes data, to (a) generate sequence variation at human genes in up to 10K case-control samples, and (b) quantify the statistical power of a panel of widely used gene-based association tests under a variety of allelic architectures, locus effect sizes, and significance thresholds. For loci explaining ~1% of phenotypic variance underlying a common dichotomous trait, we find that all methods have low absolute power to achieve exome-wide significance (~5-20% power at α = 2.5 × 10(-6)) in 3K individuals; even in 10K samples, power is modest (~60%). The combined application of multiple methods increases sensitivity, but does so at the expense of a higher false positive rate. MiST, SKAT-O, and KBAC have the highest individual mean power across simulated datasets, but we observe wide architecture-dependent variability in the individual loci detected by each test, suggesting that inferences about disease architecture from analysis of sequencing studies can differ depending on which methods are used. Our results imply that tens of thousands of individuals, extensive functional annotation, or highly targeted hypothesis testing will be required to confidently detect or exclude rare variant signals at complex disease loci.http://europepmc.org/articles/PMC4407972?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Loukas Moutsianas
Vineeta Agarwala
Christian Fuchsberger
Jason Flannick
Manuel A Rivas
Kyle J Gaulton
Patrick K Albers
GoT2D Consortium
Gil McVean
Michael Boehnke
David Altshuler
Mark I McCarthy
spellingShingle Loukas Moutsianas
Vineeta Agarwala
Christian Fuchsberger
Jason Flannick
Manuel A Rivas
Kyle J Gaulton
Patrick K Albers
GoT2D Consortium
Gil McVean
Michael Boehnke
David Altshuler
Mark I McCarthy
The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease.
PLoS Genetics
author_facet Loukas Moutsianas
Vineeta Agarwala
Christian Fuchsberger
Jason Flannick
Manuel A Rivas
Kyle J Gaulton
Patrick K Albers
GoT2D Consortium
Gil McVean
Michael Boehnke
David Altshuler
Mark I McCarthy
author_sort Loukas Moutsianas
title The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease.
title_short The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease.
title_full The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease.
title_fullStr The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease.
title_full_unstemmed The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease.
title_sort power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease.
publisher Public Library of Science (PLoS)
series PLoS Genetics
issn 1553-7390
1553-7404
publishDate 2015-04-01
description Genome and exome sequencing in large cohorts enables characterization of the role of rare variation in complex diseases. Success in this endeavor, however, requires investigators to test a diverse array of genetic hypotheses which differ in the number, frequency and effect sizes of underlying causal variants. In this study, we evaluated the power of gene-based association methods to interrogate such hypotheses, and examined the implications for study design. We developed a flexible simulation approach, using 1000 Genomes data, to (a) generate sequence variation at human genes in up to 10K case-control samples, and (b) quantify the statistical power of a panel of widely used gene-based association tests under a variety of allelic architectures, locus effect sizes, and significance thresholds. For loci explaining ~1% of phenotypic variance underlying a common dichotomous trait, we find that all methods have low absolute power to achieve exome-wide significance (~5-20% power at α = 2.5 × 10(-6)) in 3K individuals; even in 10K samples, power is modest (~60%). The combined application of multiple methods increases sensitivity, but does so at the expense of a higher false positive rate. MiST, SKAT-O, and KBAC have the highest individual mean power across simulated datasets, but we observe wide architecture-dependent variability in the individual loci detected by each test, suggesting that inferences about disease architecture from analysis of sequencing studies can differ depending on which methods are used. Our results imply that tens of thousands of individuals, extensive functional annotation, or highly targeted hypothesis testing will be required to confidently detect or exclude rare variant signals at complex disease loci.
url http://europepmc.org/articles/PMC4407972?pdf=render
work_keys_str_mv AT loukasmoutsianas thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT vineetaagarwala thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT christianfuchsberger thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT jasonflannick thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT manuelarivas thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT kylejgaulton thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT patrickkalbers thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT got2dconsortium thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT gilmcvean thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT michaelboehnke thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT davidaltshuler thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT markimccarthy thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT loukasmoutsianas powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT vineetaagarwala powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT christianfuchsberger powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT jasonflannick powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT manuelarivas powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT kylejgaulton powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT patrickkalbers powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT got2dconsortium powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT gilmcvean powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT michaelboehnke powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT davidaltshuler powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT markimccarthy powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
_version_ 1725052285757685760