Integrated model of de novo and inherited genetic variants yields greater power to identify risk genes.

De novo mutations affect risk for many diseases and disorders, especially those with early-onset. An example is autism spectrum disorders (ASD). Four recent whole-exome sequencing (WES) studies of ASD families revealed a handful of novel risk genes, based on independent de novo loss-of-function (LoF...

Full description

Bibliographic Details
Main Authors: Xin He, Stephan J Sanders, Li Liu, Silvia De Rubeis, Elaine T Lim, James S Sutcliffe, Gerard D Schellenberg, Richard A Gibbs, Mark J Daly, Joseph D Buxbaum, Matthew W State, Bernie Devlin, Kathryn Roeder
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS Genetics
Online Access:http://europepmc.org/articles/PMC3744441?pdf=render
id doaj-d44add84a10d49dd9e3b88a43eeb3a39
record_format Article
spelling doaj-d44add84a10d49dd9e3b88a43eeb3a392020-11-24T21:19:12ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042013-01-0198e100367110.1371/journal.pgen.1003671Integrated model of de novo and inherited genetic variants yields greater power to identify risk genes.Xin HeStephan J SandersLi LiuSilvia De RubeisElaine T LimJames S SutcliffeGerard D SchellenbergRichard A GibbsMark J DalyJoseph D BuxbaumMatthew W StateBernie DevlinKathryn RoederDe novo mutations affect risk for many diseases and disorders, especially those with early-onset. An example is autism spectrum disorders (ASD). Four recent whole-exome sequencing (WES) studies of ASD families revealed a handful of novel risk genes, based on independent de novo loss-of-function (LoF) mutations falling in the same gene, and found that de novo LoF mutations occurred at a twofold higher rate than expected by chance. However successful these studies were, they used only a small fraction of the data, excluding other types of de novo mutations and inherited rare variants. Moreover, such analyses cannot readily incorporate data from case-control studies. An important research challenge in gene discovery, therefore, is to develop statistical methods that accommodate a broader class of rare variation. We develop methods that can incorporate WES data regarding de novo mutations, inherited variants present, and variants identified within cases and controls. TADA, for Transmission And De novo Association, integrates these data by a gene-based likelihood model involving parameters for allele frequencies and gene-specific penetrances. Inference is based on a Hierarchical Bayes strategy that borrows information across all genes to infer parameters that would be difficult to estimate for individual genes. In addition to theoretical development we validated TADA using realistic simulations mimicking rare, large-effect mutations affecting risk for ASD and show it has dramatically better power than other common methods of analysis. Thus TADA's integration of various kinds of WES data can be a highly effective means of identifying novel risk genes. Indeed, application of TADA to WES data from subjects with ASD and their families, as well as from a study of ASD subjects and controls, revealed several novel and promising ASD candidate genes with strong statistical support.http://europepmc.org/articles/PMC3744441?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Xin He
Stephan J Sanders
Li Liu
Silvia De Rubeis
Elaine T Lim
James S Sutcliffe
Gerard D Schellenberg
Richard A Gibbs
Mark J Daly
Joseph D Buxbaum
Matthew W State
Bernie Devlin
Kathryn Roeder
spellingShingle Xin He
Stephan J Sanders
Li Liu
Silvia De Rubeis
Elaine T Lim
James S Sutcliffe
Gerard D Schellenberg
Richard A Gibbs
Mark J Daly
Joseph D Buxbaum
Matthew W State
Bernie Devlin
Kathryn Roeder
Integrated model of de novo and inherited genetic variants yields greater power to identify risk genes.
PLoS Genetics
author_facet Xin He
Stephan J Sanders
Li Liu
Silvia De Rubeis
Elaine T Lim
James S Sutcliffe
Gerard D Schellenberg
Richard A Gibbs
Mark J Daly
Joseph D Buxbaum
Matthew W State
Bernie Devlin
Kathryn Roeder
author_sort Xin He
title Integrated model of de novo and inherited genetic variants yields greater power to identify risk genes.
title_short Integrated model of de novo and inherited genetic variants yields greater power to identify risk genes.
title_full Integrated model of de novo and inherited genetic variants yields greater power to identify risk genes.
title_fullStr Integrated model of de novo and inherited genetic variants yields greater power to identify risk genes.
title_full_unstemmed Integrated model of de novo and inherited genetic variants yields greater power to identify risk genes.
title_sort integrated model of de novo and inherited genetic variants yields greater power to identify risk genes.
publisher Public Library of Science (PLoS)
series PLoS Genetics
issn 1553-7390
1553-7404
publishDate 2013-01-01
description De novo mutations affect risk for many diseases and disorders, especially those with early-onset. An example is autism spectrum disorders (ASD). Four recent whole-exome sequencing (WES) studies of ASD families revealed a handful of novel risk genes, based on independent de novo loss-of-function (LoF) mutations falling in the same gene, and found that de novo LoF mutations occurred at a twofold higher rate than expected by chance. However successful these studies were, they used only a small fraction of the data, excluding other types of de novo mutations and inherited rare variants. Moreover, such analyses cannot readily incorporate data from case-control studies. An important research challenge in gene discovery, therefore, is to develop statistical methods that accommodate a broader class of rare variation. We develop methods that can incorporate WES data regarding de novo mutations, inherited variants present, and variants identified within cases and controls. TADA, for Transmission And De novo Association, integrates these data by a gene-based likelihood model involving parameters for allele frequencies and gene-specific penetrances. Inference is based on a Hierarchical Bayes strategy that borrows information across all genes to infer parameters that would be difficult to estimate for individual genes. In addition to theoretical development we validated TADA using realistic simulations mimicking rare, large-effect mutations affecting risk for ASD and show it has dramatically better power than other common methods of analysis. Thus TADA's integration of various kinds of WES data can be a highly effective means of identifying novel risk genes. Indeed, application of TADA to WES data from subjects with ASD and their families, as well as from a study of ASD subjects and controls, revealed several novel and promising ASD candidate genes with strong statistical support.
url http://europepmc.org/articles/PMC3744441?pdf=render
work_keys_str_mv AT xinhe integratedmodelofdenovoandinheritedgeneticvariantsyieldsgreaterpowertoidentifyriskgenes
AT stephanjsanders integratedmodelofdenovoandinheritedgeneticvariantsyieldsgreaterpowertoidentifyriskgenes
AT liliu integratedmodelofdenovoandinheritedgeneticvariantsyieldsgreaterpowertoidentifyriskgenes
AT silviaderubeis integratedmodelofdenovoandinheritedgeneticvariantsyieldsgreaterpowertoidentifyriskgenes
AT elainetlim integratedmodelofdenovoandinheritedgeneticvariantsyieldsgreaterpowertoidentifyriskgenes
AT jamesssutcliffe integratedmodelofdenovoandinheritedgeneticvariantsyieldsgreaterpowertoidentifyriskgenes
AT gerarddschellenberg integratedmodelofdenovoandinheritedgeneticvariantsyieldsgreaterpowertoidentifyriskgenes
AT richardagibbs integratedmodelofdenovoandinheritedgeneticvariantsyieldsgreaterpowertoidentifyriskgenes
AT markjdaly integratedmodelofdenovoandinheritedgeneticvariantsyieldsgreaterpowertoidentifyriskgenes
AT josephdbuxbaum integratedmodelofdenovoandinheritedgeneticvariantsyieldsgreaterpowertoidentifyriskgenes
AT matthewwstate integratedmodelofdenovoandinheritedgeneticvariantsyieldsgreaterpowertoidentifyriskgenes
AT berniedevlin integratedmodelofdenovoandinheritedgeneticvariantsyieldsgreaterpowertoidentifyriskgenes
AT kathrynroeder integratedmodelofdenovoandinheritedgeneticvariantsyieldsgreaterpowertoidentifyriskgenes
_version_ 1726006505511059456