A Simulation-based Approach to Study Rare Variant Associations Across the Disease Spectrum

Although complete understanding of the mechanisms of rare genetic variants in disease continues to elude us, Next Generation Sequencing (NGS) has facilitated significant gene discoveries across the disease spectrum. However, the cost of NGS hinders its use for identifying rare variants in common di...

Full description

Bibliographic Details
Main Author: Banuelos, Rosa
Other Authors: Kimmel, Marek
Format: Others
Language:English
Published: 2013
Subjects:
Online Access:http://hdl.handle.net/1911/71920
id ndltd-RICE-oai-scholarship.rice.edu-1911-71920
record_format oai_dc
spelling ndltd-RICE-oai-scholarship.rice.edu-1911-719202013-09-18T03:28:45ZA Simulation-based Approach to Study Rare Variant Associations Across the Disease SpectrumBanuelos, Rosarare variantcommon diseasepowermendeliansamplingAlthough complete understanding of the mechanisms of rare genetic variants in disease continues to elude us, Next Generation Sequencing (NGS) has facilitated significant gene discoveries across the disease spectrum. However, the cost of NGS hinders its use for identifying rare variants in common diseases that require large samples. To circumvent the need for larger samples, designing efficient sampling studies is crucial in order to detect potential associations. This research therefore evaluates sampling designs for rare variant - quantitative trait association studies and assesses the effect on power that freely available public cohort data can have in the design. Performing simulations and evaluating common and unconventional sampling schemes results in several noteworthy findings. Specifically, the extreme-trait design is the most powerful design for analyzing quantitative traits. This research also shows that sampling more individuals from the extreme of clinical interest does not increase power. Variant filtering has served as a "proof-of-concept" approach for the discovery of disease-causing genes in Mendelian traits and formal statistical methods have been lacking in this area. However, combining variant filtering schemes with existing rare variant association tests is a practical alternative. Thus, this thesis also compares the robustness of six burden-based rare variant association tests for Mendelian traits after a variant filtering step in the presence of genetic heterogeneity and genotyping errors. This research shows that with low locus heterogeneity, these tests are powerful for testing association. With the exception of the weighted sum statistic (WSS), the remaining tests were very conservative in preserving the type I error when the number of affected and unaffected individuals was unequal. The WSS, on the other hand, had inflated type I error as the number of unaffected individuals increased. The framework presented can serve as a catalyst to improve sampling design and to develop robust statistical methods for association testing.Kimmel, Marek2013-09-16T14:30:07Z2013-09-16T14:30:10Z2013-09-16T14:30:07Z2013-09-16T14:30:10Z2013-052013-09-16May 20132013-09-16T14:30:10Zthesistextapplication/pdfhttp://hdl.handle.net/1911/71920123456789/ETD-2013-05-353eng
collection NDLTD
language English
format Others
sources NDLTD
topic rare variant
common disease
power
mendelian
sampling
spellingShingle rare variant
common disease
power
mendelian
sampling
Banuelos, Rosa
A Simulation-based Approach to Study Rare Variant Associations Across the Disease Spectrum
description Although complete understanding of the mechanisms of rare genetic variants in disease continues to elude us, Next Generation Sequencing (NGS) has facilitated significant gene discoveries across the disease spectrum. However, the cost of NGS hinders its use for identifying rare variants in common diseases that require large samples. To circumvent the need for larger samples, designing efficient sampling studies is crucial in order to detect potential associations. This research therefore evaluates sampling designs for rare variant - quantitative trait association studies and assesses the effect on power that freely available public cohort data can have in the design. Performing simulations and evaluating common and unconventional sampling schemes results in several noteworthy findings. Specifically, the extreme-trait design is the most powerful design for analyzing quantitative traits. This research also shows that sampling more individuals from the extreme of clinical interest does not increase power. Variant filtering has served as a "proof-of-concept" approach for the discovery of disease-causing genes in Mendelian traits and formal statistical methods have been lacking in this area. However, combining variant filtering schemes with existing rare variant association tests is a practical alternative. Thus, this thesis also compares the robustness of six burden-based rare variant association tests for Mendelian traits after a variant filtering step in the presence of genetic heterogeneity and genotyping errors. This research shows that with low locus heterogeneity, these tests are powerful for testing association. With the exception of the weighted sum statistic (WSS), the remaining tests were very conservative in preserving the type I error when the number of affected and unaffected individuals was unequal. The WSS, on the other hand, had inflated type I error as the number of unaffected individuals increased. The framework presented can serve as a catalyst to improve sampling design and to develop robust statistical methods for association testing.
author2 Kimmel, Marek
author_facet Kimmel, Marek
Banuelos, Rosa
author Banuelos, Rosa
author_sort Banuelos, Rosa
title A Simulation-based Approach to Study Rare Variant Associations Across the Disease Spectrum
title_short A Simulation-based Approach to Study Rare Variant Associations Across the Disease Spectrum
title_full A Simulation-based Approach to Study Rare Variant Associations Across the Disease Spectrum
title_fullStr A Simulation-based Approach to Study Rare Variant Associations Across the Disease Spectrum
title_full_unstemmed A Simulation-based Approach to Study Rare Variant Associations Across the Disease Spectrum
title_sort simulation-based approach to study rare variant associations across the disease spectrum
publishDate 2013
url http://hdl.handle.net/1911/71920
work_keys_str_mv AT banuelosrosa asimulationbasedapproachtostudyrarevariantassociationsacrossthediseasespectrum
AT banuelosrosa simulationbasedapproachtostudyrarevariantassociationsacrossthediseasespectrum
_version_ 1716597490636029952