Genome-wide mapping of quantitative trait loci in admixed populations using mixed linear model and Bayesian multiple regression analysis

Abstract Background Population stratification and cryptic relationships have been the main sources of excessive false-positives and false-negatives in population-based association studies. Many methods have been developed to model these confounding factors and minimize their impact on the results of...

Full description

Bibliographic Details
Main Authors:	Ali Toosi, Rohan L. Fernando, Jack C. M. Dekkers
Format:	Article
Language:	deu
Published:	BMC 2018-06-01
Series:	Genetics Selection Evolution
Online Access:	http://link.springer.com/article/10.1186/s12711-018-0402-1

id	doaj-a2fb2882a2b24f338821a650f88bf422
record_format	Article
spelling	doaj-a2fb2882a2b24f338821a650f88bf4222020-11-24T21:58:52ZdeuBMCGenetics Selection Evolution1297-96862018-06-0150111310.1186/s12711-018-0402-1Genome-wide mapping of quantitative trait loci in admixed populations using mixed linear model and Bayesian multiple regression analysisAli Toosi0Rohan L. Fernando1Jack C. M. Dekkers2Cobb-Vantress Inc.Department of Animal Science, Iowa State UniversityDepartment of Animal Science, Iowa State UniversityAbstract Background Population stratification and cryptic relationships have been the main sources of excessive false-positives and false-negatives in population-based association studies. Many methods have been developed to model these confounding factors and minimize their impact on the results of genome-wide association studies. In most of these methods, a two-stage approach is applied where: (1) methods are used to determine if there is a population structure in the sample dataset and (2) the effects of population structure are corrected either by modeling it or by running a separate analysis within each sub-population. The objective of this study was to evaluate the impact of population structure on the accuracy and power of genome-wide association studies using a Bayesian multiple regression method. Methods We conducted a genome-wide association study in a stochastically simulated admixed population. The genome was composed of six chromosomes, each with 1000 markers. Fifteen segregating quantitative trait loci contributed to the genetic variation of a quantitative trait with heritability of 0.30. The impact of genetic relationships and breed composition (BC) on three analysis methods were evaluated: single marker simple regression (SMR), single marker mixed linear model (MLM) and Bayesian multiple-regression analysis (BMR). Each method was fitted with and without BC. Accuracy, power, false-positive rate and the positive predictive value of each method were calculated and used for comparison. Results SMR and BMR, both without BC, were ranked as the worst and the best performing approaches, respectively. Our results showed that, while explicit modeling of genetic relationships and BC is essential for models SMR and MLM, BMR can disregard them and yet result in a higher power without compromising its false-positive rate. Conclusions This study showed that the Bayesian multiple-regression analysis is robust to population structure and to relationships among study subjects and performs better than a single marker mixed linear model approach.http://link.springer.com/article/10.1186/s12711-018-0402-1
collection	DOAJ
language	deu
format	Article
sources	DOAJ
author	Ali Toosi Rohan L. Fernando Jack C. M. Dekkers
spellingShingle	Ali Toosi Rohan L. Fernando Jack C. M. Dekkers Genome-wide mapping of quantitative trait loci in admixed populations using mixed linear model and Bayesian multiple regression analysis Genetics Selection Evolution
author_facet	Ali Toosi Rohan L. Fernando Jack C. M. Dekkers
author_sort	Ali Toosi
title	Genome-wide mapping of quantitative trait loci in admixed populations using mixed linear model and Bayesian multiple regression analysis
title_short	Genome-wide mapping of quantitative trait loci in admixed populations using mixed linear model and Bayesian multiple regression analysis
title_full	Genome-wide mapping of quantitative trait loci in admixed populations using mixed linear model and Bayesian multiple regression analysis
title_fullStr	Genome-wide mapping of quantitative trait loci in admixed populations using mixed linear model and Bayesian multiple regression analysis
title_full_unstemmed	Genome-wide mapping of quantitative trait loci in admixed populations using mixed linear model and Bayesian multiple regression analysis
title_sort	genome-wide mapping of quantitative trait loci in admixed populations using mixed linear model and bayesian multiple regression analysis
publisher	BMC
series	Genetics Selection Evolution
issn	1297-9686
publishDate	2018-06-01
description	Abstract Background Population stratification and cryptic relationships have been the main sources of excessive false-positives and false-negatives in population-based association studies. Many methods have been developed to model these confounding factors and minimize their impact on the results of genome-wide association studies. In most of these methods, a two-stage approach is applied where: (1) methods are used to determine if there is a population structure in the sample dataset and (2) the effects of population structure are corrected either by modeling it or by running a separate analysis within each sub-population. The objective of this study was to evaluate the impact of population structure on the accuracy and power of genome-wide association studies using a Bayesian multiple regression method. Methods We conducted a genome-wide association study in a stochastically simulated admixed population. The genome was composed of six chromosomes, each with 1000 markers. Fifteen segregating quantitative trait loci contributed to the genetic variation of a quantitative trait with heritability of 0.30. The impact of genetic relationships and breed composition (BC) on three analysis methods were evaluated: single marker simple regression (SMR), single marker mixed linear model (MLM) and Bayesian multiple-regression analysis (BMR). Each method was fitted with and without BC. Accuracy, power, false-positive rate and the positive predictive value of each method were calculated and used for comparison. Results SMR and BMR, both without BC, were ranked as the worst and the best performing approaches, respectively. Our results showed that, while explicit modeling of genetic relationships and BC is essential for models SMR and MLM, BMR can disregard them and yet result in a higher power without compromising its false-positive rate. Conclusions This study showed that the Bayesian multiple-regression analysis is robust to population structure and to relationships among study subjects and performs better than a single marker mixed linear model approach.
url	http://link.springer.com/article/10.1186/s12711-018-0402-1
work_keys_str_mv	AT alitoosi genomewidemappingofquantitativetraitlociinadmixedpopulationsusingmixedlinearmodelandbayesianmultipleregressionanalysis AT rohanlfernando genomewidemappingofquantitativetraitlociinadmixedpopulationsusingmixedlinearmodelandbayesianmultipleregressionanalysis AT jackcmdekkers genomewidemappingofquantitativetraitlociinadmixedpopulationsusingmixedlinearmodelandbayesianmultipleregressionanalysis
_version_	1725850381160808448

Genome-wide mapping of quantitative trait loci in admixed populations using mixed linear model and Bayesian multiple regression analysis

Similar Items