The Use of Targeted Marker Subsets to Account for Population Structure and Relatedness in Genome-Wide Association Studies of Maize (Zea mays L.)

A typical plant genome-wide association study (GWAS) uses a mixed linear model (MLM) that includes a trait as the response variable, a marker as an explanatory variable, and fixed and random effect covariates accounting for population structure and relatedness. Although effective in controlling for...

Full description

Bibliographic Details
Main Authors: Angela H. Chen, Alexander E. Lipka
Format: Article
Language:English
Published: Oxford University Press 2016-08-01
Series:G3: Genes, Genomes, Genetics
Subjects:
Online Access:http://g3journal.org/lookup/doi/10.1534/g3.116.029090
id doaj-49bd584f1f2748edb1e084ba29a69cbf
record_format Article
spelling doaj-49bd584f1f2748edb1e084ba29a69cbf2021-07-02T01:47:23ZengOxford University PressG3: Genes, Genomes, Genetics2160-18362016-08-01682365237410.1534/g3.116.02909012The Use of Targeted Marker Subsets to Account for Population Structure and Relatedness in Genome-Wide Association Studies of Maize (Zea mays L.)Angela H. ChenAlexander E. LipkaA typical plant genome-wide association study (GWAS) uses a mixed linear model (MLM) that includes a trait as the response variable, a marker as an explanatory variable, and fixed and random effect covariates accounting for population structure and relatedness. Although effective in controlling for false positive signals, this model typically fails to detect signals that are correlated with population structure or are located in high linkage disequilibrium (LD) genomic regions. This result likely arises from each tested marker being used to estimate population structure and relatedness. Previous work has demonstrated that it is possible to increase the power of the MLM by estimating relatedness (i.e., kinship) with markers that are not located on the chromosome where the tested marker resides. To quantify the amount of additional significant signals one can expect using this so-called K_chr model, we reanalyzed Mendelian, polygenic, and complex traits in two maize (Zea mays L.) diversity panels that have been previously assessed using the traditional MLM. We demonstrated that the K_chr model could find more significant associations, especially in high LD regions. This finding is underscored by our identification of novel genomic signals proximal to the tocochromanol biosynthetic pathway gene ZmVTE1 that are associated with a ratio of tocotrienols. We conclude that the K_chr model can detect more intricate sources of allelic variation underlying agronomically important traits, and should therefore become more widely used for GWAS. To facilitate the implementation of the K_chr model, we provide code written in the R programming language.http://g3journal.org/lookup/doi/10.1534/g3.116.029090GWASmixed modellinkage disequilibriummaizemarker subsets
collection DOAJ
language English
format Article
sources DOAJ
author Angela H. Chen
Alexander E. Lipka
spellingShingle Angela H. Chen
Alexander E. Lipka
The Use of Targeted Marker Subsets to Account for Population Structure and Relatedness in Genome-Wide Association Studies of Maize (Zea mays L.)
G3: Genes, Genomes, Genetics
GWAS
mixed model
linkage disequilibrium
maize
marker subsets
author_facet Angela H. Chen
Alexander E. Lipka
author_sort Angela H. Chen
title The Use of Targeted Marker Subsets to Account for Population Structure and Relatedness in Genome-Wide Association Studies of Maize (Zea mays L.)
title_short The Use of Targeted Marker Subsets to Account for Population Structure and Relatedness in Genome-Wide Association Studies of Maize (Zea mays L.)
title_full The Use of Targeted Marker Subsets to Account for Population Structure and Relatedness in Genome-Wide Association Studies of Maize (Zea mays L.)
title_fullStr The Use of Targeted Marker Subsets to Account for Population Structure and Relatedness in Genome-Wide Association Studies of Maize (Zea mays L.)
title_full_unstemmed The Use of Targeted Marker Subsets to Account for Population Structure and Relatedness in Genome-Wide Association Studies of Maize (Zea mays L.)
title_sort use of targeted marker subsets to account for population structure and relatedness in genome-wide association studies of maize (zea mays l.)
publisher Oxford University Press
series G3: Genes, Genomes, Genetics
issn 2160-1836
publishDate 2016-08-01
description A typical plant genome-wide association study (GWAS) uses a mixed linear model (MLM) that includes a trait as the response variable, a marker as an explanatory variable, and fixed and random effect covariates accounting for population structure and relatedness. Although effective in controlling for false positive signals, this model typically fails to detect signals that are correlated with population structure or are located in high linkage disequilibrium (LD) genomic regions. This result likely arises from each tested marker being used to estimate population structure and relatedness. Previous work has demonstrated that it is possible to increase the power of the MLM by estimating relatedness (i.e., kinship) with markers that are not located on the chromosome where the tested marker resides. To quantify the amount of additional significant signals one can expect using this so-called K_chr model, we reanalyzed Mendelian, polygenic, and complex traits in two maize (Zea mays L.) diversity panels that have been previously assessed using the traditional MLM. We demonstrated that the K_chr model could find more significant associations, especially in high LD regions. This finding is underscored by our identification of novel genomic signals proximal to the tocochromanol biosynthetic pathway gene ZmVTE1 that are associated with a ratio of tocotrienols. We conclude that the K_chr model can detect more intricate sources of allelic variation underlying agronomically important traits, and should therefore become more widely used for GWAS. To facilitate the implementation of the K_chr model, we provide code written in the R programming language.
topic GWAS
mixed model
linkage disequilibrium
maize
marker subsets
url http://g3journal.org/lookup/doi/10.1534/g3.116.029090
work_keys_str_mv AT angelahchen theuseoftargetedmarkersubsetstoaccountforpopulationstructureandrelatednessingenomewideassociationstudiesofmaizezeamaysl
AT alexanderelipka theuseoftargetedmarkersubsetstoaccountforpopulationstructureandrelatednessingenomewideassociationstudiesofmaizezeamaysl
AT angelahchen useoftargetedmarkersubsetstoaccountforpopulationstructureandrelatednessingenomewideassociationstudiesofmaizezeamaysl
AT alexanderelipka useoftargetedmarkersubsetstoaccountforpopulationstructureandrelatednessingenomewideassociationstudiesofmaizezeamaysl
_version_ 1721344388033937408