Understanding the potential bias of variance components estimators when using genomic models

Abstract Background Genomic models that link phenotypes to dense genotype information are increasingly being used for infering variance parameters in genetics studies. The variance parameters of these models can be inferred using restricted maximum likelihood, which produces consistent, asymptotical...

Full description

Bibliographic Details
Main Authors: Beatriz C. D. Cuyabano, A. Christian Sørensen, Peter Sørensen
Format: Article
Language:deu
Published: BMC 2018-08-01
Series:Genetics Selection Evolution
Online Access:http://link.springer.com/article/10.1186/s12711-018-0411-0
id doaj-252e2ce3eb754ff18afc325fdc9c80f6
record_format Article
spelling doaj-252e2ce3eb754ff18afc325fdc9c80f62020-11-25T01:39:02ZdeuBMCGenetics Selection Evolution1297-96862018-08-0150112110.1186/s12711-018-0411-0Understanding the potential bias of variance components estimators when using genomic modelsBeatriz C. D. Cuyabano0A. Christian Sørensen1Peter Sørensen2Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus UniversityCenter for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus UniversityCenter for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus UniversityAbstract Background Genomic models that link phenotypes to dense genotype information are increasingly being used for infering variance parameters in genetics studies. The variance parameters of these models can be inferred using restricted maximum likelihood, which produces consistent, asymptotically normal estimates of variance components under the true model. These properties are not guaranteed to hold when the covariance structure of the data specified by the genomic model differs substantially from the covariance structure specified by the true model, and in this case, the likelihood of the model is said to be misspecified. If the covariance structure specified by the genomic model provides a poor description of that specified by the true model, the likelihood misspecification may lead to incorrect inferences. Results This work provides a theoretical analysis of the genomic models based on splitting the misspecified likelihood equations into components, which isolate those that contribute to incorrect inferences, providing an informative measure, defined as $$\varvec{\kappa }$$ κ , to compare the covariance structure of the data specified by the genomic and the true models. This comparison of the covariance structures allows us to determine whether or not bias in the variance components estimates is expected to occur. Conclusions The theory presented can be used to provide an explanation for the success of a number of recently reported approaches that are suggested to remove sources of bias of heritability estimates. Furthermore, however complex is the quantification of this bias, we can determine that, in genomic models that consider a single genomic component to estimate heritability (assuming SNP effects are all i.i.d.), the bias of the estimator tends to be downward, when it exists.http://link.springer.com/article/10.1186/s12711-018-0411-0
collection DOAJ
language deu
format Article
sources DOAJ
author Beatriz C. D. Cuyabano
A. Christian Sørensen
Peter Sørensen
spellingShingle Beatriz C. D. Cuyabano
A. Christian Sørensen
Peter Sørensen
Understanding the potential bias of variance components estimators when using genomic models
Genetics Selection Evolution
author_facet Beatriz C. D. Cuyabano
A. Christian Sørensen
Peter Sørensen
author_sort Beatriz C. D. Cuyabano
title Understanding the potential bias of variance components estimators when using genomic models
title_short Understanding the potential bias of variance components estimators when using genomic models
title_full Understanding the potential bias of variance components estimators when using genomic models
title_fullStr Understanding the potential bias of variance components estimators when using genomic models
title_full_unstemmed Understanding the potential bias of variance components estimators when using genomic models
title_sort understanding the potential bias of variance components estimators when using genomic models
publisher BMC
series Genetics Selection Evolution
issn 1297-9686
publishDate 2018-08-01
description Abstract Background Genomic models that link phenotypes to dense genotype information are increasingly being used for infering variance parameters in genetics studies. The variance parameters of these models can be inferred using restricted maximum likelihood, which produces consistent, asymptotically normal estimates of variance components under the true model. These properties are not guaranteed to hold when the covariance structure of the data specified by the genomic model differs substantially from the covariance structure specified by the true model, and in this case, the likelihood of the model is said to be misspecified. If the covariance structure specified by the genomic model provides a poor description of that specified by the true model, the likelihood misspecification may lead to incorrect inferences. Results This work provides a theoretical analysis of the genomic models based on splitting the misspecified likelihood equations into components, which isolate those that contribute to incorrect inferences, providing an informative measure, defined as $$\varvec{\kappa }$$ κ , to compare the covariance structure of the data specified by the genomic and the true models. This comparison of the covariance structures allows us to determine whether or not bias in the variance components estimates is expected to occur. Conclusions The theory presented can be used to provide an explanation for the success of a number of recently reported approaches that are suggested to remove sources of bias of heritability estimates. Furthermore, however complex is the quantification of this bias, we can determine that, in genomic models that consider a single genomic component to estimate heritability (assuming SNP effects are all i.i.d.), the bias of the estimator tends to be downward, when it exists.
url http://link.springer.com/article/10.1186/s12711-018-0411-0
work_keys_str_mv AT beatrizcdcuyabano understandingthepotentialbiasofvariancecomponentsestimatorswhenusinggenomicmodels
AT achristiansørensen understandingthepotentialbiasofvariancecomponentsestimatorswhenusinggenomicmodels
AT petersørensen understandingthepotentialbiasofvariancecomponentsestimatorswhenusinggenomicmodels
_version_ 1725050775851237376