Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study

Sequence data are expected to increase the reliability of genomic prediction by containing causative mutations directly, especially in cases where low linkage disequilibrium between markers and causative mutations limits prediction reliability, such as across-breed prediction in dairy cattle. In pra...

Full description

Bibliographic Details
Main Authors: Irene van den Berg, Didier Boichard, Bernt Guldbrandtsen, Mogens S. Lund
Format: Article
Language:English
Published: Oxford University Press 2016-08-01
Series:G3: Genes, Genomes, Genetics
Subjects:
Online Access:http://g3journal.org/lookup/doi/10.1534/g3.116.027730
id doaj-d811bab4110d4c8487ab4c82d7b8f81b
record_format Article
spelling doaj-d811bab4110d4c8487ab4c82d7b8f81b2021-07-02T04:16:46ZengOxford University PressG3: Genes, Genomes, Genetics2160-18362016-08-01682553256110.1534/g3.116.02773029Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation StudyIrene van den BergDidier BoichardBernt GuldbrandtsenMogens S. LundSequence data are expected to increase the reliability of genomic prediction by containing causative mutations directly, especially in cases where low linkage disequilibrium between markers and causative mutations limits prediction reliability, such as across-breed prediction in dairy cattle. In practice, the causative mutations are unknown, and prediction with only variants in perfect linkage disequilibrium with the causative mutations is not realistic, leading to a reduced reliability compared to knowing the causative variants. Our objective was to use sequence data to investigate the potential benefits of sequence data for the prediction of genomic relationships, and consequently reliability of genomic breeding values. We used sequence data from five dairy cattle breeds, and a larger number of imputed sequences for two of the five breeds. We focused on the influence of linkage disequilibrium between markers and causative mutations, and assumed that a fraction of the causative mutations was shared across breeds and had the same effect across breeds. By comparing the loss in reliability of different scenarios, varying the distance between markers and causative mutations, using either all genome wide markers from commercial SNP chips, or only the markers closest to the causative mutations, we demonstrate the importance of using only variants very close to the causative mutations, especially for across-breed prediction. Rare variants improved prediction only if they were very close to rare causative mutations, and all causative mutations were rare. Our results show that sequence data can potentially improve genomic prediction, but careful selection of markers is essential.http://g3journal.org/lookup/doi/10.1534/g3.116.027730across-breed predictionsequence datagenomic relationshipslinkage disequilibriumgenomic selectionGenPredshared data resource
collection DOAJ
language English
format Article
sources DOAJ
author Irene van den Berg
Didier Boichard
Bernt Guldbrandtsen
Mogens S. Lund
spellingShingle Irene van den Berg
Didier Boichard
Bernt Guldbrandtsen
Mogens S. Lund
Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study
G3: Genes, Genomes, Genetics
across-breed prediction
sequence data
genomic relationships
linkage disequilibrium
genomic selection
GenPred
shared data resource
author_facet Irene van den Berg
Didier Boichard
Bernt Guldbrandtsen
Mogens S. Lund
author_sort Irene van den Berg
title Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study
title_short Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study
title_full Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study
title_fullStr Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study
title_full_unstemmed Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study
title_sort using sequence variants in linkage disequilibrium with causative mutations to improve across-breed prediction in dairy cattle: a simulation study
publisher Oxford University Press
series G3: Genes, Genomes, Genetics
issn 2160-1836
publishDate 2016-08-01
description Sequence data are expected to increase the reliability of genomic prediction by containing causative mutations directly, especially in cases where low linkage disequilibrium between markers and causative mutations limits prediction reliability, such as across-breed prediction in dairy cattle. In practice, the causative mutations are unknown, and prediction with only variants in perfect linkage disequilibrium with the causative mutations is not realistic, leading to a reduced reliability compared to knowing the causative variants. Our objective was to use sequence data to investigate the potential benefits of sequence data for the prediction of genomic relationships, and consequently reliability of genomic breeding values. We used sequence data from five dairy cattle breeds, and a larger number of imputed sequences for two of the five breeds. We focused on the influence of linkage disequilibrium between markers and causative mutations, and assumed that a fraction of the causative mutations was shared across breeds and had the same effect across breeds. By comparing the loss in reliability of different scenarios, varying the distance between markers and causative mutations, using either all genome wide markers from commercial SNP chips, or only the markers closest to the causative mutations, we demonstrate the importance of using only variants very close to the causative mutations, especially for across-breed prediction. Rare variants improved prediction only if they were very close to rare causative mutations, and all causative mutations were rare. Our results show that sequence data can potentially improve genomic prediction, but careful selection of markers is essential.
topic across-breed prediction
sequence data
genomic relationships
linkage disequilibrium
genomic selection
GenPred
shared data resource
url http://g3journal.org/lookup/doi/10.1534/g3.116.027730
work_keys_str_mv AT irenevandenberg usingsequencevariantsinlinkagedisequilibriumwithcausativemutationstoimproveacrossbreedpredictionindairycattleasimulationstudy
AT didierboichard usingsequencevariantsinlinkagedisequilibriumwithcausativemutationstoimproveacrossbreedpredictionindairycattleasimulationstudy
AT berntguldbrandtsen usingsequencevariantsinlinkagedisequilibriumwithcausativemutationstoimproveacrossbreedpredictionindairycattleasimulationstudy
AT mogensslund usingsequencevariantsinlinkagedisequilibriumwithcausativemutationstoimproveacrossbreedpredictionindairycattleasimulationstudy
_version_ 1721340514522890240