Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study
Sequence data are expected to increase the reliability of genomic prediction by containing causative mutations directly, especially in cases where low linkage disequilibrium between markers and causative mutations limits prediction reliability, such as across-breed prediction in dairy cattle. In pra...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Oxford University Press
2016-08-01
|
Series: | G3: Genes, Genomes, Genetics |
Subjects: | |
Online Access: | http://g3journal.org/lookup/doi/10.1534/g3.116.027730 |
id |
doaj-d811bab4110d4c8487ab4c82d7b8f81b |
---|---|
record_format |
Article |
spelling |
doaj-d811bab4110d4c8487ab4c82d7b8f81b2021-07-02T04:16:46ZengOxford University PressG3: Genes, Genomes, Genetics2160-18362016-08-01682553256110.1534/g3.116.02773029Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation StudyIrene van den BergDidier BoichardBernt GuldbrandtsenMogens S. LundSequence data are expected to increase the reliability of genomic prediction by containing causative mutations directly, especially in cases where low linkage disequilibrium between markers and causative mutations limits prediction reliability, such as across-breed prediction in dairy cattle. In practice, the causative mutations are unknown, and prediction with only variants in perfect linkage disequilibrium with the causative mutations is not realistic, leading to a reduced reliability compared to knowing the causative variants. Our objective was to use sequence data to investigate the potential benefits of sequence data for the prediction of genomic relationships, and consequently reliability of genomic breeding values. We used sequence data from five dairy cattle breeds, and a larger number of imputed sequences for two of the five breeds. We focused on the influence of linkage disequilibrium between markers and causative mutations, and assumed that a fraction of the causative mutations was shared across breeds and had the same effect across breeds. By comparing the loss in reliability of different scenarios, varying the distance between markers and causative mutations, using either all genome wide markers from commercial SNP chips, or only the markers closest to the causative mutations, we demonstrate the importance of using only variants very close to the causative mutations, especially for across-breed prediction. Rare variants improved prediction only if they were very close to rare causative mutations, and all causative mutations were rare. Our results show that sequence data can potentially improve genomic prediction, but careful selection of markers is essential.http://g3journal.org/lookup/doi/10.1534/g3.116.027730across-breed predictionsequence datagenomic relationshipslinkage disequilibriumgenomic selectionGenPredshared data resource |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Irene van den Berg Didier Boichard Bernt Guldbrandtsen Mogens S. Lund |
spellingShingle |
Irene van den Berg Didier Boichard Bernt Guldbrandtsen Mogens S. Lund Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study G3: Genes, Genomes, Genetics across-breed prediction sequence data genomic relationships linkage disequilibrium genomic selection GenPred shared data resource |
author_facet |
Irene van den Berg Didier Boichard Bernt Guldbrandtsen Mogens S. Lund |
author_sort |
Irene van den Berg |
title |
Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study |
title_short |
Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study |
title_full |
Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study |
title_fullStr |
Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study |
title_full_unstemmed |
Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study |
title_sort |
using sequence variants in linkage disequilibrium with causative mutations to improve across-breed prediction in dairy cattle: a simulation study |
publisher |
Oxford University Press |
series |
G3: Genes, Genomes, Genetics |
issn |
2160-1836 |
publishDate |
2016-08-01 |
description |
Sequence data are expected to increase the reliability of genomic prediction by containing causative mutations directly, especially in cases where low linkage disequilibrium between markers and causative mutations limits prediction reliability, such as across-breed prediction in dairy cattle. In practice, the causative mutations are unknown, and prediction with only variants in perfect linkage disequilibrium with the causative mutations is not realistic, leading to a reduced reliability compared to knowing the causative variants. Our objective was to use sequence data to investigate the potential benefits of sequence data for the prediction of genomic relationships, and consequently reliability of genomic breeding values. We used sequence data from five dairy cattle breeds, and a larger number of imputed sequences for two of the five breeds. We focused on the influence of linkage disequilibrium between markers and causative mutations, and assumed that a fraction of the causative mutations was shared across breeds and had the same effect across breeds. By comparing the loss in reliability of different scenarios, varying the distance between markers and causative mutations, using either all genome wide markers from commercial SNP chips, or only the markers closest to the causative mutations, we demonstrate the importance of using only variants very close to the causative mutations, especially for across-breed prediction. Rare variants improved prediction only if they were very close to rare causative mutations, and all causative mutations were rare. Our results show that sequence data can potentially improve genomic prediction, but careful selection of markers is essential. |
topic |
across-breed prediction sequence data genomic relationships linkage disequilibrium genomic selection GenPred shared data resource |
url |
http://g3journal.org/lookup/doi/10.1534/g3.116.027730 |
work_keys_str_mv |
AT irenevandenberg usingsequencevariantsinlinkagedisequilibriumwithcausativemutationstoimproveacrossbreedpredictionindairycattleasimulationstudy AT didierboichard usingsequencevariantsinlinkagedisequilibriumwithcausativemutationstoimproveacrossbreedpredictionindairycattleasimulationstudy AT berntguldbrandtsen usingsequencevariantsinlinkagedisequilibriumwithcausativemutationstoimproveacrossbreedpredictionindairycattleasimulationstudy AT mogensslund usingsequencevariantsinlinkagedisequilibriumwithcausativemutationstoimproveacrossbreedpredictionindairycattleasimulationstudy |
_version_ |
1721340514522890240 |