Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study

Sequence data are expected to increase the reliability of genomic prediction by containing causative mutations directly, especially in cases where low linkage disequilibrium between markers and causative mutations limits prediction reliability, such as across-breed prediction in dairy cattle. In pra...

Full description

Bibliographic Details
Main Authors:	Irene van den Berg, Didier Boichard, Bernt Guldbrandtsen, Mogens S. Lund
Format:	Article
Language:	English
Published:	Oxford University Press 2016-08-01
Series:	G3: Genes, Genomes, Genetics
Subjects:	across-breed prediction sequence data genomic relationships linkage disequilibrium genomic selection GenPred shared data resource
Online Access:	http://g3journal.org/lookup/doi/10.1534/g3.116.027730

id	doaj-d811bab4110d4c8487ab4c82d7b8f81b
record_format	Article
spelling	doaj-d811bab4110d4c8487ab4c82d7b8f81b2021-07-02T04:16:46ZengOxford University PressG3: Genes, Genomes, Genetics2160-18362016-08-01682553256110.1534/g3.116.02773029Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation StudyIrene van den BergDidier BoichardBernt GuldbrandtsenMogens S. LundSequence data are expected to increase the reliability of genomic prediction by containing causative mutations directly, especially in cases where low linkage disequilibrium between markers and causative mutations limits prediction reliability, such as across-breed prediction in dairy cattle. In practice, the causative mutations are unknown, and prediction with only variants in perfect linkage disequilibrium with the causative mutations is not realistic, leading to a reduced reliability compared to knowing the causative variants. Our objective was to use sequence data to investigate the potential benefits of sequence data for the prediction of genomic relationships, and consequently reliability of genomic breeding values. We used sequence data from five dairy cattle breeds, and a larger number of imputed sequences for two of the five breeds. We focused on the influence of linkage disequilibrium between markers and causative mutations, and assumed that a fraction of the causative mutations was shared across breeds and had the same effect across breeds. By comparing the loss in reliability of different scenarios, varying the distance between markers and causative mutations, using either all genome wide markers from commercial SNP chips, or only the markers closest to the causative mutations, we demonstrate the importance of using only variants very close to the causative mutations, especially for across-breed prediction. Rare variants improved prediction only if they were very close to rare causative mutations, and all causative mutations were rare. Our results show that sequence data can potentially improve genomic prediction, but careful selection of markers is essential.http://g3journal.org/lookup/doi/10.1534/g3.116.027730across-breed predictionsequence datagenomic relationshipslinkage disequilibriumgenomic selectionGenPredshared data resource
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Irene van den Berg Didier Boichard Bernt Guldbrandtsen Mogens S. Lund
spellingShingle	Irene van den Berg Didier Boichard Bernt Guldbrandtsen Mogens S. Lund Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study G3: Genes, Genomes, Genetics across-breed prediction sequence data genomic relationships linkage disequilibrium genomic selection GenPred shared data resource
author_facet	Irene van den Berg Didier Boichard Bernt Guldbrandtsen Mogens S. Lund
author_sort	Irene van den Berg
title	Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study
title_short	Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study
title_full	Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study
title_fullStr	Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study
title_full_unstemmed	Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study
title_sort	using sequence variants in linkage disequilibrium with causative mutations to improve across-breed prediction in dairy cattle: a simulation study
publisher	Oxford University Press
series	G3: Genes, Genomes, Genetics
issn	2160-1836
publishDate	2016-08-01
description	Sequence data are expected to increase the reliability of genomic prediction by containing causative mutations directly, especially in cases where low linkage disequilibrium between markers and causative mutations limits prediction reliability, such as across-breed prediction in dairy cattle. In practice, the causative mutations are unknown, and prediction with only variants in perfect linkage disequilibrium with the causative mutations is not realistic, leading to a reduced reliability compared to knowing the causative variants. Our objective was to use sequence data to investigate the potential benefits of sequence data for the prediction of genomic relationships, and consequently reliability of genomic breeding values. We used sequence data from five dairy cattle breeds, and a larger number of imputed sequences for two of the five breeds. We focused on the influence of linkage disequilibrium between markers and causative mutations, and assumed that a fraction of the causative mutations was shared across breeds and had the same effect across breeds. By comparing the loss in reliability of different scenarios, varying the distance between markers and causative mutations, using either all genome wide markers from commercial SNP chips, or only the markers closest to the causative mutations, we demonstrate the importance of using only variants very close to the causative mutations, especially for across-breed prediction. Rare variants improved prediction only if they were very close to rare causative mutations, and all causative mutations were rare. Our results show that sequence data can potentially improve genomic prediction, but careful selection of markers is essential.
topic	across-breed prediction sequence data genomic relationships linkage disequilibrium genomic selection GenPred shared data resource
url	http://g3journal.org/lookup/doi/10.1534/g3.116.027730
work_keys_str_mv	AT irenevandenberg usingsequencevariantsinlinkagedisequilibriumwithcausativemutationstoimproveacrossbreedpredictionindairycattleasimulationstudy AT didierboichard usingsequencevariantsinlinkagedisequilibriumwithcausativemutationstoimproveacrossbreedpredictionindairycattleasimulationstudy AT berntguldbrandtsen usingsequencevariantsinlinkagedisequilibriumwithcausativemutationstoimproveacrossbreedpredictionindairycattleasimulationstudy AT mogensslund usingsequencevariantsinlinkagedisequilibriumwithcausativemutationstoimproveacrossbreedpredictionindairycattleasimulationstudy
_version_	1721340514522890240

Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study

Similar Items