Pan-genomic open reading frames: A potential supplement of single nucleotide polymorphisms in estimation of heritability and genomic prediction.

Pan-genomic open reading frames (ORFs) potentially carry protein-coding gene or coding variant information in a population. In this study, we suggest that pan-genomic ORFs are promising to be utilized in estimation of heritability and genomic prediction. A Saccharomyces cerevisiae dataset with whole...

Full description

Bibliographic Details
Main Authors: Zhengcao Li, Henner Simianer
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-08-01
Series:PLoS Genetics
Online Access:https://doi.org/10.1371/journal.pgen.1008995
id doaj-a74fae065a704036930a10369c61598e
record_format Article
spelling doaj-a74fae065a704036930a10369c61598e2021-04-21T13:53:34ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042020-08-01168e100899510.1371/journal.pgen.1008995Pan-genomic open reading frames: A potential supplement of single nucleotide polymorphisms in estimation of heritability and genomic prediction.Zhengcao LiHenner SimianerPan-genomic open reading frames (ORFs) potentially carry protein-coding gene or coding variant information in a population. In this study, we suggest that pan-genomic ORFs are promising to be utilized in estimation of heritability and genomic prediction. A Saccharomyces cerevisiae dataset with whole-genome SNPs, pan-genomic ORFs, and the copy numbers of those ORFs is used to test the effectiveness of ORF data as a predictor in three prediction models for 35 traits. Our results show that the ORF-based heritability can capture more genetic effects than SNP-based heritability for all traits. Compared to SNP-based genomic prediction (GBLUP), pan-genomic ORF-based genomic prediction (OBLUP) is distinctly more accurate for all traits, and the predictive abilities on average are more than doubled across all traits. For four traits, the copy number of ORF-based prediction(CBLUP) is more accurate than OBLUP. When using different numbers of isolates in training sets in ORF-based prediction, the predictive abilities for all traits increased as more isolates are added in the training sets, suggesting that with very large training sets the prediction accuracy will be in the range of the square root of the heritability. We conclude that pan-genomic ORFs have the potential to be a supplement of single nucleotide polymorphisms in estimation of heritability and genomic prediction.https://doi.org/10.1371/journal.pgen.1008995
collection DOAJ
language English
format Article
sources DOAJ
author Zhengcao Li
Henner Simianer
spellingShingle Zhengcao Li
Henner Simianer
Pan-genomic open reading frames: A potential supplement of single nucleotide polymorphisms in estimation of heritability and genomic prediction.
PLoS Genetics
author_facet Zhengcao Li
Henner Simianer
author_sort Zhengcao Li
title Pan-genomic open reading frames: A potential supplement of single nucleotide polymorphisms in estimation of heritability and genomic prediction.
title_short Pan-genomic open reading frames: A potential supplement of single nucleotide polymorphisms in estimation of heritability and genomic prediction.
title_full Pan-genomic open reading frames: A potential supplement of single nucleotide polymorphisms in estimation of heritability and genomic prediction.
title_fullStr Pan-genomic open reading frames: A potential supplement of single nucleotide polymorphisms in estimation of heritability and genomic prediction.
title_full_unstemmed Pan-genomic open reading frames: A potential supplement of single nucleotide polymorphisms in estimation of heritability and genomic prediction.
title_sort pan-genomic open reading frames: a potential supplement of single nucleotide polymorphisms in estimation of heritability and genomic prediction.
publisher Public Library of Science (PLoS)
series PLoS Genetics
issn 1553-7390
1553-7404
publishDate 2020-08-01
description Pan-genomic open reading frames (ORFs) potentially carry protein-coding gene or coding variant information in a population. In this study, we suggest that pan-genomic ORFs are promising to be utilized in estimation of heritability and genomic prediction. A Saccharomyces cerevisiae dataset with whole-genome SNPs, pan-genomic ORFs, and the copy numbers of those ORFs is used to test the effectiveness of ORF data as a predictor in three prediction models for 35 traits. Our results show that the ORF-based heritability can capture more genetic effects than SNP-based heritability for all traits. Compared to SNP-based genomic prediction (GBLUP), pan-genomic ORF-based genomic prediction (OBLUP) is distinctly more accurate for all traits, and the predictive abilities on average are more than doubled across all traits. For four traits, the copy number of ORF-based prediction(CBLUP) is more accurate than OBLUP. When using different numbers of isolates in training sets in ORF-based prediction, the predictive abilities for all traits increased as more isolates are added in the training sets, suggesting that with very large training sets the prediction accuracy will be in the range of the square root of the heritability. We conclude that pan-genomic ORFs have the potential to be a supplement of single nucleotide polymorphisms in estimation of heritability and genomic prediction.
url https://doi.org/10.1371/journal.pgen.1008995
work_keys_str_mv AT zhengcaoli pangenomicopenreadingframesapotentialsupplementofsinglenucleotidepolymorphismsinestimationofheritabilityandgenomicprediction
AT hennersimianer pangenomicopenreadingframesapotentialsupplementofsinglenucleotidepolymorphismsinestimationofheritabilityandgenomicprediction
_version_ 1714668564523253760