pCADD: SNV prioritisation in Sus scrofa
Abstract Background In animal breeding, identification of causative genetic variants is of major importance and high economical value. Usually, the number of candidate variants exceeds the number of variants that can be validated. One way of prioritizing probable candidates is by evaluating their po...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | deu |
Published: |
BMC
2020-02-01
|
Series: | Genetics Selection Evolution |
Online Access: | https://doi.org/10.1186/s12711-020-0528-9 |
id |
doaj-69a7c63b694b4f8ba04c4e9109410dc8 |
---|---|
record_format |
Article |
spelling |
doaj-69a7c63b694b4f8ba04c4e9109410dc82021-02-07T12:20:56ZdeuBMCGenetics Selection Evolution1297-96862020-02-0152111510.1186/s12711-020-0528-9pCADD: SNV prioritisation in Sus scrofaChristian Groß0Martijn Derks1Hendrik-Jan Megens2Mirte Bosse3Martien A. M. Groenen4Marcel Reinders5Dick de Ridder6Delft Bioinformatics Lab, University of Technology DelftAnimal Breeding and Genomics, Wageningen University & ResearchAnimal Breeding and Genomics, Wageningen University & ResearchAnimal Breeding and Genomics, Wageningen University & ResearchAnimal Breeding and Genomics, Wageningen University & ResearchDelft Bioinformatics Lab, University of Technology DelftBioinformatics Group, Wageningen University & ResearchAbstract Background In animal breeding, identification of causative genetic variants is of major importance and high economical value. Usually, the number of candidate variants exceeds the number of variants that can be validated. One way of prioritizing probable candidates is by evaluating their potential to have a deleterious effect, e.g. by predicting their consequence. Due to experimental difficulties to evaluate variants that do not cause an amino-acid substitution, other prioritization methods are needed. For human genomes, the prediction of deleterious genomic variants has taken a step forward with the introduction of the combined annotation dependent depletion (CADD) method. In theory, this approach can be applied to any species. Here, we present pCADD (p for pig), a model to score single nucleotide variants (SNVs) in pig genomes. Results To evaluate whether pCADD captures sites with biological meaning, we used transcripts from miRNAs and introns, sequences from genes that are specific for a particular tissue, and the different sites of codons, to test how well pCADD scores differentiate between functional and non-functional elements. Furthermore, we conducted an assessment of examples of non-coding and coding SNVs, which are causal for changes in phenotypes. Our results show that pCADD scores discriminate between functional and non-functional sequences and prioritize functional SNVs, and that pCADD is able to score the different positions in a codon relative to their redundancy. Taken together, these results indicate that based on pCADD scores, regions with biological relevance can be identified and distinguished according to their rate of adaptation. Conclusions We present the ability of pCADD to prioritize SNVs in the pig genome with respect to their putative deleteriousness, in accordance to the biological significance of the region in which they are located. We created scores for all possible SNVs, coding and non-coding, for all autosomes and the X chromosome of the pig reference sequence Sscrofa11.1, proposing a toolbox to prioritize variants and evaluate sequences to highlight new sites of interest to explain biological functions that are relevant to animal breeding.https://doi.org/10.1186/s12711-020-0528-9 |
collection |
DOAJ |
language |
deu |
format |
Article |
sources |
DOAJ |
author |
Christian Groß Martijn Derks Hendrik-Jan Megens Mirte Bosse Martien A. M. Groenen Marcel Reinders Dick de Ridder |
spellingShingle |
Christian Groß Martijn Derks Hendrik-Jan Megens Mirte Bosse Martien A. M. Groenen Marcel Reinders Dick de Ridder pCADD: SNV prioritisation in Sus scrofa Genetics Selection Evolution |
author_facet |
Christian Groß Martijn Derks Hendrik-Jan Megens Mirte Bosse Martien A. M. Groenen Marcel Reinders Dick de Ridder |
author_sort |
Christian Groß |
title |
pCADD: SNV prioritisation in Sus scrofa |
title_short |
pCADD: SNV prioritisation in Sus scrofa |
title_full |
pCADD: SNV prioritisation in Sus scrofa |
title_fullStr |
pCADD: SNV prioritisation in Sus scrofa |
title_full_unstemmed |
pCADD: SNV prioritisation in Sus scrofa |
title_sort |
pcadd: snv prioritisation in sus scrofa |
publisher |
BMC |
series |
Genetics Selection Evolution |
issn |
1297-9686 |
publishDate |
2020-02-01 |
description |
Abstract Background In animal breeding, identification of causative genetic variants is of major importance and high economical value. Usually, the number of candidate variants exceeds the number of variants that can be validated. One way of prioritizing probable candidates is by evaluating their potential to have a deleterious effect, e.g. by predicting their consequence. Due to experimental difficulties to evaluate variants that do not cause an amino-acid substitution, other prioritization methods are needed. For human genomes, the prediction of deleterious genomic variants has taken a step forward with the introduction of the combined annotation dependent depletion (CADD) method. In theory, this approach can be applied to any species. Here, we present pCADD (p for pig), a model to score single nucleotide variants (SNVs) in pig genomes. Results To evaluate whether pCADD captures sites with biological meaning, we used transcripts from miRNAs and introns, sequences from genes that are specific for a particular tissue, and the different sites of codons, to test how well pCADD scores differentiate between functional and non-functional elements. Furthermore, we conducted an assessment of examples of non-coding and coding SNVs, which are causal for changes in phenotypes. Our results show that pCADD scores discriminate between functional and non-functional sequences and prioritize functional SNVs, and that pCADD is able to score the different positions in a codon relative to their redundancy. Taken together, these results indicate that based on pCADD scores, regions with biological relevance can be identified and distinguished according to their rate of adaptation. Conclusions We present the ability of pCADD to prioritize SNVs in the pig genome with respect to their putative deleteriousness, in accordance to the biological significance of the region in which they are located. We created scores for all possible SNVs, coding and non-coding, for all autosomes and the X chromosome of the pig reference sequence Sscrofa11.1, proposing a toolbox to prioritize variants and evaluate sequences to highlight new sites of interest to explain biological functions that are relevant to animal breeding. |
url |
https://doi.org/10.1186/s12711-020-0528-9 |
work_keys_str_mv |
AT christiangroß pcaddsnvprioritisationinsusscrofa AT martijnderks pcaddsnvprioritisationinsusscrofa AT hendrikjanmegens pcaddsnvprioritisationinsusscrofa AT mirtebosse pcaddsnvprioritisationinsusscrofa AT martienamgroenen pcaddsnvprioritisationinsusscrofa AT marcelreinders pcaddsnvprioritisationinsusscrofa AT dickderidder pcaddsnvprioritisationinsusscrofa |
_version_ |
1724281338801946624 |