pCADD: SNV prioritisation in Sus scrofa

Abstract Background In animal breeding, identification of causative genetic variants is of major importance and high economical value. Usually, the number of candidate variants exceeds the number of variants that can be validated. One way of prioritizing probable candidates is by evaluating their po...

Full description

Bibliographic Details
Main Authors: Christian Groß, Martijn Derks, Hendrik-Jan Megens, Mirte Bosse, Martien A. M. Groenen, Marcel Reinders, Dick de Ridder
Format: Article
Language:deu
Published: BMC 2020-02-01
Series:Genetics Selection Evolution
Online Access:https://doi.org/10.1186/s12711-020-0528-9
id doaj-69a7c63b694b4f8ba04c4e9109410dc8
record_format Article
spelling doaj-69a7c63b694b4f8ba04c4e9109410dc82021-02-07T12:20:56ZdeuBMCGenetics Selection Evolution1297-96862020-02-0152111510.1186/s12711-020-0528-9pCADD: SNV prioritisation in Sus scrofaChristian Groß0Martijn Derks1Hendrik-Jan Megens2Mirte Bosse3Martien A. M. Groenen4Marcel Reinders5Dick de Ridder6Delft Bioinformatics Lab, University of Technology DelftAnimal Breeding and Genomics, Wageningen University & ResearchAnimal Breeding and Genomics, Wageningen University & ResearchAnimal Breeding and Genomics, Wageningen University & ResearchAnimal Breeding and Genomics, Wageningen University & ResearchDelft Bioinformatics Lab, University of Technology DelftBioinformatics Group, Wageningen University & ResearchAbstract Background In animal breeding, identification of causative genetic variants is of major importance and high economical value. Usually, the number of candidate variants exceeds the number of variants that can be validated. One way of prioritizing probable candidates is by evaluating their potential to have a deleterious effect, e.g. by predicting their consequence. Due to experimental difficulties to evaluate variants that do not cause an amino-acid substitution, other prioritization methods are needed. For human genomes, the prediction of deleterious genomic variants has taken a step forward with the introduction of the combined annotation dependent depletion (CADD) method. In theory, this approach can be applied to any species. Here, we present pCADD (p for pig), a model to score single nucleotide variants (SNVs) in pig genomes. Results To evaluate whether pCADD captures sites with biological meaning, we used transcripts from miRNAs and introns, sequences from genes that are specific for a particular tissue, and the different sites of codons, to test how well pCADD scores differentiate between functional and non-functional elements. Furthermore, we conducted an assessment of examples of non-coding and coding SNVs, which are causal for changes in phenotypes. Our results show that pCADD scores discriminate between functional and non-functional sequences and prioritize functional SNVs, and that pCADD is able to score the different positions in a codon relative to their redundancy. Taken together, these results indicate that based on pCADD scores, regions with biological relevance can be identified and distinguished according to their rate of adaptation. Conclusions We present the ability of pCADD to prioritize SNVs in the pig genome with respect to their putative deleteriousness, in accordance to the biological significance of the region in which they are located. We created scores for all possible SNVs, coding and non-coding, for all autosomes and the X chromosome of the pig reference sequence Sscrofa11.1, proposing a toolbox to prioritize variants and evaluate sequences to highlight new sites of interest to explain biological functions that are relevant to animal breeding.https://doi.org/10.1186/s12711-020-0528-9
collection DOAJ
language deu
format Article
sources DOAJ
author Christian Groß
Martijn Derks
Hendrik-Jan Megens
Mirte Bosse
Martien A. M. Groenen
Marcel Reinders
Dick de Ridder
spellingShingle Christian Groß
Martijn Derks
Hendrik-Jan Megens
Mirte Bosse
Martien A. M. Groenen
Marcel Reinders
Dick de Ridder
pCADD: SNV prioritisation in Sus scrofa
Genetics Selection Evolution
author_facet Christian Groß
Martijn Derks
Hendrik-Jan Megens
Mirte Bosse
Martien A. M. Groenen
Marcel Reinders
Dick de Ridder
author_sort Christian Groß
title pCADD: SNV prioritisation in Sus scrofa
title_short pCADD: SNV prioritisation in Sus scrofa
title_full pCADD: SNV prioritisation in Sus scrofa
title_fullStr pCADD: SNV prioritisation in Sus scrofa
title_full_unstemmed pCADD: SNV prioritisation in Sus scrofa
title_sort pcadd: snv prioritisation in sus scrofa
publisher BMC
series Genetics Selection Evolution
issn 1297-9686
publishDate 2020-02-01
description Abstract Background In animal breeding, identification of causative genetic variants is of major importance and high economical value. Usually, the number of candidate variants exceeds the number of variants that can be validated. One way of prioritizing probable candidates is by evaluating their potential to have a deleterious effect, e.g. by predicting their consequence. Due to experimental difficulties to evaluate variants that do not cause an amino-acid substitution, other prioritization methods are needed. For human genomes, the prediction of deleterious genomic variants has taken a step forward with the introduction of the combined annotation dependent depletion (CADD) method. In theory, this approach can be applied to any species. Here, we present pCADD (p for pig), a model to score single nucleotide variants (SNVs) in pig genomes. Results To evaluate whether pCADD captures sites with biological meaning, we used transcripts from miRNAs and introns, sequences from genes that are specific for a particular tissue, and the different sites of codons, to test how well pCADD scores differentiate between functional and non-functional elements. Furthermore, we conducted an assessment of examples of non-coding and coding SNVs, which are causal for changes in phenotypes. Our results show that pCADD scores discriminate between functional and non-functional sequences and prioritize functional SNVs, and that pCADD is able to score the different positions in a codon relative to their redundancy. Taken together, these results indicate that based on pCADD scores, regions with biological relevance can be identified and distinguished according to their rate of adaptation. Conclusions We present the ability of pCADD to prioritize SNVs in the pig genome with respect to their putative deleteriousness, in accordance to the biological significance of the region in which they are located. We created scores for all possible SNVs, coding and non-coding, for all autosomes and the X chromosome of the pig reference sequence Sscrofa11.1, proposing a toolbox to prioritize variants and evaluate sequences to highlight new sites of interest to explain biological functions that are relevant to animal breeding.
url https://doi.org/10.1186/s12711-020-0528-9
work_keys_str_mv AT christiangroß pcaddsnvprioritisationinsusscrofa
AT martijnderks pcaddsnvprioritisationinsusscrofa
AT hendrikjanmegens pcaddsnvprioritisationinsusscrofa
AT mirtebosse pcaddsnvprioritisationinsusscrofa
AT martienamgroenen pcaddsnvprioritisationinsusscrofa
AT marcelreinders pcaddsnvprioritisationinsusscrofa
AT dickderidder pcaddsnvprioritisationinsusscrofa
_version_ 1724281338801946624