Mining for single nucleotide polymorphisms in pig genome sequence data

<p>Abstract</p> <p>Background</p> <p>Single nucleotide polymorphisms (SNPs) are ideal genetic markers due to their high abundance and the highly automated way in which SNPs are detected and SNP assays are performed. The number of SNPs identified in the pig thus far is s...

Full description

Bibliographic Details
Main Authors: Kinders Sylvia M, Dibbits Bert, del Rosario Marisol, Kommadath Arun, Kollers Sonja, Kerstens Hindrik HD, Crooijmans Richard P, Groenen Martien AM
Format: Article
Language:English
Published: BMC 2009-01-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/10/4
id doaj-e900de2fe113441f8a1c18ef0a1193af
record_format Article
spelling doaj-e900de2fe113441f8a1c18ef0a1193af2020-11-24T21:15:21ZengBMCBMC Genomics1471-21642009-01-01101410.1186/1471-2164-10-4Mining for single nucleotide polymorphisms in pig genome sequence dataKinders Sylvia MDibbits Bertdel Rosario MarisolKommadath ArunKollers SonjaKerstens Hindrik HDCrooijmans Richard PGroenen Martien AM<p>Abstract</p> <p>Background</p> <p>Single nucleotide polymorphisms (SNPs) are ideal genetic markers due to their high abundance and the highly automated way in which SNPs are detected and SNP assays are performed. The number of SNPs identified in the pig thus far is still limited.</p> <p>Results</p> <p>A total of 4.8 million whole genome shotgun sequences obtained from the NCBI trace-repository with center name "SDJVP", and project name "Sino-Danish Pig Genome Project" were analysed for the presence of SNPs. Available BAC and BAC-end sequences and their naming and mapping information, all obtained from SangerInstitute FTP site, served as a rough assembly of a reference genome. In 1.2 Gb of pig genome sequence, we identified 98,151 SNPs in which one of the sequences in the alignment represented the polymorphism and 6,374 SNPs in which two sequences represent an identical polymorphism. To benchmark the SNP identification method, 163 SNPs, in which the polymorphism was represented twice in the sequence alignment, were selected and tested on a panel of three purebred boar lines and wild boar. Of these 163 in silico identified SNPs, 134 were shown to be polymorphic in our animal panel.</p> <p>Conclusion</p> <p>This SNP identification method, which mines for SNPs in publicly available porcine shotgun sequences repositories, provides thousands of high quality SNPs. Benchmarking in an animal panel showed that more than 80% of the predicted SNPs represented true genetic variation.</p> http://www.biomedcentral.com/1471-2164/10/4
collection DOAJ
language English
format Article
sources DOAJ
author Kinders Sylvia M
Dibbits Bert
del Rosario Marisol
Kommadath Arun
Kollers Sonja
Kerstens Hindrik HD
Crooijmans Richard P
Groenen Martien AM
spellingShingle Kinders Sylvia M
Dibbits Bert
del Rosario Marisol
Kommadath Arun
Kollers Sonja
Kerstens Hindrik HD
Crooijmans Richard P
Groenen Martien AM
Mining for single nucleotide polymorphisms in pig genome sequence data
BMC Genomics
author_facet Kinders Sylvia M
Dibbits Bert
del Rosario Marisol
Kommadath Arun
Kollers Sonja
Kerstens Hindrik HD
Crooijmans Richard P
Groenen Martien AM
author_sort Kinders Sylvia M
title Mining for single nucleotide polymorphisms in pig genome sequence data
title_short Mining for single nucleotide polymorphisms in pig genome sequence data
title_full Mining for single nucleotide polymorphisms in pig genome sequence data
title_fullStr Mining for single nucleotide polymorphisms in pig genome sequence data
title_full_unstemmed Mining for single nucleotide polymorphisms in pig genome sequence data
title_sort mining for single nucleotide polymorphisms in pig genome sequence data
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2009-01-01
description <p>Abstract</p> <p>Background</p> <p>Single nucleotide polymorphisms (SNPs) are ideal genetic markers due to their high abundance and the highly automated way in which SNPs are detected and SNP assays are performed. The number of SNPs identified in the pig thus far is still limited.</p> <p>Results</p> <p>A total of 4.8 million whole genome shotgun sequences obtained from the NCBI trace-repository with center name "SDJVP", and project name "Sino-Danish Pig Genome Project" were analysed for the presence of SNPs. Available BAC and BAC-end sequences and their naming and mapping information, all obtained from SangerInstitute FTP site, served as a rough assembly of a reference genome. In 1.2 Gb of pig genome sequence, we identified 98,151 SNPs in which one of the sequences in the alignment represented the polymorphism and 6,374 SNPs in which two sequences represent an identical polymorphism. To benchmark the SNP identification method, 163 SNPs, in which the polymorphism was represented twice in the sequence alignment, were selected and tested on a panel of three purebred boar lines and wild boar. Of these 163 in silico identified SNPs, 134 were shown to be polymorphic in our animal panel.</p> <p>Conclusion</p> <p>This SNP identification method, which mines for SNPs in publicly available porcine shotgun sequences repositories, provides thousands of high quality SNPs. Benchmarking in an animal panel showed that more than 80% of the predicted SNPs represented true genetic variation.</p>
url http://www.biomedcentral.com/1471-2164/10/4
work_keys_str_mv AT kinderssylviam miningforsinglenucleotidepolymorphismsinpiggenomesequencedata
AT dibbitsbert miningforsinglenucleotidepolymorphismsinpiggenomesequencedata
AT delrosariomarisol miningforsinglenucleotidepolymorphismsinpiggenomesequencedata
AT kommadatharun miningforsinglenucleotidepolymorphismsinpiggenomesequencedata
AT kollerssonja miningforsinglenucleotidepolymorphismsinpiggenomesequencedata
AT kerstenshindrikhd miningforsinglenucleotidepolymorphismsinpiggenomesequencedata
AT crooijmansrichardp miningforsinglenucleotidepolymorphismsinpiggenomesequencedata
AT groenenmartienam miningforsinglenucleotidepolymorphismsinpiggenomesequencedata
_version_ 1716745610290266112