Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT

Abstract. Background: Haplotype inference based on unphased SNP markers is an important task in population genetics. Although there are different approaches to the inference of haplotypes in diploid species, the existing software is not suitable for infer...

Bibliographic Details
Main Authors: Achenbach Ute, Diehl Svenja, Basekow Rico, Gyetvai Gabor, Neigenfind Jost, Gebhardt Christiane, Selbig Joachim, Kersten Birgit
Format: Article
Language: English
Published: BMC 2008-07-01
Series: BMC Genomics
Online Access: http://www.biomedcentral.com/1471-2164/9/356
id doaj-f3587e5a41c74e8abf56d94ccd62ca09
record_format Article
spelling doaj-f3587e5a41c74e8abf56d94ccd62ca09
2020-11-25T01:01:42Z
eng
BMC
BMC Genomics
1471-2164
2008-07-01
9
1
356
10.1186/1471-2164-9-356
Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT
Achenbach Ute
Diehl Svenja
Basekow Rico
Gyetvai Gabor
Neigenfind Jost
Gebhardt Christiane
Selbig Joachim
Kersten Birgit
Abstract
Background: Haplotype inference based on unphased SNP markers is an important task in population genetics. Although there are different approaches to the inference of haplotypes in diploid species, the existing software is not suitable for inferring haplotypes from unphased SNP data in polyploid species, such as the cultivated potato (Solanum tuberosum). Potato species are tetraploid and highly heterozygous.
Results: Here we present the software SATlotyper which is able to handle polyploid and polyallelic data. SATlotyper uses the Boolean satisfiability problem to formulate Haplotype Inference by Pure Parsimony. The software excludes existing haplotype inferences, thus allowing for calculation of alternative inferences. As it is not known which of the multiple haplotype inferences are best supported by the given unphased data set, we use a bootstrapping procedure that allows for scoring of alternative inferences. Finally, by means of the bootstrapping scores, it is possible to optimise the phased genotypes belonging to a given haplotype inference. The program is evaluated with simulated and experimental SNP data generated for heterozygous tetraploid populations of potato. We show that, instead of taking the first haplotype inference reported by the program, we can significantly improve the quality of the final result by applying additional methods that include scoring of the alternative haplotype inferences and genotype optimisation. For a sub-population of nineteen individuals, the predicted results computed by SATlotyper were directly compared with results obtained by experimental haplotype inference via sequencing of cloned amplicons. Prediction and experiment gave similar results regarding the inferred haplotypes and phased genotypes.
Conclusion: Our results suggest that Haplotype Inference by Pure Parsimony can be solved efficiently by the SAT approach, even for data sets of unphased SNP from heterozygous polyploids. SATlotyper is freeware and is distributed as a Java JAR file. The software can be downloaded from the webpage of the GABI Primary Database at http://www.gabipd.org/projects/satlotyper/. The application of SATlotyper will provide haplotype information, which can be used in haplotype association mapping studies of polyploid plants.
http://www.biomedcentral.com/1471-2164/9/356
collection DOAJ
language English
format Article
sources DOAJ
author Achenbach Ute
Diehl Svenja
Basekow Rico
Gyetvai Gabor
Neigenfind Jost
Gebhardt Christiane
Selbig Joachim
Kersten Birgit
author_sort Achenbach Ute
title Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2008-07-01
description Abstract
Background: Haplotype inference based on unphased SNP markers is an important task in population genetics. Although there are different approaches to the inference of haplotypes in diploid species, the existing software is not suitable for inferring haplotypes from unphased SNP data in polyploid species, such as the cultivated potato (Solanum tuberosum). Potato species are tetraploid and highly heterozygous.
Results: Here we present the software SATlotyper which is able to handle polyploid and polyallelic data. SATlotyper uses the Boolean satisfiability problem to formulate Haplotype Inference by Pure Parsimony. The software excludes existing haplotype inferences, thus allowing for calculation of alternative inferences. As it is not known which of the multiple haplotype inferences are best supported by the given unphased data set, we use a bootstrapping procedure that allows for scoring of alternative inferences. Finally, by means of the bootstrapping scores, it is possible to optimise the phased genotypes belonging to a given haplotype inference. The program is evaluated with simulated and experimental SNP data generated for heterozygous tetraploid populations of potato. We show that, instead of taking the first haplotype inference reported by the program, we can significantly improve the quality of the final result by applying additional methods that include scoring of the alternative haplotype inferences and genotype optimisation. For a sub-population of nineteen individuals, the predicted results computed by SATlotyper were directly compared with results obtained by experimental haplotype inference via sequencing of cloned amplicons. Prediction and experiment gave similar results regarding the inferred haplotypes and phased genotypes.
Conclusion: Our results suggest that Haplotype Inference by Pure Parsimony can be solved efficiently by the SAT approach, even for data sets of unphased SNP from heterozygous polyploids. SATlotyper is freeware and is distributed as a Java JAR file. The software can be downloaded from the webpage of the GABI Primary Database at http://www.gabipd.org/projects/satlotyper/. The application of SATlotyper will provide haplotype information, which can be used in haplotype association mapping studies of polyploid plants.
url http://www.biomedcentral.com/1471-2164/9/356
_version_ 1725207941740494848
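
Illustrative note: the abstract describes Haplotype Inference by Pure Parsimony, that is, finding the smallest set of haplotypes that explains every unphased genotype. The sketch below may help make that objective concrete for a polyploid case. It is not SATlotyper's method: it swaps the SAT formulation for a plain brute-force search that is only practical for toy inputs. The function names (`explains`, `infer_parsimonious`), the genotype encoding (one string of alleles per SNP, with as many characters as the ploidy), and the example data are all assumptions made for illustration.

```python
from itertools import combinations, combinations_with_replacement, product


def explains(haps, genotype):
    """Return True if, at every SNP, the alleles carried by `haps`
    form the same multiset as the unphased genotype at that SNP."""
    return all(
        sorted(h[i] for h in haps) == sorted(site)
        for i, site in enumerate(genotype)
    )


def infer_parsimonious(genotypes, ploidy):
    """Brute-force pure parsimony (hypothetical helper, toy sizes only).

    genotypes: list of genotypes; each genotype is a list with one string
               per SNP, e.g. ["AAGG", "CCTT"] for a tetraploid at 2 SNPs.
    Returns (haplotype_set, phasing), where phasing[j] is a tuple of
    `ploidy` haplotypes explaining genotypes[j].
    """
    n_snps = len(genotypes[0])
    # Alleles observed at each SNP across all individuals.
    alleles = [
        sorted({a for g in genotypes for a in g[i]}) for i in range(n_snps)
    ]
    # Candidate haplotypes: every combination of observed alleles.
    candidates = ["".join(h) for h in product(*alleles)]

    # Try haplotype sets of increasing size; the first hit is minimal.
    for size in range(1, len(candidates) + 1):
        for hap_set in combinations(candidates, size):
            phasing = []
            for g in genotypes:
                # Search for `ploidy` haplotypes (repeats allowed) from
                # the current set that reproduce this genotype.
                match = next(
                    (p for p in combinations_with_replacement(hap_set, ploidy)
                     if explains(p, g)),
                    None,
                )
                if match is None:
                    break
                phasing.append(match)
            else:
                return hap_set, phasing
    return None


# Made-up tetraploid example with two SNPs and three individuals.
example_genotypes = [
    ["AAGG", "CCTT"],
    ["AAAA", "CCCC"],
    ["AGGG", "CTTT"],
]
example_result = infer_parsimonious(example_genotypes, ploidy=4)
print(example_result)
```

Several haplotype sets of the same minimal size can exist for a given data set, which is the situation the abstract addresses by excluding already reported inferences and scoring the alternatives with bootstrapping; this sketch simply returns the first minimal set it encounters.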