Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT
Abstract. Background: Haplotype inference based on unphased SNP markers is an important task in population genetics. Although there are different approaches to the inference of haplotypes in diploid species, the existing software is not suitable for infer...
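Main Authors: | Achenbach Ute, Diehl Svenja, Basekow Rico, Gyetvai Gabor, Neigenfind Jost, Gebhardt Christiane, Selbig Joachim, Kersten Birgit |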
---|---|
Format: | Article |
Language: | English |
Published: | BMC 2008-07-01 |
Series: | BMC Genomics |
Online Access: | http://www.biomedcentral.com/1471-2164/9/356 |
id |
doaj-f3587e5a41c74e8abf56d94ccd62ca09 |
---|---|
record_format |
Article |
spelling |
doaj-f3587e5a41c74e8abf56d94ccd62ca09 2020-11-25T01:01:42Z eng BMC BMC Genomics 1471-2164 2008-07-01 9 1 356 10.1186/1471-2164-9-356 Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT Achenbach Ute Diehl Svenja Basekow Rico Gyetvai Gabor Neigenfind Jost Gebhardt Christiane Selbig Joachim Kersten Birgit http://www.biomedcentral.com/1471-2164/9/356 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Achenbach Ute, Diehl Svenja, Basekow Rico, Gyetvai Gabor, Neigenfind Jost, Gebhardt Christiane, Selbig Joachim, Kersten Birgit |
title |
Haplotype inference from unphased SNP data in heterozygous polyploids based on SAT |
publisher |
BMC |
series |
BMC Genomics |
issn |
1471-2164 |
publishDate |
2008-07-01 |
description |
Abstract. Background: Haplotype inference based on unphased SNP markers is an important task in population genetics. Although there are different approaches to the inference of haplotypes in diploid species, the existing software is not suitable for inferring haplotypes from unphased SNP data in polyploid species, such as the cultivated potato (Solanum tuberosum). Potato species are tetraploid and highly heterozygous. Results: Here we present the software SATlotyper, which is able to handle polyploid and polyallelic data. SATlotyper uses the Boolean satisfiability problem to formulate Haplotype Inference by Pure Parsimony. The software excludes existing haplotype inferences, thus allowing for calculation of alternative inferences. As it is not known which of the multiple haplotype inferences are best supported by the given unphased data set, we use a bootstrapping procedure that allows for scoring of alternative inferences. Finally, by means of the bootstrapping scores, it is possible to optimise the phased genotypes belonging to a given haplotype inference. The program is evaluated with simulated and experimental SNP data generated for heterozygous tetraploid populations of potato. We show that, instead of taking the first haplotype inference reported by the program, we can significantly improve the quality of the final result by applying additional methods that include scoring of the alternative haplotype inferences and genotype optimisation. For a sub-population of nineteen individuals, the predicted results computed by SATlotyper were directly compared with results obtained by experimental haplotype inference via sequencing of cloned amplicons. Prediction and experiment gave similar results regarding the inferred haplotypes and phased genotypes. Conclusion: Our results suggest that Haplotype Inference by Pure Parsimony can be solved efficiently by the SAT approach, even for data sets of unphased SNPs from heterozygous polyploids. SATlotyper is freeware and is distributed as a Java JAR file. The software can be downloaded from the webpage of the GABI Primary Database at http://www.gabipd.org/projects/satlotyper/. The application of SATlotyper will provide haplotype information, which can be used in haplotype association mapping studies of polyploid plants. |
url |
http://www.biomedcentral.com/1471-2164/9/356 |