SeqAnt: A web service to rapidly identify and annotate DNA sequence variations

<p>Abstract</p> <p>Background</p> <p>The enormous throughput and low cost of second-generation sequencing platforms now allow research and clinical geneticists to routinely perform single experiments that identify tens of thousands to millions of variant sites. Existing...

Full description

Bibliographic Details
Main Authors: Patel Viren, Steinberg Karyn, Horner Vanessa L, Mondal Kajari, Athri Prashanth, Shetty Amol, Caspary Tamara, Cutler David J, Zwick Michael E
Format: Article
Language:English
Published: BMC 2010-09-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/11/471
Description
Summary:<p>Abstract</p> <p>Background</p> <p>The enormous throughput and low cost of second-generation sequencing platforms now allow research and clinical geneticists to routinely perform single experiments that identify tens of thousands to millions of variant sites. Existing methods to annotate variant sites using information from publicly available databases via web browsers are too slow to be useful for the large sequencing datasets being routinely generated by geneticists. Because sequence annotation of variant sites is required before functional characterization can proceed, the lack of a high-throughput pipeline to efficiently annotate variant sites can act as a significant bottleneck in genetics research.</p> <p>Results</p> <p>SeqAnt (<it>Seq</it>uence <it>An</it>notator) is an open source web service and software package that rapidly annotates DNA sequence variants and identifies recessive or compound heterozygous loci in human, mouse, fly, and worm genome sequencing experiments. Variants are characterized with respect to their functional type, frequency, and evolutionary conservation. Annotated variants can be viewed on a web browser, downloaded in a tab-delimited text file, or directly uploaded in a BED format to the UCSC genome browser. To demonstrate the speed of SeqAnt, we annotated a series of publicly available datasets that ranged in size from 37 to 3,439,107 variant sites. The total time to completely annotate these data completely ranged from 0.17 seconds to 28 minutes 49.8 seconds.</p> <p>Conclusion</p> <p>SeqAnt is an open source web service and software package that overcomes a critical bottleneck facing research and clinical geneticists using second-generation sequencing platforms. SeqAnt will prove especially useful for those investigators who lack dedicated bioinformatics personnel or infrastructure in their laboratories.</p>
ISSN:1471-2105