Dashing: fast and accurate genomic distances with HyperLogLog

Abstract Dashing is a fast and accurate software tool for estimating similarities of genomes or sequencing datasets. It uses the HyperLogLog sketch together with cardinality estimation methods that are specialized for set unions and intersections. Dashing summarizes genomes more rapidly than previou...

Full description

Bibliographic Details
Main Authors: Daniel N. Baker, Ben Langmead
Format: Article
Language:English
Published: BMC 2019-12-01
Series:Genome Biology
Subjects:
Online Access:https://doi.org/10.1186/s13059-019-1875-0