Algorithm for DNA sequence assembly by quantum annealing

Background: The assembly task is an indispensable step in sequencing genomes of new organisms and studying structural genomic changes. In recent years, the dynamic development of next-generation sequencing (NGS) methods raises hopes for making whole-genome sequencing a fast and reliable tool used, f...

Full description

Bibliographic Details
Main Authors: Nałęcz-Charkiewicz, K. (Author), Nowak, R.M (Author)
Format: Article
Language:English
Published: BioMed Central Ltd 2022
Subjects:
DNA
TSP
VRP
Online Access:View Fulltext in Publisher
LEADER 03422nam a2200517Ia 4500
001 10-1186-s12859-022-04661-7
008 220425s2022 CNT 000 0 und d
020 |a 14712105 (ISSN) 
245 1 0 |a Algorithm for DNA sequence assembly by quantum annealing 
260 0 |b BioMed Central Ltd  |c 2022 
856 |z View Fulltext in Publisher  |u https://doi.org/10.1186/s12859-022-04661-7 
520 3 |a Background: The assembly task is an indispensable step in sequencing genomes of new organisms and studying structural genomic changes. In recent years, the dynamic development of next-generation sequencing (NGS) methods raises hopes for making whole-genome sequencing a fast and reliable tool used, for example, in medical diagnostics. However, this is hampered by the slowness and computational requirements of the current processing algorithms, which raises the need to develop more efficient algorithms. One possible approach, still little explored, is the use of quantum computing. Results: We present a proof of concept of de novo assembly algorithm, using the Genomic Signal Processing approach, detecting overlaps between DNA reads by calculating the Pearson correlation coefficient and formulating the assembly problem as an optimization task (Traveling Salesman Problem). Computations performed on a classic computer were compared with the results achieved by a hybrid method combining CPU and QPU calculations. For this purpose quantum annealer by D-Wave was used. The experiments were performed with artificially generated data and DNA reads coming from a simulator, with actual organism genomes used as input sequences. To our knowledge, this work is one of the few where actual sequences of organisms were used to study the de novo assembly task on quantum annealer. Conclusions: Proof of concept carried out by us showed that the use of quantum annealer (QA) for the de novo assembly task might be a promising alternative to the computations performed in the classical model. The current computing power of the available devices requires a hybrid approach (combining CPU and QPU computations). The next step may be developing a hybrid algorithm strictly dedicated to the de novo assembly task, using its specificity (e.g. the sparsity and bounded degree of the overlap-layout-consensus graph). © 2022, The Author(s). 
650 0 4 |a Annealing 
650 0 4 |a Assembly tasks 
650 0 4 |a Bioinformatics 
650 0 4 |a Biology 
650 0 4 |a Computational complexity 
650 0 4 |a Computational efficiency 
650 0 4 |a Correlation methods 
650 0 4 |a De novo assemblies 
650 0 4 |a De novo assembly 
650 0 4 |a Diagnosis 
650 0 4 |a DNA 
650 0 4 |a DNA sequence assembly 
650 0 4 |a DNA sequences 
650 0 4 |a Heuristic algorithms 
650 0 4 |a Hybrid algorithm 
650 0 4 |a Hybrid algorithms 
650 0 4 |a Proof of concept 
650 0 4 |a Quantum annealing 
650 0 4 |a Quantum annealing 
650 0 4 |a Routing algorithms 
650 0 4 |a Signal processing 
650 0 4 |a Structural genomics 
650 0 4 |a Traveling salesman problem 
650 0 4 |a Travelling salesman problem 
650 0 4 |a TSP 
650 0 4 |a TSP 
650 0 4 |a Vehicle routing 
650 0 4 |a Vehicle routing problem 
650 0 4 |a Vehicle Routing Problems 
650 0 4 |a VRP 
650 0 4 |a VRP 
700 1 |a Nałęcz-Charkiewicz, K.  |e author 
700 1 |a Nowak, R.M.  |e author 
773 |t BMC Bioinformatics