RaGOO: fast and accurate reference-guided scaffolding of draft genomes

Abstract We present RaGOO, a reference-guided contig ordering and orienting tool that leverages the speed and sensitivity of Minimap2 to accurately achieve chromosome-scale assemblies in minutes. After the pseudomolecules are constructed, RaGOO identifies structural variants, including those spannin...

Full description

Bibliographic Details
Main Authors: Michael Alonge, Sebastian Soyk, Srividya Ramakrishnan, Xingang Wang, Sara Goodwin, Fritz J. Sedlazeck, Zachary B. Lippman, Michael C. Schatz
Format: Article
Language:English
Published: BMC 2019-10-01
Series:Genome Biology
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13059-019-1829-6
id doaj-d6f1cece618840d8afe7799d28bbb714
record_format Article
spelling doaj-d6f1cece618840d8afe7799d28bbb7142020-11-25T03:58:31ZengBMCGenome Biology1474-760X2019-10-0120111710.1186/s13059-019-1829-6RaGOO: fast and accurate reference-guided scaffolding of draft genomesMichael Alonge0Sebastian Soyk1Srividya Ramakrishnan2Xingang Wang3Sara Goodwin4Fritz J. Sedlazeck5Zachary B. Lippman6Michael C. Schatz7Department of Computer Science, Johns Hopkins UniversityCold Spring Harbor LaboratoryDepartment of Computer Science, Johns Hopkins UniversityCold Spring Harbor LaboratoryCold Spring Harbor LaboratoryHuman Genome Sequencing Center, Baylor College of MedicineCold Spring Harbor LaboratoryDepartment of Computer Science, Johns Hopkins UniversityAbstract We present RaGOO, a reference-guided contig ordering and orienting tool that leverages the speed and sensitivity of Minimap2 to accurately achieve chromosome-scale assemblies in minutes. After the pseudomolecules are constructed, RaGOO identifies structural variants, including those spanning sequencing gaps. We show that RaGOO accurately orders and orients 3 de novo tomato genome assemblies, including the widely used M82 reference cultivar. We then demonstrate the scalability and utility of RaGOO with a pan-genome analysis of 103 Arabidopsis thaliana accessions by examining the structural variants detected in the newly assembled pseudomolecules. RaGOO is available open source at https://github.com/malonge/RaGOO.http://link.springer.com/article/10.1186/s13059-019-1829-6PseudomoleculeReference-guidedGenome assemblyScaffoldingGenome alignmentLong-read sequencing
collection DOAJ
language English
format Article
sources DOAJ
author Michael Alonge
Sebastian Soyk
Srividya Ramakrishnan
Xingang Wang
Sara Goodwin
Fritz J. Sedlazeck
Zachary B. Lippman
Michael C. Schatz
spellingShingle Michael Alonge
Sebastian Soyk
Srividya Ramakrishnan
Xingang Wang
Sara Goodwin
Fritz J. Sedlazeck
Zachary B. Lippman
Michael C. Schatz
RaGOO: fast and accurate reference-guided scaffolding of draft genomes
Genome Biology
Pseudomolecule
Reference-guided
Genome assembly
Scaffolding
Genome alignment
Long-read sequencing
author_facet Michael Alonge
Sebastian Soyk
Srividya Ramakrishnan
Xingang Wang
Sara Goodwin
Fritz J. Sedlazeck
Zachary B. Lippman
Michael C. Schatz
author_sort Michael Alonge
title RaGOO: fast and accurate reference-guided scaffolding of draft genomes
title_short RaGOO: fast and accurate reference-guided scaffolding of draft genomes
title_full RaGOO: fast and accurate reference-guided scaffolding of draft genomes
title_fullStr RaGOO: fast and accurate reference-guided scaffolding of draft genomes
title_full_unstemmed RaGOO: fast and accurate reference-guided scaffolding of draft genomes
title_sort ragoo: fast and accurate reference-guided scaffolding of draft genomes
publisher BMC
series Genome Biology
issn 1474-760X
publishDate 2019-10-01
description Abstract We present RaGOO, a reference-guided contig ordering and orienting tool that leverages the speed and sensitivity of Minimap2 to accurately achieve chromosome-scale assemblies in minutes. After the pseudomolecules are constructed, RaGOO identifies structural variants, including those spanning sequencing gaps. We show that RaGOO accurately orders and orients 3 de novo tomato genome assemblies, including the widely used M82 reference cultivar. We then demonstrate the scalability and utility of RaGOO with a pan-genome analysis of 103 Arabidopsis thaliana accessions by examining the structural variants detected in the newly assembled pseudomolecules. RaGOO is available open source at https://github.com/malonge/RaGOO.
topic Pseudomolecule
Reference-guided
Genome assembly
Scaffolding
Genome alignment
Long-read sequencing
url http://link.springer.com/article/10.1186/s13059-019-1829-6
work_keys_str_mv AT michaelalonge ragoofastandaccuratereferenceguidedscaffoldingofdraftgenomes
AT sebastiansoyk ragoofastandaccuratereferenceguidedscaffoldingofdraftgenomes
AT srividyaramakrishnan ragoofastandaccuratereferenceguidedscaffoldingofdraftgenomes
AT xingangwang ragoofastandaccuratereferenceguidedscaffoldingofdraftgenomes
AT saragoodwin ragoofastandaccuratereferenceguidedscaffoldingofdraftgenomes
AT fritzjsedlazeck ragoofastandaccuratereferenceguidedscaffoldingofdraftgenomes
AT zacharyblippman ragoofastandaccuratereferenceguidedscaffoldingofdraftgenomes
AT michaelcschatz ragoofastandaccuratereferenceguidedscaffoldingofdraftgenomes
_version_ 1724456790517612544