The Pacific Biosciences de novo assembled genome dataset from a parthenogenetic New Zealand wild population of the longhorned tick, Haemaphysalis longicornis Neumann, 1901
The longhorned tick, Haemaphysalis longicornis, feeds upon a wide range of bird and mammalian hosts. Mammalian hosts include cattle, deer, sheep, goats, humans, and horses. This tick is known to transmit a number of pathogens causing tick-borne diseases, and was the vector of a recent serious outbre...
Main Authors: | , , , , , , , , , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2019-12-01
|
Series: | Data in Brief |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352340919309576 |
id |
doaj-e5112225802b48f2a79579debaef0c86 |
---|---|
record_format |
Article |
spelling |
doaj-e5112225802b48f2a79579debaef0c862020-11-24T21:56:33ZengElsevierData in Brief2352-34092019-12-0127The Pacific Biosciences de novo assembled genome dataset from a parthenogenetic New Zealand wild population of the longhorned tick, Haemaphysalis longicornis Neumann, 1901Felix D. Guerrero0Kylie G. Bendele1Noushin Ghaffari2Joseph Guhlin3Kristene R. Gedye4Kevin E. Lawrence5Peter K. Dearden6Thomas W.R. Harrop7Allen C.G. Heath8Yanni Lun9Richard P. Metz10Pete Teel11Adalberto Perez de Leon12Patrick J. Biggs13William E. Pomroy14Charles D. Johnson15Philip D. Blood16Stanley E. Bellgard17Daniel M. Tompkins18USDA-ARS Knipling-Bushland US Livestock Insects Research Laboratory, Kerrville, TX, USA; Corresponding author. USDA-ARS Knipling-Bushland US Livestock Insects Research Laboratory, 2700 Fredericksburg Rd., Kerrville, TX, USA.USDA-ARS Knipling-Bushland US Livestock Insects Research Laboratory, Kerrville, TX, USATexas A&M AgriLife, College Station, TX, USAGenomics Aotearoa and Biochemistry Department, University of Otago, Dunedin, New ZealandSchool of Veterinary Science, Massey University, Palmerston North, New ZealandSchool of Veterinary Science, Massey University, Palmerston North, New ZealandGenomics Aotearoa and Biochemistry Department, University of Otago, Dunedin, New ZealandGenomics Aotearoa and Biochemistry Department, University of Otago, Dunedin, New ZealandAgResearch Ltd., c/o Hopkirk Research Institute, Private Bag 11008, Palmerston North, 4442, New ZealandTexas A&M AgriLife, College Station, TX, USATexas A&M AgriLife, College Station, TX, USADepartment of Entomology, Texas A&M University, College Station, TX, USAUSDA-ARS Knipling-Bushland US Livestock Insects Research Laboratory, Kerrville, TX, USASchool of Veterinary Science, Massey University, Palmerston North, New Zealand; School of Fundamental Sciences, Massey University, Palmerston North, New ZealandSchool of Veterinary Science, Massey University, Palmerston North, New ZealandTexas A&M AgriLife, College Station, TX, USAPittsburgh Supercomputing Center, Carnegie Mellon University, Pittsburgh, PA, USAManaaki Whenua-Landcare Research, Auckland, New ZealandDepartment of Zoology, University of Otago, Dunedin, New ZealandThe longhorned tick, Haemaphysalis longicornis, feeds upon a wide range of bird and mammalian hosts. Mammalian hosts include cattle, deer, sheep, goats, humans, and horses. This tick is known to transmit a number of pathogens causing tick-borne diseases, and was the vector of a recent serious outbreak of oriental theileriosis in New Zealand. A New Zealand-USA consortium was established to sequence, assemble, and annotate the genome of this tick, using ticks obtained from New Zealand's North Island. In New Zealand, the tick is considered exclusively parthenogenetic and this trait was deemed useful for genome assembly. Very high molecular weight genomic DNA was sequenced on the Illumina HiSeq4000 and the long-read Pac Bio Sequel platforms. Twenty-eight SMRT cells produced a total of 21.3 million reads which were assembled with Canu on a reserved supercomputer node with access to 12 TB of RAM, running continuously for over 24 days. The final assembly dataset consisted of 34,211 contigs with an average contig length of 215,205 bp. The quality of the annotated genome was assessed by BUSCO analysis, an approach that provides quantitative measures for the quality of an assembled genome. Over 95% of the BUSCO gene set was found in the assembled genome. Only 48 of the 1066 BUSCO genes were missing and only 9 were present in a fragmented condition. The raw sequencing reads and the assembled contigs/scaffolds are archived at the National Center for Biotechnology Information. Keywords: Tick genome, Pac Bio de novo assembly, Genome annotation, Cattle tickhttp://www.sciencedirect.com/science/article/pii/S2352340919309576 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Felix D. Guerrero Kylie G. Bendele Noushin Ghaffari Joseph Guhlin Kristene R. Gedye Kevin E. Lawrence Peter K. Dearden Thomas W.R. Harrop Allen C.G. Heath Yanni Lun Richard P. Metz Pete Teel Adalberto Perez de Leon Patrick J. Biggs William E. Pomroy Charles D. Johnson Philip D. Blood Stanley E. Bellgard Daniel M. Tompkins |
spellingShingle |
Felix D. Guerrero Kylie G. Bendele Noushin Ghaffari Joseph Guhlin Kristene R. Gedye Kevin E. Lawrence Peter K. Dearden Thomas W.R. Harrop Allen C.G. Heath Yanni Lun Richard P. Metz Pete Teel Adalberto Perez de Leon Patrick J. Biggs William E. Pomroy Charles D. Johnson Philip D. Blood Stanley E. Bellgard Daniel M. Tompkins The Pacific Biosciences de novo assembled genome dataset from a parthenogenetic New Zealand wild population of the longhorned tick, Haemaphysalis longicornis Neumann, 1901 Data in Brief |
author_facet |
Felix D. Guerrero Kylie G. Bendele Noushin Ghaffari Joseph Guhlin Kristene R. Gedye Kevin E. Lawrence Peter K. Dearden Thomas W.R. Harrop Allen C.G. Heath Yanni Lun Richard P. Metz Pete Teel Adalberto Perez de Leon Patrick J. Biggs William E. Pomroy Charles D. Johnson Philip D. Blood Stanley E. Bellgard Daniel M. Tompkins |
author_sort |
Felix D. Guerrero |
title |
The Pacific Biosciences de novo assembled genome dataset from a parthenogenetic New Zealand wild population of the longhorned tick, Haemaphysalis longicornis Neumann, 1901 |
title_short |
The Pacific Biosciences de novo assembled genome dataset from a parthenogenetic New Zealand wild population of the longhorned tick, Haemaphysalis longicornis Neumann, 1901 |
title_full |
The Pacific Biosciences de novo assembled genome dataset from a parthenogenetic New Zealand wild population of the longhorned tick, Haemaphysalis longicornis Neumann, 1901 |
title_fullStr |
The Pacific Biosciences de novo assembled genome dataset from a parthenogenetic New Zealand wild population of the longhorned tick, Haemaphysalis longicornis Neumann, 1901 |
title_full_unstemmed |
The Pacific Biosciences de novo assembled genome dataset from a parthenogenetic New Zealand wild population of the longhorned tick, Haemaphysalis longicornis Neumann, 1901 |
title_sort |
pacific biosciences de novo assembled genome dataset from a parthenogenetic new zealand wild population of the longhorned tick, haemaphysalis longicornis neumann, 1901 |
publisher |
Elsevier |
series |
Data in Brief |
issn |
2352-3409 |
publishDate |
2019-12-01 |
description |
The longhorned tick, Haemaphysalis longicornis, feeds upon a wide range of bird and mammalian hosts. Mammalian hosts include cattle, deer, sheep, goats, humans, and horses. This tick is known to transmit a number of pathogens causing tick-borne diseases, and was the vector of a recent serious outbreak of oriental theileriosis in New Zealand. A New Zealand-USA consortium was established to sequence, assemble, and annotate the genome of this tick, using ticks obtained from New Zealand's North Island. In New Zealand, the tick is considered exclusively parthenogenetic and this trait was deemed useful for genome assembly. Very high molecular weight genomic DNA was sequenced on the Illumina HiSeq4000 and the long-read Pac Bio Sequel platforms. Twenty-eight SMRT cells produced a total of 21.3 million reads which were assembled with Canu on a reserved supercomputer node with access to 12 TB of RAM, running continuously for over 24 days. The final assembly dataset consisted of 34,211 contigs with an average contig length of 215,205 bp. The quality of the annotated genome was assessed by BUSCO analysis, an approach that provides quantitative measures for the quality of an assembled genome. Over 95% of the BUSCO gene set was found in the assembled genome. Only 48 of the 1066 BUSCO genes were missing and only 9 were present in a fragmented condition. The raw sequencing reads and the assembled contigs/scaffolds are archived at the National Center for Biotechnology Information. Keywords: Tick genome, Pac Bio de novo assembly, Genome annotation, Cattle tick |
url |
http://www.sciencedirect.com/science/article/pii/S2352340919309576 |
work_keys_str_mv |
AT felixdguerrero thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT kyliegbendele thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT noushinghaffari thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT josephguhlin thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT kristenergedye thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT kevinelawrence thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT peterkdearden thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT thomaswrharrop thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT allencgheath thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT yannilun thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT richardpmetz thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT peteteel thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT adalbertoperezdeleon thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT patrickjbiggs thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT williamepomroy thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT charlesdjohnson thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT philipdblood thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT stanleyebellgard thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT danielmtompkins thepacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT felixdguerrero pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT kyliegbendele pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT noushinghaffari pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT josephguhlin pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT kristenergedye pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT kevinelawrence pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT peterkdearden pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT thomaswrharrop pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT allencgheath pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT yannilun pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT richardpmetz pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT peteteel pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT adalbertoperezdeleon pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT patrickjbiggs pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT williamepomroy pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT charlesdjohnson pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT philipdblood pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT stanleyebellgard pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 AT danielmtompkins pacificbiosciencesdenovoassembledgenomedatasetfromaparthenogeneticnewzealandwildpopulationofthelonghornedtickhaemaphysalislongicornisneumann1901 |
_version_ |
1725858469755486208 |