De Novo Genome Sequence Assembly of Dwarf Coconut (Cocos nucifera L. ‘Catigan Green Dwarf’) Provides Insights into Genomic Variation Between Coconut Types and Related Palm Species

We report the first whole genome sequence (WGS) assembly and annotation of a dwarf coconut variety, ‘Catigan Green Dwarf’ (CATD). The genome sequence was generated using the PacBio SMRT sequencing platform at 15X coverage of the expected genome size of 2.15 Gbp, which was corrected with assembled 50...

Full description

Bibliographic Details
Main Authors: Darlon V. Lantican, Susan R. Strickler, Alma O. Canama, Roanne R. Gardoce, Lukas A. Mueller, Hayde F. Galvez
Format: Article
Language:English
Published: Oxford University Press 2019-08-01
Series:G3: Genes, Genomes, Genetics
Subjects:
Online Access:http://g3journal.org/lookup/doi/10.1534/g3.119.400215
id doaj-3a99e39ad4e94b7caa5b6cfd466b0296
record_format Article
spelling doaj-3a99e39ad4e94b7caa5b6cfd466b02962021-07-02T04:54:03ZengOxford University PressG3: Genes, Genomes, Genetics2160-18362019-08-01982377239310.1534/g3.119.4002151De Novo Genome Sequence Assembly of Dwarf Coconut (Cocos nucifera L. ‘Catigan Green Dwarf’) Provides Insights into Genomic Variation Between Coconut Types and Related Palm SpeciesDarlon V. LanticanSusan R. StricklerAlma O. CanamaRoanne R. GardoceLukas A. MuellerHayde F. GalvezWe report the first whole genome sequence (WGS) assembly and annotation of a dwarf coconut variety, ‘Catigan Green Dwarf’ (CATD). The genome sequence was generated using the PacBio SMRT sequencing platform at 15X coverage of the expected genome size of 2.15 Gbp, which was corrected with assembled 50X Illumina paired-end MiSeq reads of the same genome. The draft genome was improved through Chicago sequencing to generate a scaffold assembly that results in a total genome size of 2.1 Gbp consisting of 7,998 scaffolds with N50 of 570,487 bp. The final assembly covers around 97.6% of the estimated genome size of coconut ‘CATD’ based on homozygous k-mer peak analysis. A total of 34,958 high-confidence gene models were predicted and functionally associated to various economically important traits, such as pest/disease resistance, drought tolerance, coconut oil biosynthesis, and putative transcription factors. The assembled genome was used to infer the evolutionary relationship within the palm family based on genomic variations and synteny of coding gene sequences. Data show that at least three (3) rounds of whole genome duplication occurred and are commonly shared by these members of the Arecaceae family. A total of 7,139 unique SSR markers were designed to be used as a resource in marker-based breeding. In addition, we discovered 58,503 variants in coconut by aligning the Hainan Tall (HAT) WGS reads to the non-repetitive regions of the assembled CATD genome. The gene markers and genome-wide SSR markers established here will facilitate the development of varieties with resilience to climate change, resistance to pests and diseases, and improved oil yield and quality.http://g3journal.org/lookup/doi/10.1534/g3.119.400215Cocos nucifera L.dwarf coconutgenome assemblyIllumina Miseq SequencingPacBio SMRT sequencingDovetail Chicago sequencinghybrid assemblySSR and SNP markers
collection DOAJ
language English
format Article
sources DOAJ
author Darlon V. Lantican
Susan R. Strickler
Alma O. Canama
Roanne R. Gardoce
Lukas A. Mueller
Hayde F. Galvez
spellingShingle Darlon V. Lantican
Susan R. Strickler
Alma O. Canama
Roanne R. Gardoce
Lukas A. Mueller
Hayde F. Galvez
De Novo Genome Sequence Assembly of Dwarf Coconut (Cocos nucifera L. ‘Catigan Green Dwarf’) Provides Insights into Genomic Variation Between Coconut Types and Related Palm Species
G3: Genes, Genomes, Genetics
Cocos nucifera L.
dwarf coconut
genome assembly
Illumina Miseq Sequencing
PacBio SMRT sequencing
Dovetail Chicago sequencing
hybrid assembly
SSR and SNP markers
author_facet Darlon V. Lantican
Susan R. Strickler
Alma O. Canama
Roanne R. Gardoce
Lukas A. Mueller
Hayde F. Galvez
author_sort Darlon V. Lantican
title De Novo Genome Sequence Assembly of Dwarf Coconut (Cocos nucifera L. ‘Catigan Green Dwarf’) Provides Insights into Genomic Variation Between Coconut Types and Related Palm Species
title_short De Novo Genome Sequence Assembly of Dwarf Coconut (Cocos nucifera L. ‘Catigan Green Dwarf’) Provides Insights into Genomic Variation Between Coconut Types and Related Palm Species
title_full De Novo Genome Sequence Assembly of Dwarf Coconut (Cocos nucifera L. ‘Catigan Green Dwarf’) Provides Insights into Genomic Variation Between Coconut Types and Related Palm Species
title_fullStr De Novo Genome Sequence Assembly of Dwarf Coconut (Cocos nucifera L. ‘Catigan Green Dwarf’) Provides Insights into Genomic Variation Between Coconut Types and Related Palm Species
title_full_unstemmed De Novo Genome Sequence Assembly of Dwarf Coconut (Cocos nucifera L. ‘Catigan Green Dwarf’) Provides Insights into Genomic Variation Between Coconut Types and Related Palm Species
title_sort de novo genome sequence assembly of dwarf coconut (cocos nucifera l. ‘catigan green dwarf’) provides insights into genomic variation between coconut types and related palm species
publisher Oxford University Press
series G3: Genes, Genomes, Genetics
issn 2160-1836
publishDate 2019-08-01
description We report the first whole genome sequence (WGS) assembly and annotation of a dwarf coconut variety, ‘Catigan Green Dwarf’ (CATD). The genome sequence was generated using the PacBio SMRT sequencing platform at 15X coverage of the expected genome size of 2.15 Gbp, which was corrected with assembled 50X Illumina paired-end MiSeq reads of the same genome. The draft genome was improved through Chicago sequencing to generate a scaffold assembly that results in a total genome size of 2.1 Gbp consisting of 7,998 scaffolds with N50 of 570,487 bp. The final assembly covers around 97.6% of the estimated genome size of coconut ‘CATD’ based on homozygous k-mer peak analysis. A total of 34,958 high-confidence gene models were predicted and functionally associated to various economically important traits, such as pest/disease resistance, drought tolerance, coconut oil biosynthesis, and putative transcription factors. The assembled genome was used to infer the evolutionary relationship within the palm family based on genomic variations and synteny of coding gene sequences. Data show that at least three (3) rounds of whole genome duplication occurred and are commonly shared by these members of the Arecaceae family. A total of 7,139 unique SSR markers were designed to be used as a resource in marker-based breeding. In addition, we discovered 58,503 variants in coconut by aligning the Hainan Tall (HAT) WGS reads to the non-repetitive regions of the assembled CATD genome. The gene markers and genome-wide SSR markers established here will facilitate the development of varieties with resilience to climate change, resistance to pests and diseases, and improved oil yield and quality.
topic Cocos nucifera L.
dwarf coconut
genome assembly
Illumina Miseq Sequencing
PacBio SMRT sequencing
Dovetail Chicago sequencing
hybrid assembly
SSR and SNP markers
url http://g3journal.org/lookup/doi/10.1534/g3.119.400215
work_keys_str_mv AT darlonvlantican denovogenomesequenceassemblyofdwarfcoconutcocosnuciferalcatigangreendwarfprovidesinsightsintogenomicvariationbetweencoconuttypesandrelatedpalmspecies
AT susanrstrickler denovogenomesequenceassemblyofdwarfcoconutcocosnuciferalcatigangreendwarfprovidesinsightsintogenomicvariationbetweencoconuttypesandrelatedpalmspecies
AT almaocanama denovogenomesequenceassemblyofdwarfcoconutcocosnuciferalcatigangreendwarfprovidesinsightsintogenomicvariationbetweencoconuttypesandrelatedpalmspecies
AT roannergardoce denovogenomesequenceassemblyofdwarfcoconutcocosnuciferalcatigangreendwarfprovidesinsightsintogenomicvariationbetweencoconuttypesandrelatedpalmspecies
AT lukasamueller denovogenomesequenceassemblyofdwarfcoconutcocosnuciferalcatigangreendwarfprovidesinsightsintogenomicvariationbetweencoconuttypesandrelatedpalmspecies
AT haydefgalvez denovogenomesequenceassemblyofdwarfcoconutcocosnuciferalcatigangreendwarfprovidesinsightsintogenomicvariationbetweencoconuttypesandrelatedpalmspecies
_version_ 1721339316979892224