BAC-pool sequencing and analysis of large segments of A12 and D12 homoeologous chromosomes in upland cotton.

Although new and emerging next-generation sequencing (NGS) technologies have reduced sequencing costs significantly, much work remains to implement them for de novo sequencing of complex and highly repetitive genomes such as the tetraploid genome of Upland cotton (Gossypium hirsutum L.). Herein we r...

Full description

Bibliographic Details
Published in:PLoS ONE
Main Authors: Ramesh Buyyarapu, Ramesh V Kantety, John Z Yu, Zhanyou Xu, Russell J Kohel, Richard G Percy, Simone Macmil, Graham B Wiley, Bruce A Roe, Govind C Sharma
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Online Access:http://europepmc.org/articles/PMC3792896?pdf=render
_version_ 1852719224992038912
author Ramesh Buyyarapu
Ramesh V Kantety
John Z Yu
Zhanyou Xu
Russell J Kohel
Richard G Percy
Simone Macmil
Graham B Wiley
Bruce A Roe
Govind C Sharma
author_facet Ramesh Buyyarapu
Ramesh V Kantety
John Z Yu
Zhanyou Xu
Russell J Kohel
Richard G Percy
Simone Macmil
Graham B Wiley
Bruce A Roe
Govind C Sharma
author_sort Ramesh Buyyarapu
collection DOAJ
container_title PLoS ONE
description Although new and emerging next-generation sequencing (NGS) technologies have reduced sequencing costs significantly, much work remains to implement them for de novo sequencing of complex and highly repetitive genomes such as the tetraploid genome of Upland cotton (Gossypium hirsutum L.). Herein we report the results from implementing a novel, hybrid Sanger/454-based BAC-pool sequencing strategy using minimum tiling path (MTP) BACs from Ctg-3301 and Ctg-465, two large genomic segments in A12 and D12 homoeologous chromosomes (Ctg). To enable generation of longer contig sequences in assembly, we implemented a hybrid assembly method to process ~35x data from 454 technology and 2.8-3x data from Sanger method. Hybrid assemblies offered higher sequence coverage and better sequence assemblies. Homology studies revealed the presence of retrotransposon regions like Copia and Gypsy elements in these contigs and also helped in identifying new genomic SSRs. Unigenes were anchored to the sequences in Ctg-3301 and Ctg-465 to support the physical map. Gene density, gene structure and protein sequence information derived from protein prediction programs were used to obtain the functional annotation of these genes. Comparative analysis of both contigs with Arabidopsis genome exhibited synteny and microcollinearity with a conserved gene order in both genomes. This study provides insight about use of MTP-based BAC-pool sequencing approach for sequencing complex polyploid genomes with limited constraints in generating better sequence assemblies to build reference scaffold sequences. Combining the utilities of MTP-based BAC-pool sequencing with current longer and short read NGS technologies in multiplexed format would provide a new direction to cost-effectively and precisely sequence complex plant genomes.
format Article
id doaj-art-e08fbdbb8d0647bfb5a792dca545942f
institution Directory of Open Access Journals
issn 1932-6203
language English
publishDate 2013-01-01
publisher Public Library of Science (PLoS)
record_format Article
spelling doaj-art-e08fbdbb8d0647bfb5a792dca545942f2025-08-19T21:13:13ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-01810e7675710.1371/journal.pone.0076757BAC-pool sequencing and analysis of large segments of A12 and D12 homoeologous chromosomes in upland cotton.Ramesh BuyyarapuRamesh V KantetyJohn Z YuZhanyou XuRussell J KohelRichard G PercySimone MacmilGraham B WileyBruce A RoeGovind C SharmaAlthough new and emerging next-generation sequencing (NGS) technologies have reduced sequencing costs significantly, much work remains to implement them for de novo sequencing of complex and highly repetitive genomes such as the tetraploid genome of Upland cotton (Gossypium hirsutum L.). Herein we report the results from implementing a novel, hybrid Sanger/454-based BAC-pool sequencing strategy using minimum tiling path (MTP) BACs from Ctg-3301 and Ctg-465, two large genomic segments in A12 and D12 homoeologous chromosomes (Ctg). To enable generation of longer contig sequences in assembly, we implemented a hybrid assembly method to process ~35x data from 454 technology and 2.8-3x data from Sanger method. Hybrid assemblies offered higher sequence coverage and better sequence assemblies. Homology studies revealed the presence of retrotransposon regions like Copia and Gypsy elements in these contigs and also helped in identifying new genomic SSRs. Unigenes were anchored to the sequences in Ctg-3301 and Ctg-465 to support the physical map. Gene density, gene structure and protein sequence information derived from protein prediction programs were used to obtain the functional annotation of these genes. Comparative analysis of both contigs with Arabidopsis genome exhibited synteny and microcollinearity with a conserved gene order in both genomes. This study provides insight about use of MTP-based BAC-pool sequencing approach for sequencing complex polyploid genomes with limited constraints in generating better sequence assemblies to build reference scaffold sequences. Combining the utilities of MTP-based BAC-pool sequencing with current longer and short read NGS technologies in multiplexed format would provide a new direction to cost-effectively and precisely sequence complex plant genomes.http://europepmc.org/articles/PMC3792896?pdf=render
spellingShingle Ramesh Buyyarapu
Ramesh V Kantety
John Z Yu
Zhanyou Xu
Russell J Kohel
Richard G Percy
Simone Macmil
Graham B Wiley
Bruce A Roe
Govind C Sharma
BAC-pool sequencing and analysis of large segments of A12 and D12 homoeologous chromosomes in upland cotton.
title BAC-pool sequencing and analysis of large segments of A12 and D12 homoeologous chromosomes in upland cotton.
title_full BAC-pool sequencing and analysis of large segments of A12 and D12 homoeologous chromosomes in upland cotton.
title_fullStr BAC-pool sequencing and analysis of large segments of A12 and D12 homoeologous chromosomes in upland cotton.
title_full_unstemmed BAC-pool sequencing and analysis of large segments of A12 and D12 homoeologous chromosomes in upland cotton.
title_short BAC-pool sequencing and analysis of large segments of A12 and D12 homoeologous chromosomes in upland cotton.
title_sort bac pool sequencing and analysis of large segments of a12 and d12 homoeologous chromosomes in upland cotton
url http://europepmc.org/articles/PMC3792896?pdf=render
work_keys_str_mv AT rameshbuyyarapu bacpoolsequencingandanalysisoflargesegmentsofa12andd12homoeologouschromosomesinuplandcotton
AT rameshvkantety bacpoolsequencingandanalysisoflargesegmentsofa12andd12homoeologouschromosomesinuplandcotton
AT johnzyu bacpoolsequencingandanalysisoflargesegmentsofa12andd12homoeologouschromosomesinuplandcotton
AT zhanyouxu bacpoolsequencingandanalysisoflargesegmentsofa12andd12homoeologouschromosomesinuplandcotton
AT russelljkohel bacpoolsequencingandanalysisoflargesegmentsofa12andd12homoeologouschromosomesinuplandcotton
AT richardgpercy bacpoolsequencingandanalysisoflargesegmentsofa12andd12homoeologouschromosomesinuplandcotton
AT simonemacmil bacpoolsequencingandanalysisoflargesegmentsofa12andd12homoeologouschromosomesinuplandcotton
AT grahambwiley bacpoolsequencingandanalysisoflargesegmentsofa12andd12homoeologouschromosomesinuplandcotton
AT brucearoe bacpoolsequencingandanalysisoflargesegmentsofa12andd12homoeologouschromosomesinuplandcotton
AT govindcsharma bacpoolsequencingandanalysisoflargesegmentsofa12andd12homoeologouschromosomesinuplandcotton