Exploring the genes of yerba mate (Ilex paraguariensis A. St.-Hil.) by NGS and de novo transcriptome assembly.

Yerba mate (Ilex paraguariensis A. St.-Hil.) is an important subtropical tree crop cultivated on 326,000 ha in Argentina, Brazil and Paraguay, with a total yield production of more than 1,000,000 t. Yerba mate presents a strong limitation regarding sequence information. The NCBI GenBank lacks an EST...

Full description

Bibliographic Details
Main Authors: Humberto J Debat, Mauro Grabiele, Patricia M Aguilera, Rosana E Bubillo, Mónica B Otegui, Daniel A Ducasse, Pedro D Zapata, Dardo A Marti
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4199719?pdf=render
id doaj-430a5a5fb4e14b3b981ec731114dc95c
record_format Article
spelling doaj-430a5a5fb4e14b3b981ec731114dc95c2020-11-24T21:44:33ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-01910e10983510.1371/journal.pone.0109835Exploring the genes of yerba mate (Ilex paraguariensis A. St.-Hil.) by NGS and de novo transcriptome assembly.Humberto J DebatMauro GrabielePatricia M AguileraRosana E BubilloMónica B OteguiDaniel A DucassePedro D ZapataDardo A MartiYerba mate (Ilex paraguariensis A. St.-Hil.) is an important subtropical tree crop cultivated on 326,000 ha in Argentina, Brazil and Paraguay, with a total yield production of more than 1,000,000 t. Yerba mate presents a strong limitation regarding sequence information. The NCBI GenBank lacks an EST database of yerba mate and depicts only 80 DNA sequences, mostly uncharacterized. In this scenario, in order to elucidate the yerba mate gene landscape by means of NGS, we explored and discovered a vast collection of I. paraguariensis transcripts. Total RNA from I. paraguariensis was sequenced by Illumina HiSeq-2000 obtaining 72,031,388 pair-end 100 bp sequences. High quality reads were de novo assembled into 44,907 transcripts encompassing 40 million bases with an estimated coverage of 180X. Multiple sequence analysis allowed us to predict that yerba mate contains ∼ 32,355 genes and 12,551 gene variants or isoforms. We identified and categorized members of more than 100 metabolic pathways. Overall, we have identified ∼ 1,000 putative transcription factors, genes involved in heat and oxidative stress, pathogen response, as well as disease resistance and hormone response. We have also identified, based in sequence homology searches, novel transcripts related to osmotic, drought, salinity and cold stress, senescence and early flowering. We have also pinpointed several members of the gene silencing pathway, and characterized the silencing effector Argonaute1. We predicted a diverse supply of putative microRNA precursors involved in developmental processes. We present here the first draft of the transcribed genomes of the yerba mate chloroplast and mitochondrion. The putative sequence and predicted structure of the caffeine synthase of yerba mate is presented. Moreover, we provide a collection of over 10,800 SSR accessible to the scientific community interested in yerba mate genetic improvement. This contribution broadly expands the limited knowledge of yerba mate genes, and is presented as the first genomic resource of this important crop.http://europepmc.org/articles/PMC4199719?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Humberto J Debat
Mauro Grabiele
Patricia M Aguilera
Rosana E Bubillo
Mónica B Otegui
Daniel A Ducasse
Pedro D Zapata
Dardo A Marti
spellingShingle Humberto J Debat
Mauro Grabiele
Patricia M Aguilera
Rosana E Bubillo
Mónica B Otegui
Daniel A Ducasse
Pedro D Zapata
Dardo A Marti
Exploring the genes of yerba mate (Ilex paraguariensis A. St.-Hil.) by NGS and de novo transcriptome assembly.
PLoS ONE
author_facet Humberto J Debat
Mauro Grabiele
Patricia M Aguilera
Rosana E Bubillo
Mónica B Otegui
Daniel A Ducasse
Pedro D Zapata
Dardo A Marti
author_sort Humberto J Debat
title Exploring the genes of yerba mate (Ilex paraguariensis A. St.-Hil.) by NGS and de novo transcriptome assembly.
title_short Exploring the genes of yerba mate (Ilex paraguariensis A. St.-Hil.) by NGS and de novo transcriptome assembly.
title_full Exploring the genes of yerba mate (Ilex paraguariensis A. St.-Hil.) by NGS and de novo transcriptome assembly.
title_fullStr Exploring the genes of yerba mate (Ilex paraguariensis A. St.-Hil.) by NGS and de novo transcriptome assembly.
title_full_unstemmed Exploring the genes of yerba mate (Ilex paraguariensis A. St.-Hil.) by NGS and de novo transcriptome assembly.
title_sort exploring the genes of yerba mate (ilex paraguariensis a. st.-hil.) by ngs and de novo transcriptome assembly.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2014-01-01
description Yerba mate (Ilex paraguariensis A. St.-Hil.) is an important subtropical tree crop cultivated on 326,000 ha in Argentina, Brazil and Paraguay, with a total yield production of more than 1,000,000 t. Yerba mate presents a strong limitation regarding sequence information. The NCBI GenBank lacks an EST database of yerba mate and depicts only 80 DNA sequences, mostly uncharacterized. In this scenario, in order to elucidate the yerba mate gene landscape by means of NGS, we explored and discovered a vast collection of I. paraguariensis transcripts. Total RNA from I. paraguariensis was sequenced by Illumina HiSeq-2000 obtaining 72,031,388 pair-end 100 bp sequences. High quality reads were de novo assembled into 44,907 transcripts encompassing 40 million bases with an estimated coverage of 180X. Multiple sequence analysis allowed us to predict that yerba mate contains ∼ 32,355 genes and 12,551 gene variants or isoforms. We identified and categorized members of more than 100 metabolic pathways. Overall, we have identified ∼ 1,000 putative transcription factors, genes involved in heat and oxidative stress, pathogen response, as well as disease resistance and hormone response. We have also identified, based in sequence homology searches, novel transcripts related to osmotic, drought, salinity and cold stress, senescence and early flowering. We have also pinpointed several members of the gene silencing pathway, and characterized the silencing effector Argonaute1. We predicted a diverse supply of putative microRNA precursors involved in developmental processes. We present here the first draft of the transcribed genomes of the yerba mate chloroplast and mitochondrion. The putative sequence and predicted structure of the caffeine synthase of yerba mate is presented. Moreover, we provide a collection of over 10,800 SSR accessible to the scientific community interested in yerba mate genetic improvement. This contribution broadly expands the limited knowledge of yerba mate genes, and is presented as the first genomic resource of this important crop.
url http://europepmc.org/articles/PMC4199719?pdf=render
work_keys_str_mv AT humbertojdebat exploringthegenesofyerbamateilexparaguariensisasthilbyngsanddenovotranscriptomeassembly
AT maurograbiele exploringthegenesofyerbamateilexparaguariensisasthilbyngsanddenovotranscriptomeassembly
AT patriciamaguilera exploringthegenesofyerbamateilexparaguariensisasthilbyngsanddenovotranscriptomeassembly
AT rosanaebubillo exploringthegenesofyerbamateilexparaguariensisasthilbyngsanddenovotranscriptomeassembly
AT monicabotegui exploringthegenesofyerbamateilexparaguariensisasthilbyngsanddenovotranscriptomeassembly
AT danieladucasse exploringthegenesofyerbamateilexparaguariensisasthilbyngsanddenovotranscriptomeassembly
AT pedrodzapata exploringthegenesofyerbamateilexparaguariensisasthilbyngsanddenovotranscriptomeassembly
AT dardoamarti exploringthegenesofyerbamateilexparaguariensisasthilbyngsanddenovotranscriptomeassembly
_version_ 1725909461638316032