Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut.

The palms are a family of tropical origin and one of the main constituents of the ecosystems of these regions around the world. The two main species of palm represent different challenges: coconut (Cocos nucifera L.) is a source of multiple goods and services in tropical communities, while oil palm...

Full description

Bibliographic Details
Main Authors: Alix Armero, Luc Baudouin, Stéphanie Bocs, Dominique This
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC5363918?pdf=render
id doaj-484be2dae68146bbbcee0fd3de3788f4
record_format Article
spelling doaj-484be2dae68146bbbcee0fd3de3788f42020-11-24T21:35:48ZengPublic Library of Science (PLoS)PLoS ONE1932-62032017-01-01123e017330010.1371/journal.pone.0173300Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut.Alix ArmeroLuc BaudouinStéphanie BocsDominique ThisThe palms are a family of tropical origin and one of the main constituents of the ecosystems of these regions around the world. The two main species of palm represent different challenges: coconut (Cocos nucifera L.) is a source of multiple goods and services in tropical communities, while oil palm (Elaeis guineensis Jacq) is the main protagonist of the oil market. In this study, we present a workflow that exploits the comparative genomics between a target species (coconut) and a reference species (oil palm) to improve the transcriptomic data, providing a proteome useful to answer functional or evolutionary questions. This workflow reduces redundancy and fragmentation, two inherent problems of transcriptomic data, while preserving the functional representation of the target species. Our approach was validated in Arabidopsis thaliana using Arabidopsis lyrata and Capsella rubella as references species. This analysis showed the high sensitivity and specificity of our strategy, relatively independent of the reference proteome. The workflow increased the length of proteins products in A. thaliana by 13%, allowing, often, to recover 100% of the protein sequence length. In addition redundancy was reduced by a factor greater than 3. In coconut, the approach generated 29,366 proteins, 1,246 of these proteins deriving from new contigs obtained with the BRANCH software. The coconut proteome presented a functional profile similar to that observed in rice and an important number of metabolic pathways related to secondary metabolism. The new sequences found with BRANCH software were enriched in functions related to biotic stress. Our strategy can be used as a complementary step to de novo transcriptome assembly to get a representative proteome of a target species. The results of the current analysis are available on the website PalmComparomics (http://palm-comparomics.southgreen.fr/).http://europepmc.org/articles/PMC5363918?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Alix Armero
Luc Baudouin
Stéphanie Bocs
Dominique This
spellingShingle Alix Armero
Luc Baudouin
Stéphanie Bocs
Dominique This
Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut.
PLoS ONE
author_facet Alix Armero
Luc Baudouin
Stéphanie Bocs
Dominique This
author_sort Alix Armero
title Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut.
title_short Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut.
title_full Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut.
title_fullStr Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut.
title_full_unstemmed Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut.
title_sort improving transcriptome de novo assembly by using a reference genome of a related species: translational genomics from oil palm to coconut.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2017-01-01
description The palms are a family of tropical origin and one of the main constituents of the ecosystems of these regions around the world. The two main species of palm represent different challenges: coconut (Cocos nucifera L.) is a source of multiple goods and services in tropical communities, while oil palm (Elaeis guineensis Jacq) is the main protagonist of the oil market. In this study, we present a workflow that exploits the comparative genomics between a target species (coconut) and a reference species (oil palm) to improve the transcriptomic data, providing a proteome useful to answer functional or evolutionary questions. This workflow reduces redundancy and fragmentation, two inherent problems of transcriptomic data, while preserving the functional representation of the target species. Our approach was validated in Arabidopsis thaliana using Arabidopsis lyrata and Capsella rubella as references species. This analysis showed the high sensitivity and specificity of our strategy, relatively independent of the reference proteome. The workflow increased the length of proteins products in A. thaliana by 13%, allowing, often, to recover 100% of the protein sequence length. In addition redundancy was reduced by a factor greater than 3. In coconut, the approach generated 29,366 proteins, 1,246 of these proteins deriving from new contigs obtained with the BRANCH software. The coconut proteome presented a functional profile similar to that observed in rice and an important number of metabolic pathways related to secondary metabolism. The new sequences found with BRANCH software were enriched in functions related to biotic stress. Our strategy can be used as a complementary step to de novo transcriptome assembly to get a representative proteome of a target species. The results of the current analysis are available on the website PalmComparomics (http://palm-comparomics.southgreen.fr/).
url http://europepmc.org/articles/PMC5363918?pdf=render
work_keys_str_mv AT alixarmero improvingtranscriptomedenovoassemblybyusingareferencegenomeofarelatedspeciestranslationalgenomicsfromoilpalmtococonut
AT lucbaudouin improvingtranscriptomedenovoassemblybyusingareferencegenomeofarelatedspeciestranslationalgenomicsfromoilpalmtococonut
AT stephaniebocs improvingtranscriptomedenovoassemblybyusingareferencegenomeofarelatedspeciestranslationalgenomicsfromoilpalmtococonut
AT dominiquethis improvingtranscriptomedenovoassemblybyusingareferencegenomeofarelatedspeciestranslationalgenomicsfromoilpalmtococonut
_version_ 1725943911956873216