De Novo Assembly and Characterization of Sophora japonica Transcriptome Using RNA-seq
Sophora japonica Linn (Chinese Scholar Tree) is a shrub species belonging to the subfamily Faboideae of the pea family Fabaceae. In this study, RNA sequencing of S. japonica transcriptome was performed to produce large expression datasets for functional genomic analysis. Approximate 86.1 million hig...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2014-01-01
|
Series: | BioMed Research International |
Online Access: | http://dx.doi.org/10.1155/2014/750961 |
id |
doaj-2be5c745fdb243c5969363e539d842cc |
---|---|
record_format |
Article |
spelling |
doaj-2be5c745fdb243c5969363e539d842cc2020-11-25T00:01:27ZengHindawi LimitedBioMed Research International2314-61332314-61412014-01-01201410.1155/2014/750961750961De Novo Assembly and Characterization of Sophora japonica Transcriptome Using RNA-seqLiucun Zhu0Ying Zhang1Wenna Guo2Xin-Jian Xu3Qiang Wang4Institute of System Biology, Shanghai University, Shanghai 200444, ChinaYangzhou Breeding Biological Agriculture Technology Co. Ltd., Yangzhou 225200, ChinaInstitute of System Biology, Shanghai University, Shanghai 200444, ChinaDepartment of Mathematics, Shanghai University, Shanghai 200444, ChinaState Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing 210093, ChinaSophora japonica Linn (Chinese Scholar Tree) is a shrub species belonging to the subfamily Faboideae of the pea family Fabaceae. In this study, RNA sequencing of S. japonica transcriptome was performed to produce large expression datasets for functional genomic analysis. Approximate 86.1 million high-quality clean reads were generated and assembled de novo into 143010 unique transcripts and 57614 unigenes. The average length of unigenes was 901 bps with an N50 of 545 bps. Four public databases, including the NCBI nonredundant protein (NR), Swiss-Prot, Kyoto Encyclopedia of Genes and Genomes (KEGG), and the Cluster of Orthologous Groups (COG), were used to annotate unigenes through NCBI BLAST procedure. A total of 27541 of 57614 unigenes (47.8%) were annotated for gene descriptions, conserved protein domains, or gene ontology. Moreover, an interaction network of unigenes in S. japonica was predicted based on known protein-protein interactions of putative orthologs of well-studied plant genomes. The transcriptome data of S. japonica reported here represents first genome-scale investigation of gene expressions in Faboideae plants. We expect that our study will provide a useful resource for further studies on gene expression, genomics, functional genomics, and protein-protein interaction in S. japonica.http://dx.doi.org/10.1155/2014/750961 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Liucun Zhu Ying Zhang Wenna Guo Xin-Jian Xu Qiang Wang |
spellingShingle |
Liucun Zhu Ying Zhang Wenna Guo Xin-Jian Xu Qiang Wang De Novo Assembly and Characterization of Sophora japonica Transcriptome Using RNA-seq BioMed Research International |
author_facet |
Liucun Zhu Ying Zhang Wenna Guo Xin-Jian Xu Qiang Wang |
author_sort |
Liucun Zhu |
title |
De Novo Assembly and Characterization of Sophora japonica Transcriptome Using RNA-seq |
title_short |
De Novo Assembly and Characterization of Sophora japonica Transcriptome Using RNA-seq |
title_full |
De Novo Assembly and Characterization of Sophora japonica Transcriptome Using RNA-seq |
title_fullStr |
De Novo Assembly and Characterization of Sophora japonica Transcriptome Using RNA-seq |
title_full_unstemmed |
De Novo Assembly and Characterization of Sophora japonica Transcriptome Using RNA-seq |
title_sort |
de novo assembly and characterization of sophora japonica transcriptome using rna-seq |
publisher |
Hindawi Limited |
series |
BioMed Research International |
issn |
2314-6133 2314-6141 |
publishDate |
2014-01-01 |
description |
Sophora japonica Linn (Chinese Scholar Tree) is a shrub species belonging to the subfamily Faboideae of the pea family Fabaceae. In this study, RNA sequencing of S. japonica transcriptome was performed to produce large expression datasets for functional genomic analysis. Approximate 86.1 million high-quality clean reads were generated and assembled de novo into 143010 unique transcripts and 57614 unigenes. The average length of unigenes was 901 bps with an N50 of 545 bps. Four public databases, including the NCBI nonredundant protein (NR), Swiss-Prot, Kyoto Encyclopedia of Genes and Genomes (KEGG), and the Cluster of Orthologous Groups (COG), were used to annotate unigenes through NCBI BLAST procedure. A total of 27541 of 57614 unigenes (47.8%) were annotated for gene descriptions, conserved protein domains, or gene ontology. Moreover, an interaction network of unigenes in S. japonica was predicted based on known protein-protein interactions of putative orthologs of well-studied plant genomes. The transcriptome data of S. japonica reported here represents first genome-scale investigation of gene expressions in Faboideae plants. We expect that our study will provide a useful resource for further studies on gene expression, genomics, functional genomics, and protein-protein interaction in S. japonica. |
url |
http://dx.doi.org/10.1155/2014/750961 |
work_keys_str_mv |
AT liucunzhu denovoassemblyandcharacterizationofsophorajaponicatranscriptomeusingrnaseq AT yingzhang denovoassemblyandcharacterizationofsophorajaponicatranscriptomeusingrnaseq AT wennaguo denovoassemblyandcharacterizationofsophorajaponicatranscriptomeusingrnaseq AT xinjianxu denovoassemblyandcharacterizationofsophorajaponicatranscriptomeusingrnaseq AT qiangwang denovoassemblyandcharacterizationofsophorajaponicatranscriptomeusingrnaseq |
_version_ |
1725441915446362112 |