Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotation
Objective: There is a scarcity of transcriptome sequencing data available for the Leptopetalum biflorum, and numerous cyclotides remain undiscovered. It is urgent to establish a workflow based on de novo transcriptome assembly and make systematic prediction of cyclotides in Leptopetalum biflorum, to...
| Published in: | Journal of Holistic Integrative Pharmacy |
|---|---|
| Main Authors: | , , , , |
| Format: | Article |
| Language: | English |
| Published: |
KeAi Communications Co., Ltd.
2024-06-01
|
| Subjects: | |
| Online Access: | http://www.sciencedirect.com/science/article/pii/S2707368824000323 |
| _version_ | 1850016551675101184 |
|---|---|
| author | Xi Liu Linlin Cai Zhiming Zhou Peiming Huang Zhonglu Ren |
| author_facet | Xi Liu Linlin Cai Zhiming Zhou Peiming Huang Zhonglu Ren |
| author_sort | Xi Liu |
| collection | DOAJ |
| container_title | Journal of Holistic Integrative Pharmacy |
| description | Objective: There is a scarcity of transcriptome sequencing data available for the Leptopetalum biflorum, and numerous cyclotides remain undiscovered. It is urgent to establish a workflow based on de novo transcriptome assembly and make systematic prediction of cyclotides in Leptopetalum biflorum, to provide a reference for functional analysis of cyclotides. Methods: In this study, we performed RNA-seq on roots, leaves, and flowers of Leptopetalum biflorum to obtain two sets of transcriptome data. The quality assessment of the sequencing was conducted using FastQC and MultiQC. De novo transcriptome assembly of Leptopetalum biflorum was carried out using Trinity, with assembly quality evaluated through the Read Support method and BUSCO tool analysis. The eggnog-mapper and Trinotate were used to annotate functional terms in GO and pathways in KEGG. The Transdecoder was utilized to predict ORFs and coding regions while SignalP software was employed to predict amino acid sequences containing signal peptides and signal peptide splicing sites. The mature protein sequences are subsequently used for cyclotide prediction in Leptopetalum biflorum via FindCRP 2.0 (Find Cyclotide Peptide), a cyclotide prediction tool developed by our team. Results: Trinity assembled a total of 171,310 transcripts and 103,299 isoforms (genes). The average transcript length was 1139.89, while the average gene length was 780.87. Approximately 30% of the genes exhibited homology within other plant species. Among these genes, 23,265 (22.52%) were annotated into 41 GO terms at Level 2. The KEGG pathway annotation revealed that 23,682 genes (22.92%) contained 5171 KO annotations and were involved in 484 pathways. FindCRP predicted 17 potential cyclotides, among which 15 sequences had homologous genes; notably five potential cyclotides showed complete identity (100%) to their respective homologous genes. Additionally, two potential cyclotide sequences without any identified homologous demonstrated circle-forming ability based on the 3D structure prediction results. Conclusion: In this study, we developed a de novo transcriptome assembly workflow for the identification of cyclotides using RNA-seq data from Leptopetalum biflorum. Our custom-built tool, FindCRP, was employed in this workflow to detect potential cyclotides. This meticulously designed workflow ensures the reproducibility and reliability of our study findings. We successfully performed transcript annotation and predicted putative cyclotides. These potential cyclotides show significant homology to known cyclotides. |
| format | Article |
| id | doaj-art-e1b239fa29444d1887e15922d021197e |
| institution | Directory of Open Access Journals |
| issn | 2707-3688 |
| language | English |
| publishDate | 2024-06-01 |
| publisher | KeAi Communications Co., Ltd. |
| record_format | Article |
| spelling | doaj-art-e1b239fa29444d1887e15922d021197e2025-08-20T00:41:50ZengKeAi Communications Co., Ltd.Journal of Holistic Integrative Pharmacy2707-36882024-06-015210311210.1016/j.jhip.2024.06.003Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotationXi Liu0Linlin Cai1Zhiming Zhou2Peiming Huang3Zhonglu Ren4School of Medical Information and Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, China; Guangdong Province Precise Medicine Big Data of Traditional Chinese Medicine Engineering Technology Research Center, Guangzhou, 510006, China; Corresponding author. School of Medical Information and Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, China.School of Medical Information and Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, China; Guangdong Province Precise Medicine Big Data of Traditional Chinese Medicine Engineering Technology Research Center, Guangzhou, 510006, ChinaSchool of Medical Information and Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, China; Guangdong Province Precise Medicine Big Data of Traditional Chinese Medicine Engineering Technology Research Center, Guangzhou, 510006, ChinaSchool of Medical Information and Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, ChinaSchool of Medical Information and Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, China; Guangdong Province Precise Medicine Big Data of Traditional Chinese Medicine Engineering Technology Research Center, Guangzhou, 510006, China; Corresponding author. School of Medical Information and Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, China. Lead Contact.Objective: There is a scarcity of transcriptome sequencing data available for the Leptopetalum biflorum, and numerous cyclotides remain undiscovered. It is urgent to establish a workflow based on de novo transcriptome assembly and make systematic prediction of cyclotides in Leptopetalum biflorum, to provide a reference for functional analysis of cyclotides. Methods: In this study, we performed RNA-seq on roots, leaves, and flowers of Leptopetalum biflorum to obtain two sets of transcriptome data. The quality assessment of the sequencing was conducted using FastQC and MultiQC. De novo transcriptome assembly of Leptopetalum biflorum was carried out using Trinity, with assembly quality evaluated through the Read Support method and BUSCO tool analysis. The eggnog-mapper and Trinotate were used to annotate functional terms in GO and pathways in KEGG. The Transdecoder was utilized to predict ORFs and coding regions while SignalP software was employed to predict amino acid sequences containing signal peptides and signal peptide splicing sites. The mature protein sequences are subsequently used for cyclotide prediction in Leptopetalum biflorum via FindCRP 2.0 (Find Cyclotide Peptide), a cyclotide prediction tool developed by our team. Results: Trinity assembled a total of 171,310 transcripts and 103,299 isoforms (genes). The average transcript length was 1139.89, while the average gene length was 780.87. Approximately 30% of the genes exhibited homology within other plant species. Among these genes, 23,265 (22.52%) were annotated into 41 GO terms at Level 2. The KEGG pathway annotation revealed that 23,682 genes (22.92%) contained 5171 KO annotations and were involved in 484 pathways. FindCRP predicted 17 potential cyclotides, among which 15 sequences had homologous genes; notably five potential cyclotides showed complete identity (100%) to their respective homologous genes. Additionally, two potential cyclotide sequences without any identified homologous demonstrated circle-forming ability based on the 3D structure prediction results. Conclusion: In this study, we developed a de novo transcriptome assembly workflow for the identification of cyclotides using RNA-seq data from Leptopetalum biflorum. Our custom-built tool, FindCRP, was employed in this workflow to detect potential cyclotides. This meticulously designed workflow ensures the reproducibility and reliability of our study findings. We successfully performed transcript annotation and predicted putative cyclotides. These potential cyclotides show significant homology to known cyclotides.http://www.sciencedirect.com/science/article/pii/S2707368824000323Leptopetalum biflorumDe novo assemblyCyclotide |
| spellingShingle | Xi Liu Linlin Cai Zhiming Zhou Peiming Huang Zhonglu Ren Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotation Leptopetalum biflorum De novo assembly Cyclotide |
| title | Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotation |
| title_full | Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotation |
| title_fullStr | Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotation |
| title_full_unstemmed | Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotation |
| title_short | Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotation |
| title_sort | cyclotides prediction in leptopetalum biflorum based on de novo transcriptome assembly and annotation |
| topic | Leptopetalum biflorum De novo assembly Cyclotide |
| url | http://www.sciencedirect.com/science/article/pii/S2707368824000323 |
| work_keys_str_mv | AT xiliu cyclotidespredictioninleptopetalumbiflorumbasedondenovotranscriptomeassemblyandannotation AT linlincai cyclotidespredictioninleptopetalumbiflorumbasedondenovotranscriptomeassemblyandannotation AT zhimingzhou cyclotidespredictioninleptopetalumbiflorumbasedondenovotranscriptomeassemblyandannotation AT peiminghuang cyclotidespredictioninleptopetalumbiflorumbasedondenovotranscriptomeassemblyandannotation AT zhongluren cyclotidespredictioninleptopetalumbiflorumbasedondenovotranscriptomeassemblyandannotation |
