Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotation

Objective: There is a scarcity of transcriptome sequencing data available for the Leptopetalum biflorum, and numerous cyclotides remain undiscovered. It is urgent to establish a workflow based on de novo transcriptome assembly and make systematic prediction of cyclotides in Leptopetalum biflorum, to...

Full description

Bibliographic Details
Published in:Journal of Holistic Integrative Pharmacy
Main Authors: Xi Liu, Linlin Cai, Zhiming Zhou, Peiming Huang, Zhonglu Ren
Format: Article
Language:English
Published: KeAi Communications Co., Ltd. 2024-06-01
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2707368824000323
_version_ 1850016551675101184
author Xi Liu
Linlin Cai
Zhiming Zhou
Peiming Huang
Zhonglu Ren
author_facet Xi Liu
Linlin Cai
Zhiming Zhou
Peiming Huang
Zhonglu Ren
author_sort Xi Liu
collection DOAJ
container_title Journal of Holistic Integrative Pharmacy
description Objective: There is a scarcity of transcriptome sequencing data available for the Leptopetalum biflorum, and numerous cyclotides remain undiscovered. It is urgent to establish a workflow based on de novo transcriptome assembly and make systematic prediction of cyclotides in Leptopetalum biflorum, to provide a reference for functional analysis of cyclotides. Methods: In this study, we performed RNA-seq on roots, leaves, and flowers of Leptopetalum biflorum to obtain two sets of transcriptome data. The quality assessment of the sequencing was conducted using FastQC and MultiQC. De novo transcriptome assembly of Leptopetalum biflorum was carried out using Trinity, with assembly quality evaluated through the Read Support method and BUSCO tool analysis. The eggnog-mapper and Trinotate were used to annotate functional terms in GO and pathways in KEGG. The Transdecoder was utilized to predict ORFs and coding regions while SignalP software was employed to predict amino acid sequences containing signal peptides and signal peptide splicing sites. The mature protein sequences are subsequently used for cyclotide prediction in Leptopetalum biflorum via FindCRP 2.0 (Find Cyclotide Peptide), a cyclotide prediction tool developed by our team. Results: Trinity assembled a total of 171,310 transcripts and 103,299 isoforms (genes). The average transcript length was 1139.89, while the average gene length was 780.87. Approximately 30% of the genes exhibited homology within other plant species. Among these genes, 23,265 (22.52%) were annotated into 41 GO terms at Level 2. The KEGG pathway annotation revealed that 23,682 genes (22.92%) contained 5171 KO annotations and were involved in 484 pathways. FindCRP predicted 17 potential cyclotides, among which 15 sequences had homologous genes; notably five potential cyclotides showed complete identity (100%) to their respective homologous genes. Additionally, two potential cyclotide sequences without any identified homologous demonstrated circle-forming ability based on the 3D structure prediction results. Conclusion: In this study, we developed a de novo transcriptome assembly workflow for the identification of cyclotides using RNA-seq data from Leptopetalum biflorum. Our custom-built tool, FindCRP, was employed in this workflow to detect potential cyclotides. This meticulously designed workflow ensures the reproducibility and reliability of our study findings. We successfully performed transcript annotation and predicted putative cyclotides. These potential cyclotides show significant homology to known cyclotides.
format Article
id doaj-art-e1b239fa29444d1887e15922d021197e
institution Directory of Open Access Journals
issn 2707-3688
language English
publishDate 2024-06-01
publisher KeAi Communications Co., Ltd.
record_format Article
spelling doaj-art-e1b239fa29444d1887e15922d021197e2025-08-20T00:41:50ZengKeAi Communications Co., Ltd.Journal of Holistic Integrative Pharmacy2707-36882024-06-015210311210.1016/j.jhip.2024.06.003Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotationXi Liu0Linlin Cai1Zhiming Zhou2Peiming Huang3Zhonglu Ren4School of Medical Information and Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, China; Guangdong Province Precise Medicine Big Data of Traditional Chinese Medicine Engineering Technology Research Center, Guangzhou, 510006, China; Corresponding author. School of Medical Information and Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, China.School of Medical Information and Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, China; Guangdong Province Precise Medicine Big Data of Traditional Chinese Medicine Engineering Technology Research Center, Guangzhou, 510006, ChinaSchool of Medical Information and Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, China; Guangdong Province Precise Medicine Big Data of Traditional Chinese Medicine Engineering Technology Research Center, Guangzhou, 510006, ChinaSchool of Medical Information and Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, ChinaSchool of Medical Information and Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, China; Guangdong Province Precise Medicine Big Data of Traditional Chinese Medicine Engineering Technology Research Center, Guangzhou, 510006, China; Corresponding author. School of Medical Information and Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, China. Lead Contact.Objective: There is a scarcity of transcriptome sequencing data available for the Leptopetalum biflorum, and numerous cyclotides remain undiscovered. It is urgent to establish a workflow based on de novo transcriptome assembly and make systematic prediction of cyclotides in Leptopetalum biflorum, to provide a reference for functional analysis of cyclotides. Methods: In this study, we performed RNA-seq on roots, leaves, and flowers of Leptopetalum biflorum to obtain two sets of transcriptome data. The quality assessment of the sequencing was conducted using FastQC and MultiQC. De novo transcriptome assembly of Leptopetalum biflorum was carried out using Trinity, with assembly quality evaluated through the Read Support method and BUSCO tool analysis. The eggnog-mapper and Trinotate were used to annotate functional terms in GO and pathways in KEGG. The Transdecoder was utilized to predict ORFs and coding regions while SignalP software was employed to predict amino acid sequences containing signal peptides and signal peptide splicing sites. The mature protein sequences are subsequently used for cyclotide prediction in Leptopetalum biflorum via FindCRP 2.0 (Find Cyclotide Peptide), a cyclotide prediction tool developed by our team. Results: Trinity assembled a total of 171,310 transcripts and 103,299 isoforms (genes). The average transcript length was 1139.89, while the average gene length was 780.87. Approximately 30% of the genes exhibited homology within other plant species. Among these genes, 23,265 (22.52%) were annotated into 41 GO terms at Level 2. The KEGG pathway annotation revealed that 23,682 genes (22.92%) contained 5171 KO annotations and were involved in 484 pathways. FindCRP predicted 17 potential cyclotides, among which 15 sequences had homologous genes; notably five potential cyclotides showed complete identity (100%) to their respective homologous genes. Additionally, two potential cyclotide sequences without any identified homologous demonstrated circle-forming ability based on the 3D structure prediction results. Conclusion: In this study, we developed a de novo transcriptome assembly workflow for the identification of cyclotides using RNA-seq data from Leptopetalum biflorum. Our custom-built tool, FindCRP, was employed in this workflow to detect potential cyclotides. This meticulously designed workflow ensures the reproducibility and reliability of our study findings. We successfully performed transcript annotation and predicted putative cyclotides. These potential cyclotides show significant homology to known cyclotides.http://www.sciencedirect.com/science/article/pii/S2707368824000323Leptopetalum biflorumDe novo assemblyCyclotide
spellingShingle Xi Liu
Linlin Cai
Zhiming Zhou
Peiming Huang
Zhonglu Ren
Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotation
Leptopetalum biflorum
De novo assembly
Cyclotide
title Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotation
title_full Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotation
title_fullStr Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotation
title_full_unstemmed Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotation
title_short Cyclotides prediction in Leptopetalum biflorum based on de novo transcriptome assembly and annotation
title_sort cyclotides prediction in leptopetalum biflorum based on de novo transcriptome assembly and annotation
topic Leptopetalum biflorum
De novo assembly
Cyclotide
url http://www.sciencedirect.com/science/article/pii/S2707368824000323
work_keys_str_mv AT xiliu cyclotidespredictioninleptopetalumbiflorumbasedondenovotranscriptomeassemblyandannotation
AT linlincai cyclotidespredictioninleptopetalumbiflorumbasedondenovotranscriptomeassemblyandannotation
AT zhimingzhou cyclotidespredictioninleptopetalumbiflorumbasedondenovotranscriptomeassemblyandannotation
AT peiminghuang cyclotidespredictioninleptopetalumbiflorumbasedondenovotranscriptomeassemblyandannotation
AT zhongluren cyclotidespredictioninleptopetalumbiflorumbasedondenovotranscriptomeassemblyandannotation