Full-length transcriptome analysis and identification of transcript structures in Eimeria necatrix from different developmental stages by single-molecule real-time sequencing

Abstract Background Eimeria necatrix is one of the most pathogenic parasites, causing high mortality in chickens. Although its genome sequence has been published, the sequences and complete structures of its mRNA transcripts remain unclear, limiting exploration of novel biomarkers, drug targets and...

Full description

Bibliographic Details
Main Authors: Yang Gao, Zeyang Suding, Lele Wang, Dandan Liu, Shijie Su, Jinjun Xu, Junjie Hu, Jianping Tao
Format: Article
Language:English
Published: BMC 2021-09-01
Series:Parasites & Vectors
Subjects:
Online Access:https://doi.org/10.1186/s13071-021-05015-7
id doaj-01f6c2692a5642fbb1279d0191c5cfbf
record_format Article
spelling doaj-01f6c2692a5642fbb1279d0191c5cfbf2021-10-03T11:37:55ZengBMCParasites & Vectors1756-33052021-09-0114111310.1186/s13071-021-05015-7Full-length transcriptome analysis and identification of transcript structures in Eimeria necatrix from different developmental stages by single-molecule real-time sequencingYang Gao0Zeyang Suding1Lele Wang2Dandan Liu3Shijie Su4Jinjun Xu5Junjie Hu6Jianping Tao7College of Veterinary Medicine, Yangzhou UniversityCollege of Veterinary Medicine, Yangzhou UniversityCollege of Veterinary Medicine, Yangzhou UniversityCollege of Veterinary Medicine, Yangzhou UniversityCollege of Veterinary Medicine, Yangzhou UniversityCollege of Veterinary Medicine, Yangzhou UniversityBiology Department, Yunnan UniversityCollege of Veterinary Medicine, Yangzhou UniversityAbstract Background Eimeria necatrix is one of the most pathogenic parasites, causing high mortality in chickens. Although its genome sequence has been published, the sequences and complete structures of its mRNA transcripts remain unclear, limiting exploration of novel biomarkers, drug targets and genetic functions in E. necatrix. Methods Second-generation merozoites (MZ-2) of E. necatrix were collected using Percoll density gradients, and high-quality RNA was extracted from them. Single-molecule real-time (SMRT) sequencing and Illumina sequencing were combined to generate the transcripts of MZ-2. Combined with the SMRT sequencing data of sporozoites (SZ) collected in our previous study, the transcriptome and transcript structures of E. necatrix were studied. Results SMRT sequencing yielded 21,923 consensus isoforms in MZ-2. A total of 17,151 novel isoforms of known genes and 3918 isoforms of novel genes were successfully identified. We also identified 2752 (SZ) and 3255 (MZ-2) alternative splicing (AS) events, 1705 (SZ) and 1874 (MZ-2) genes with alternative polyadenylation (APA) sites, 4019 (SZ) and 2588 (MZ-2) fusion transcripts, 159 (SZ) and 84 (MZ-2) putative transcription factors (TFs) and 3581 (SZ) and 2039 (MZ-2) long non-coding RNAs (lncRNAs). To validate fusion transcripts, reverse transcription-PCR was performed on 16 candidates, with an accuracy reaching up to 87.5%. Sanger sequencing of the PCR products further confirmed the authenticity of chimeric transcripts. Comparative analysis of transcript structures revealed a total of 3710 consensus isoforms, 815 AS events, 1139 genes with APA sites, 20 putative TFs and 352 lncRNAs in both SZ and MZ-2. Conclusions We obtained many long-read isoforms in E. necatrix SZ and MZ-2, from which a series of lncRNAs, AS events, APA events and fusion transcripts were identified. Information on TFs will improve understanding of transcriptional regulation, and fusion event data will greatly improve draft versions of gene models in E. necatrix. This information offers insights into the mechanisms governing the development of E. necatrix and will aid in the development of novel strategies for coccidiosis control. Graphical Abstracthttps://doi.org/10.1186/s13071-021-05015-7Eimeria necatrixNovel genesAlternative splicingAlternative polyadenylationLong non-coding RNAsFusion transcripts
collection DOAJ
language English
format Article
sources DOAJ
author Yang Gao
Zeyang Suding
Lele Wang
Dandan Liu
Shijie Su
Jinjun Xu
Junjie Hu
Jianping Tao
spellingShingle Yang Gao
Zeyang Suding
Lele Wang
Dandan Liu
Shijie Su
Jinjun Xu
Junjie Hu
Jianping Tao
Full-length transcriptome analysis and identification of transcript structures in Eimeria necatrix from different developmental stages by single-molecule real-time sequencing
Parasites & Vectors
Eimeria necatrix
Novel genes
Alternative splicing
Alternative polyadenylation
Long non-coding RNAs
Fusion transcripts
author_facet Yang Gao
Zeyang Suding
Lele Wang
Dandan Liu
Shijie Su
Jinjun Xu
Junjie Hu
Jianping Tao
author_sort Yang Gao
title Full-length transcriptome analysis and identification of transcript structures in Eimeria necatrix from different developmental stages by single-molecule real-time sequencing
title_short Full-length transcriptome analysis and identification of transcript structures in Eimeria necatrix from different developmental stages by single-molecule real-time sequencing
title_full Full-length transcriptome analysis and identification of transcript structures in Eimeria necatrix from different developmental stages by single-molecule real-time sequencing
title_fullStr Full-length transcriptome analysis and identification of transcript structures in Eimeria necatrix from different developmental stages by single-molecule real-time sequencing
title_full_unstemmed Full-length transcriptome analysis and identification of transcript structures in Eimeria necatrix from different developmental stages by single-molecule real-time sequencing
title_sort full-length transcriptome analysis and identification of transcript structures in eimeria necatrix from different developmental stages by single-molecule real-time sequencing
publisher BMC
series Parasites & Vectors
issn 1756-3305
publishDate 2021-09-01
description Abstract Background Eimeria necatrix is one of the most pathogenic parasites, causing high mortality in chickens. Although its genome sequence has been published, the sequences and complete structures of its mRNA transcripts remain unclear, limiting exploration of novel biomarkers, drug targets and genetic functions in E. necatrix. Methods Second-generation merozoites (MZ-2) of E. necatrix were collected using Percoll density gradients, and high-quality RNA was extracted from them. Single-molecule real-time (SMRT) sequencing and Illumina sequencing were combined to generate the transcripts of MZ-2. Combined with the SMRT sequencing data of sporozoites (SZ) collected in our previous study, the transcriptome and transcript structures of E. necatrix were studied. Results SMRT sequencing yielded 21,923 consensus isoforms in MZ-2. A total of 17,151 novel isoforms of known genes and 3918 isoforms of novel genes were successfully identified. We also identified 2752 (SZ) and 3255 (MZ-2) alternative splicing (AS) events, 1705 (SZ) and 1874 (MZ-2) genes with alternative polyadenylation (APA) sites, 4019 (SZ) and 2588 (MZ-2) fusion transcripts, 159 (SZ) and 84 (MZ-2) putative transcription factors (TFs) and 3581 (SZ) and 2039 (MZ-2) long non-coding RNAs (lncRNAs). To validate fusion transcripts, reverse transcription-PCR was performed on 16 candidates, with an accuracy reaching up to 87.5%. Sanger sequencing of the PCR products further confirmed the authenticity of chimeric transcripts. Comparative analysis of transcript structures revealed a total of 3710 consensus isoforms, 815 AS events, 1139 genes with APA sites, 20 putative TFs and 352 lncRNAs in both SZ and MZ-2. Conclusions We obtained many long-read isoforms in E. necatrix SZ and MZ-2, from which a series of lncRNAs, AS events, APA events and fusion transcripts were identified. Information on TFs will improve understanding of transcriptional regulation, and fusion event data will greatly improve draft versions of gene models in E. necatrix. This information offers insights into the mechanisms governing the development of E. necatrix and will aid in the development of novel strategies for coccidiosis control. Graphical Abstract
topic Eimeria necatrix
Novel genes
Alternative splicing
Alternative polyadenylation
Long non-coding RNAs
Fusion transcripts
url https://doi.org/10.1186/s13071-021-05015-7
work_keys_str_mv AT yanggao fulllengthtranscriptomeanalysisandidentificationoftranscriptstructuresineimerianecatrixfromdifferentdevelopmentalstagesbysinglemoleculerealtimesequencing
AT zeyangsuding fulllengthtranscriptomeanalysisandidentificationoftranscriptstructuresineimerianecatrixfromdifferentdevelopmentalstagesbysinglemoleculerealtimesequencing
AT lelewang fulllengthtranscriptomeanalysisandidentificationoftranscriptstructuresineimerianecatrixfromdifferentdevelopmentalstagesbysinglemoleculerealtimesequencing
AT dandanliu fulllengthtranscriptomeanalysisandidentificationoftranscriptstructuresineimerianecatrixfromdifferentdevelopmentalstagesbysinglemoleculerealtimesequencing
AT shijiesu fulllengthtranscriptomeanalysisandidentificationoftranscriptstructuresineimerianecatrixfromdifferentdevelopmentalstagesbysinglemoleculerealtimesequencing
AT jinjunxu fulllengthtranscriptomeanalysisandidentificationoftranscriptstructuresineimerianecatrixfromdifferentdevelopmentalstagesbysinglemoleculerealtimesequencing
AT junjiehu fulllengthtranscriptomeanalysisandidentificationoftranscriptstructuresineimerianecatrixfromdifferentdevelopmentalstagesbysinglemoleculerealtimesequencing
AT jianpingtao fulllengthtranscriptomeanalysisandidentificationoftranscriptstructuresineimerianecatrixfromdifferentdevelopmentalstagesbysinglemoleculerealtimesequencing
_version_ 1716845320235646976