<i>Fugacium</i> Spliced Leader Genes Identified from Stranded RNA-Seq Datasets

<i>Trans</i>-splicing mechanisms have been documented in many lineages that are widely distributed phylogenetically, including dinoflagellates. The spliced leader (SL) sequence itself is conserved in dinoflagellates, although its gene sequences and arrangements have diversified within or...

Bibliographic Details
Main Authors: Yue Song, Bahareh Zaheri, Min Liu, Sunil Kumar Sahu, Huan Liu, Wenbin Chen, Bo Song, David Morse
Format: Article
Language: English
Published: MDPI AG 2019-06-01
Series: Microorganisms
Subjects: dinoflagellates, <i>Symbiodinium</i>, <i>Fugacium</i>, <i>trans</i>-splicing, spliced leader
Online Access: https://www.mdpi.com/2076-2607/7/6/171
id doaj-209401c8d82a4b3fa7024ef7fbb1910c
record_format Article
spelling doaj-209401c8d82a4b3fa7024ef7fbb1910c
2020-11-25T00:16:48Z
eng
MDPI AG
Microorganisms
2076-2607
2019-06-01
7
6
171
10.3390/microorganisms7060171
microorganisms7060171
<i>Fugacium</i> Spliced Leader Genes Identified from Stranded RNA-Seq Datasets
Yue Song 0
Bahareh Zaheri 1
Min Liu 2
Sunil Kumar Sahu 3
Huan Liu 4
Wenbin Chen 5
Bo Song 6
David Morse 7
BGI-Qingdao, BGI-Shenzhen, Qingdao 266555, China
Institut de Recherche en Biologie Végétale, Département de Sciences Biologiques, Université de Montréal, Montréal, QC H1X 2B2, Canada
BGI-Shenzhen, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
BGI-Shenzhen, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
BGI-Shenzhen, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
BGI-Shenzhen, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
Institut de Recherche en Biologie Végétale, Département de Sciences Biologiques, Université de Montréal, Montréal, QC H1X 2B2, Canada
<i>Trans</i>-splicing mechanisms have been documented in many lineages that are widely distributed phylogenetically, including dinoflagellates. The spliced leader (SL) sequence itself is conserved in dinoflagellates, although its gene sequences and arrangements have diversified within or across different species. In this study, we present 18 <i>Fugacium kawagutii</i> SL genes identified from stranded RNA-seq reads. These genes typically have a single SL but can contain several partial SLs with lengths ranging from 103 to 292 bp. Unexpectedly, we find the SL gene transcripts contain sequences upstream of the canonical SL, suggesting that generation of mature transcripts will require additional modifications following <i>trans</i>-splicing. We have also identified 13 SL-like genes whose expression levels and length are comparable to Dino-SL genes. Lastly, introns in these genes were identified and a new site for Sm-protein binding was proposed. Overall, this study provides a strategy for fast identification of SL genes and identifies new sequences of <i>F. kawagutii</i> SL genes to supplement our understanding of <i>trans</i>-splicing.
https://www.mdpi.com/2076-2607/7/6/171
dinoflagellates
<i>Symbiodinium</i>
<i>Fugacium</i>
<i>trans</i>-splicing
spliced leader
collection DOAJ
language English
format Article
sources DOAJ
author Yue Song
Bahareh Zaheri
Min Liu
Sunil Kumar Sahu
Huan Liu
Wenbin Chen
Bo Song
David Morse
author_sort Yue Song
title <i>Fugacium</i> Spliced Leader Genes Identified from Stranded RNA-Seq Datasets
title_sort <i>fugacium</i> spliced leader genes identified from stranded rna-seq datasets
publisher MDPI AG
series Microorganisms
issn 2076-2607
publishDate 2019-06-01
description <i>Trans</i>-splicing mechanisms have been documented in many lineages that are widely distributed phylogenetically, including dinoflagellates. The spliced leader (SL) sequence itself is conserved in dinoflagellates, although its gene sequences and arrangements have diversified within or across different species. In this study, we present 18 <i>Fugacium kawagutii</i> SL genes identified from stranded RNA-seq reads. These genes typically have a single SL but can contain several partial SLs with lengths ranging from 103 to 292 bp. Unexpectedly, we find the SL gene transcripts contain sequences upstream of the canonical SL, suggesting that generation of mature transcripts will require additional modifications following <i>trans</i>-splicing. We have also identified 13 SL-like genes whose expression levels and length are comparable to Dino-SL genes. Lastly, introns in these genes were identified and a new site for Sm-protein binding was proposed. Overall, this study provides a strategy for fast identification of SL genes and identifies new sequences of <i>F. kawagutii</i> SL genes to supplement our understanding of <i>trans</i>-splicing.
topic dinoflagellates
<i>Symbiodinium</i>
<i>Fugacium</i>
<i>trans</i>-splicing
spliced leader
url https://www.mdpi.com/2076-2607/7/6/171
work_keys_str_mv AT yuesong ifugaciumisplicedleadergenesidentifiedfromstrandedrnaseqdatasets
AT baharehzaheri ifugaciumisplicedleadergenesidentifiedfromstrandedrnaseqdatasets
AT minliu ifugaciumisplicedleadergenesidentifiedfromstrandedrnaseqdatasets
AT sunilkumarsahu ifugaciumisplicedleadergenesidentifiedfromstrandedrnaseqdatasets
AT huanliu ifugaciumisplicedleadergenesidentifiedfromstrandedrnaseqdatasets
AT wenbinchen ifugaciumisplicedleadergenesidentifiedfromstrandedrnaseqdatasets
AT bosong ifugaciumisplicedleadergenesidentifiedfromstrandedrnaseqdatasets
AT davidmorse ifugaciumisplicedleadergenesidentifiedfromstrandedrnaseqdatasets
_version_ 1725382547273154560