Full-length mRNA sequencing uncovers a widespread coupling between transcription initiation and mRNA processing

Abstract Background The multifaceted control of gene expression requires tight coordination of regulatory mechanisms at transcriptional and post-transcriptional level. Here, we studied the interdependence of transcription initiation, splicing and polyadenylation events on single mRNA molecules by fu...

Full description

Bibliographic Details
Main Authors: Seyed Yahya Anvar, Guy Allard, Elizabeth Tseng, Gloria M. Sheynkman, Eleonora de Klerk, Martijn Vermaat, Raymund H. Yin, Hans E. Johansson, Yavuz Ariyurek, Johan T. den Dunnen, Stephen W. Turner, Peter A. C. ‘t Hoen
Format: Article
Language:English
Published: BMC 2018-03-01
Series:Genome Biology
Online Access:http://link.springer.com/article/10.1186/s13059-018-1418-0
id doaj-bf64d263aaa54535af8e5be27b8cb977
record_format Article
spelling doaj-bf64d263aaa54535af8e5be27b8cb9772020-11-25T00:47:43ZengBMCGenome Biology1474-760X2018-03-0119111810.1186/s13059-018-1418-0Full-length mRNA sequencing uncovers a widespread coupling between transcription initiation and mRNA processingSeyed Yahya Anvar0Guy Allard1Elizabeth Tseng2Gloria M. Sheynkman3Eleonora de Klerk4Martijn Vermaat5Raymund H. Yin6Hans E. Johansson7Yavuz Ariyurek8Johan T. den Dunnen9Stephen W. Turner10Peter A. C. ‘t Hoen11Department of Human Genetics, Leiden University Medical CenterDepartment of Human Genetics, Leiden University Medical CenterPacific BiosciencesCenter for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer InstituteDepartment of Human Genetics, Leiden University Medical CenterDepartment of Human Genetics, Leiden University Medical CenterLGC Biosearch TechnologiesLGC Biosearch TechnologiesDepartment of Human Genetics, Leiden University Medical CenterDepartment of Human Genetics, Leiden University Medical CenterPacific BiosciencesDepartment of Human Genetics, Leiden University Medical CenterAbstract Background The multifaceted control of gene expression requires tight coordination of regulatory mechanisms at transcriptional and post-transcriptional level. Here, we studied the interdependence of transcription initiation, splicing and polyadenylation events on single mRNA molecules by full-length mRNA sequencing. Results In MCF-7 breast cancer cells, we find 2700 genes with interdependent alternative transcription initiation, splicing and polyadenylation events, both in proximal and distant parts of mRNA molecules, including examples of coupling between transcription start sites and polyadenylation sites. The analysis of three human primary tissues (brain, heart and liver) reveals similar patterns of interdependency between transcription initiation and mRNA processing events. We predict thousands of novel open reading frames from full-length mRNA sequences and obtained evidence for their translation by shotgun proteomics. The mapping database rescues 358 previously unassigned peptides and improves the assignment of others. By recognizing sample-specific amino-acid changes and novel splicing patterns, full-length mRNA sequencing improves proteogenomics analysis of MCF-7 cells. Conclusions Our findings demonstrate that our understanding of transcriptome complexity is far from complete and provides a basis to reveal largely unresolved mechanisms that coordinate transcription initiation and mRNA processing.http://link.springer.com/article/10.1186/s13059-018-1418-0
collection DOAJ
language English
format Article
sources DOAJ
author Seyed Yahya Anvar
Guy Allard
Elizabeth Tseng
Gloria M. Sheynkman
Eleonora de Klerk
Martijn Vermaat
Raymund H. Yin
Hans E. Johansson
Yavuz Ariyurek
Johan T. den Dunnen
Stephen W. Turner
Peter A. C. ‘t Hoen
spellingShingle Seyed Yahya Anvar
Guy Allard
Elizabeth Tseng
Gloria M. Sheynkman
Eleonora de Klerk
Martijn Vermaat
Raymund H. Yin
Hans E. Johansson
Yavuz Ariyurek
Johan T. den Dunnen
Stephen W. Turner
Peter A. C. ‘t Hoen
Full-length mRNA sequencing uncovers a widespread coupling between transcription initiation and mRNA processing
Genome Biology
author_facet Seyed Yahya Anvar
Guy Allard
Elizabeth Tseng
Gloria M. Sheynkman
Eleonora de Klerk
Martijn Vermaat
Raymund H. Yin
Hans E. Johansson
Yavuz Ariyurek
Johan T. den Dunnen
Stephen W. Turner
Peter A. C. ‘t Hoen
author_sort Seyed Yahya Anvar
title Full-length mRNA sequencing uncovers a widespread coupling between transcription initiation and mRNA processing
title_short Full-length mRNA sequencing uncovers a widespread coupling between transcription initiation and mRNA processing
title_full Full-length mRNA sequencing uncovers a widespread coupling between transcription initiation and mRNA processing
title_fullStr Full-length mRNA sequencing uncovers a widespread coupling between transcription initiation and mRNA processing
title_full_unstemmed Full-length mRNA sequencing uncovers a widespread coupling between transcription initiation and mRNA processing
title_sort full-length mrna sequencing uncovers a widespread coupling between transcription initiation and mrna processing
publisher BMC
series Genome Biology
issn 1474-760X
publishDate 2018-03-01
description Abstract Background The multifaceted control of gene expression requires tight coordination of regulatory mechanisms at transcriptional and post-transcriptional level. Here, we studied the interdependence of transcription initiation, splicing and polyadenylation events on single mRNA molecules by full-length mRNA sequencing. Results In MCF-7 breast cancer cells, we find 2700 genes with interdependent alternative transcription initiation, splicing and polyadenylation events, both in proximal and distant parts of mRNA molecules, including examples of coupling between transcription start sites and polyadenylation sites. The analysis of three human primary tissues (brain, heart and liver) reveals similar patterns of interdependency between transcription initiation and mRNA processing events. We predict thousands of novel open reading frames from full-length mRNA sequences and obtained evidence for their translation by shotgun proteomics. The mapping database rescues 358 previously unassigned peptides and improves the assignment of others. By recognizing sample-specific amino-acid changes and novel splicing patterns, full-length mRNA sequencing improves proteogenomics analysis of MCF-7 cells. Conclusions Our findings demonstrate that our understanding of transcriptome complexity is far from complete and provides a basis to reveal largely unresolved mechanisms that coordinate transcription initiation and mRNA processing.
url http://link.springer.com/article/10.1186/s13059-018-1418-0
work_keys_str_mv AT seyedyahyaanvar fulllengthmrnasequencinguncoversawidespreadcouplingbetweentranscriptioninitiationandmrnaprocessing
AT guyallard fulllengthmrnasequencinguncoversawidespreadcouplingbetweentranscriptioninitiationandmrnaprocessing
AT elizabethtseng fulllengthmrnasequencinguncoversawidespreadcouplingbetweentranscriptioninitiationandmrnaprocessing
AT gloriamsheynkman fulllengthmrnasequencinguncoversawidespreadcouplingbetweentranscriptioninitiationandmrnaprocessing
AT eleonoradeklerk fulllengthmrnasequencinguncoversawidespreadcouplingbetweentranscriptioninitiationandmrnaprocessing
AT martijnvermaat fulllengthmrnasequencinguncoversawidespreadcouplingbetweentranscriptioninitiationandmrnaprocessing
AT raymundhyin fulllengthmrnasequencinguncoversawidespreadcouplingbetweentranscriptioninitiationandmrnaprocessing
AT hansejohansson fulllengthmrnasequencinguncoversawidespreadcouplingbetweentranscriptioninitiationandmrnaprocessing
AT yavuzariyurek fulllengthmrnasequencinguncoversawidespreadcouplingbetweentranscriptioninitiationandmrnaprocessing
AT johantdendunnen fulllengthmrnasequencinguncoversawidespreadcouplingbetweentranscriptioninitiationandmrnaprocessing
AT stephenwturner fulllengthmrnasequencinguncoversawidespreadcouplingbetweentranscriptioninitiationandmrnaprocessing
AT peteracthoen fulllengthmrnasequencinguncoversawidespreadcouplingbetweentranscriptioninitiationandmrnaprocessing
_version_ 1725258965090041856