Differentiating protein-coding and noncoding RNA: challenges and ambiguities.

The assumption that RNA can be readily classified into either protein-coding or non-protein-coding categories has pervaded biology for close to 50 years. Until recently, discrimination between these two categories was relatively straightforward: most transcripts were clearly identifiable as protein-...

Full description

Bibliographic Details
Main Authors: Marcel E Dinger, Ken C Pang, Tim R Mercer, John S Mattick
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2008-11-01
Series:PLoS Computational Biology
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/19043537/?tool=EBI
id doaj-93919d3375964c7b83ac10be8a57c50b
record_format Article
spelling doaj-93919d3375964c7b83ac10be8a57c50b2021-04-21T15:20:14ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582008-11-01411e100017610.1371/journal.pcbi.1000176Differentiating protein-coding and noncoding RNA: challenges and ambiguities.Marcel E DingerKen C PangTim R MercerJohn S MattickThe assumption that RNA can be readily classified into either protein-coding or non-protein-coding categories has pervaded biology for close to 50 years. Until recently, discrimination between these two categories was relatively straightforward: most transcripts were clearly identifiable as protein-coding messenger RNAs (mRNAs), and readily distinguished from the small number of well-characterized non-protein-coding RNAs (ncRNAs), such as transfer, ribosomal, and spliceosomal RNAs. Recent genome-wide studies have revealed the existence of thousands of noncoding transcripts, whose function and significance are unclear. The discovery of this hidden transcriptome and the implicit challenge it presents to our understanding of the expression and regulation of genetic information has made the need to distinguish between mRNAs and ncRNAs both more pressing and more complicated. In this Review, we consider the diverse strategies employed to discriminate between protein-coding and noncoding transcripts and the fundamental difficulties that are inherent in what may superficially appear to be a simple problem. Misannotations can also run in both directions: some ncRNAs may actually encode peptides, and some of those currently thought to do so may not. Moreover, recent studies have shown that some RNAs can function both as mRNAs and intrinsically as functional ncRNAs, which may be a relatively widespread phenomenon. We conclude that it is difficult to annotate an RNA unequivocally as protein-coding or noncoding, with overlapping protein-coding and noncoding transcripts further confounding this distinction. In addition, the finding that some transcripts can function both intrinsically at the RNA level and to encode proteins suggests a false dichotomy between mRNAs and ncRNAs. Therefore, the functionality of any transcript at the RNA level should not be discounted.https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/19043537/?tool=EBI
collection DOAJ
language English
format Article
sources DOAJ
author Marcel E Dinger
Ken C Pang
Tim R Mercer
John S Mattick
spellingShingle Marcel E Dinger
Ken C Pang
Tim R Mercer
John S Mattick
Differentiating protein-coding and noncoding RNA: challenges and ambiguities.
PLoS Computational Biology
author_facet Marcel E Dinger
Ken C Pang
Tim R Mercer
John S Mattick
author_sort Marcel E Dinger
title Differentiating protein-coding and noncoding RNA: challenges and ambiguities.
title_short Differentiating protein-coding and noncoding RNA: challenges and ambiguities.
title_full Differentiating protein-coding and noncoding RNA: challenges and ambiguities.
title_fullStr Differentiating protein-coding and noncoding RNA: challenges and ambiguities.
title_full_unstemmed Differentiating protein-coding and noncoding RNA: challenges and ambiguities.
title_sort differentiating protein-coding and noncoding rna: challenges and ambiguities.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2008-11-01
description The assumption that RNA can be readily classified into either protein-coding or non-protein-coding categories has pervaded biology for close to 50 years. Until recently, discrimination between these two categories was relatively straightforward: most transcripts were clearly identifiable as protein-coding messenger RNAs (mRNAs), and readily distinguished from the small number of well-characterized non-protein-coding RNAs (ncRNAs), such as transfer, ribosomal, and spliceosomal RNAs. Recent genome-wide studies have revealed the existence of thousands of noncoding transcripts, whose function and significance are unclear. The discovery of this hidden transcriptome and the implicit challenge it presents to our understanding of the expression and regulation of genetic information has made the need to distinguish between mRNAs and ncRNAs both more pressing and more complicated. In this Review, we consider the diverse strategies employed to discriminate between protein-coding and noncoding transcripts and the fundamental difficulties that are inherent in what may superficially appear to be a simple problem. Misannotations can also run in both directions: some ncRNAs may actually encode peptides, and some of those currently thought to do so may not. Moreover, recent studies have shown that some RNAs can function both as mRNAs and intrinsically as functional ncRNAs, which may be a relatively widespread phenomenon. We conclude that it is difficult to annotate an RNA unequivocally as protein-coding or noncoding, with overlapping protein-coding and noncoding transcripts further confounding this distinction. In addition, the finding that some transcripts can function both intrinsically at the RNA level and to encode proteins suggests a false dichotomy between mRNAs and ncRNAs. Therefore, the functionality of any transcript at the RNA level should not be discounted.
url https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/19043537/?tool=EBI
work_keys_str_mv AT marceledinger differentiatingproteincodingandnoncodingrnachallengesandambiguities
AT kencpang differentiatingproteincodingandnoncodingrnachallengesandambiguities
AT timrmercer differentiatingproteincodingandnoncodingrnachallengesandambiguities
AT johnsmattick differentiatingproteincodingandnoncodingrnachallengesandambiguities
_version_ 1714667415000842240