Bioinformatics prediction of overlapping frameshifted translation products in mammalian transcripts

<p>Abstract</p> <p>Background</p> <p>Exceptionally, a single nucleotide sequence can be translated <it>in vivo </it>in two different frames to yield distinct proteins. In the case of the G-protein alpha subunit XL-alpha-s transcript, a frameshifted open read...

Full description

Bibliographic Details
Main Authors: Seuwen Klaus, Baumgarten Birgit, Brüngger Adrian, Ribrioux Sebastien, John Markus R
Format: Article
Language:English
Published: BMC 2008-03-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/9/122
Description
Summary:<p>Abstract</p> <p>Background</p> <p>Exceptionally, a single nucleotide sequence can be translated <it>in vivo </it>in two different frames to yield distinct proteins. In the case of the G-protein alpha subunit XL-alpha-s transcript, a frameshifted open reading frame (ORF) in exon 1 is translated to yield a structurally distinct protein called Alex, which plays a role in platelet aggregation and neurological processes. We carried out a novel bioinformatics screen for other possible dual-frame translated sequences, based on comparative genomics.</p> <p>Results</p> <p>Our method searched human, mouse and rat transcripts in frames +1 and -1 for ORFs which are unusually well conserved at the amino acid level. We name these conserved frameshifted overlapping ORFs 'matreshkas' to reflect their nested character. Select findings of our analysis revealed that the G-protein coupled receptor GPR27 is entirely contained within a frame -1 matreshka, thrombopoietin contains a matreshka which spans ~70% of its length, platelet glycoprotein IIIa (ITGB3) contains a matreshka with the predicted characteristics of a secreted peptide hormone, while the potassium channel KCNK12 contains a matreshka spanning >400 amino acids.</p> <p>Conclusion</p> <p>Although the <it>in vivo </it>existence of translated matreshkas has not been experimentally verified, this genome-wide analysis provides strong evidence that substantial overlapping coding sequences exist in a number of human and rodent transcripts.</p>
ISSN:1471-2164