The common origin of symmetry and structure in genetic sequences

Bibliographic Details
Main Authors: Giampaolo Cristadoro, Mirko Degli Esposti, Eduardo G. Altmann
Format: Article
Language: English
Published: Nature Publishing Group 2018-10-01
Series: Scientific Reports
Online Access: https://doi.org/10.1038/s41598-018-34136-w
Description
Summary: Biologists have long sought a way to explain how statistical properties of genetic sequences emerged and are maintained through evolution. On the one hand, non-random structures at different scales indicate a complex genome organisation. On the other hand, single-strand symmetry has been scrutinised using neutral models in which correlations are not considered or irrelevant, contrary to empirical evidence. Different studies investigated these two statistical features separately, reaching minimal consensus despite sustained efforts. Here we unravel previously unknown symmetries in genetic sequences, which are organized hierarchically through scales in which non-random structures are known to be present. These observations are confirmed through the statistical analysis of the human genome and explained through a simple domain model. These results suggest that domain models which account for the cumulative action of mobile elements can explain simultaneously non-random structures and symmetries in genetic sequences.
ISSN: 2045-2322