Minimal absent words in prokaryotic and eukaryotic genomes.

Minimal absent words have been computed in genomes of organisms from all domains of life. Here, we explore different sets of minimal absent words in the genomes of 22 organisms (one archaeota, thirteen bacteria and eight eukaryotes). We investigate if the mutational biases that may explain the defic...

Full description

Bibliographic Details
Published in:PLoS ONE
Main Authors: Sara P Garcia, Armando J Pinho, João M O S Rodrigues, Carlos A C Bastos, Paulo J S G Ferreira
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2011-01-01
Online Access:http://europepmc.org/articles/PMC3031530?pdf=render
Description
Summary:Minimal absent words have been computed in genomes of organisms from all domains of life. Here, we explore different sets of minimal absent words in the genomes of 22 organisms (one archaeota, thirteen bacteria and eight eukaryotes). We investigate if the mutational biases that may explain the deficit of the shortest absent words in vertebrates are also pervasive in other absent words, namely in minimal absent words, as well as to other organisms. We find that the compositional biases observed for the shortest absent words in vertebrates are not uniform throughout different sets of minimal absent words. We further investigate the hypothesis of the inheritance of minimal absent words through common ancestry from the similarity in dinucleotide relative abundances of different sets of minimal absent words, and find that this inheritance may be exclusive to vertebrates.
ISSN:1932-6203