Where Does the Alignment Score Distribution Shape Come from?

Alignment algorithms are powerful tools for searching for homologous proteins in databases, providing a score for each sequence present in the database. It has been well known for 20 years that the shape of the score distribution looks like an extreme value distribution. The extremely large number o...

Full description

Bibliographic Details
Main Authors: Philippe Ortet, Olivier Bastien
Format: Article
Language:English
Published: SAGE Publishing 2010-01-01
Series:Evolutionary Bioinformatics
Online Access:https://doi.org/10.4137/EBO.S5875
id doaj-901a6e9be9db4520bf379a6833c0cef3
record_format Article
spelling doaj-901a6e9be9db4520bf379a6833c0cef32020-11-25T03:40:40ZengSAGE PublishingEvolutionary Bioinformatics1176-93432010-01-01610.4137/EBO.S5875Where Does the Alignment Score Distribution Shape Come from?Philippe Ortet0Olivier Bastien1CNRS (UMR 6191)-CEA Cadarache-Aix-Marseille Université, Laboratoire d'Ecologie Microbienne de la Rhizosphere, Institut de Biologie Environementale et Biotechnologie, CEA Cadarache, F-13108 Saint Paul-lez-Durance, France.UMR 5168 CNRS-CEA-INRA Université J. Fourier, Laboratoire de Physiologie Cellulaire Végétale, Département Réponse et Dynamique Cellulaire, CEA Grenoble, 17 rue des Martyrs, F-38054, Grenoble cedex 09, France.Alignment algorithms are powerful tools for searching for homologous proteins in databases, providing a score for each sequence present in the database. It has been well known for 20 years that the shape of the score distribution looks like an extreme value distribution. The extremely large number of times biologists face this class of distributions raises the question of the evolutionary origin of this probability law. We investigated the possibility of deriving the main properties of sequence alignment score distributions from a basic evolutionary process: a duplication-divergence protein evolution process in a sequence space. Firstly, the distribution of sequences in this space was defined with respect to the genetic distance between sequences. Secondly, we derived a basic relation between the genetic distance and the alignment score. We obtained a novel score probability distribution which is qualitatively very similar to that of Karlin-Altschul but performing better than all other previous model.https://doi.org/10.4137/EBO.S5875
collection DOAJ
language English
format Article
sources DOAJ
author Philippe Ortet
Olivier Bastien
spellingShingle Philippe Ortet
Olivier Bastien
Where Does the Alignment Score Distribution Shape Come from?
Evolutionary Bioinformatics
author_facet Philippe Ortet
Olivier Bastien
author_sort Philippe Ortet
title Where Does the Alignment Score Distribution Shape Come from?
title_short Where Does the Alignment Score Distribution Shape Come from?
title_full Where Does the Alignment Score Distribution Shape Come from?
title_fullStr Where Does the Alignment Score Distribution Shape Come from?
title_full_unstemmed Where Does the Alignment Score Distribution Shape Come from?
title_sort where does the alignment score distribution shape come from?
publisher SAGE Publishing
series Evolutionary Bioinformatics
issn 1176-9343
publishDate 2010-01-01
description Alignment algorithms are powerful tools for searching for homologous proteins in databases, providing a score for each sequence present in the database. It has been well known for 20 years that the shape of the score distribution looks like an extreme value distribution. The extremely large number of times biologists face this class of distributions raises the question of the evolutionary origin of this probability law. We investigated the possibility of deriving the main properties of sequence alignment score distributions from a basic evolutionary process: a duplication-divergence protein evolution process in a sequence space. Firstly, the distribution of sequences in this space was defined with respect to the genetic distance between sequences. Secondly, we derived a basic relation between the genetic distance and the alignment score. We obtained a novel score probability distribution which is qualitatively very similar to that of Karlin-Altschul but performing better than all other previous model.
url https://doi.org/10.4137/EBO.S5875
work_keys_str_mv AT philippeortet wheredoesthealignmentscoredistributionshapecomefrom
AT olivierbastien wheredoesthealignmentscoredistributionshapecomefrom
_version_ 1724533615287599104