The action of key factors in protein evolution at high temporal resolution.

BACKGROUND: Protein evolution is particularly shaped by the conservation of the amino acids' physico-chemical properties and the structure of the genetic code. While conservation is the result of negative selection against proteins with reduced functionality, the codon sequences determine the s...


Bibliographic Details
Main Authors: Armin Schmitt, Johannes Schuchhardt, Gudrun A Brockmann
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2009-01-01
Series: PLoS ONE
Online Access: http://europepmc.org/articles/PMC2652826?pdf=render
id doaj-3f22f88493c24ee9b53bf3cfd60d86c5
record_format Article
spelling doaj-3f22f88493c24ee9b53bf3cfd60d86c5 2020-11-25T01:48:08Z eng Public Library of Science (PLoS) PLoS ONE 1932-6203 2009-01-01 4 3 e4821 10.1371/journal.pone.0004821 The action of key factors in protein evolution at high temporal resolution. Armin Schmitt, Johannes Schuchhardt, Gudrun A Brockmann http://europepmc.org/articles/PMC2652826?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Armin Schmitt
Johannes Schuchhardt
Gudrun A Brockmann
title The action of key factors in protein evolution at high temporal resolution.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2009-01-01
description BACKGROUND: Protein evolution is shaped in particular by the conservation of the amino acids' physico-chemical properties and by the structure of the genetic code. While conservation is the result of negative selection against proteins with reduced functionality, the codon sequences determine the stochastic aspect of amino acid exchanges. Thus far, it is known that the genetic code is the dominant factor if little time has elapsed since the divergence of one gene into two, but physico-chemical forces gain importance at greater evolutionary distances. Further details on how the influence of these factors varies with time, however, are unknown to date. METHODOLOGY/PRINCIPAL FINDINGS: Here, we derive 10,000 divergence-specific substitution matrices each for orthologues and paralogues from the Pfam collection of multiple protein alignments and quantify the action of three physico-chemical forces and of the structure of the genetic code at high resolution using correlation analysis. For closely related proteins, codon sequence similarity is the most influential factor controlling protein evolution, but its influence decreases rapidly as divergence grows. From a protein sequence divergence of about 20 percent onward, the maintenance of the hydrophobic character of an amino acid is the most influential factor. All factors lose importance from about 40 percent divergence onward. This suggests that the original protein structure often no longer represents a constraint on the protein sequence. The proteins then become free to adopt new functions. We furthermore show that the constraints exerted by physico-chemical forces and by the genetic code are quite comparable for orthologues and paralogues, although somewhat weaker for paralogues than for orthologues in weakly or moderately diverged proteins.
CONCLUSION/SIGNIFICANCE: Our analysis substantiates earlier findings that protein evolution is mainly governed by the structure of the genetic code in the early phase after divergence and by the conservation of physico-chemical properties in the later phase. We determine that the conservation of the hydrophobic character gains importance over the genetic code from a sequence divergence of 20 percent onward. The evolution of orthologues and paralogues is shaped by evolutionary forces in quite comparable ways.
url http://europepmc.org/articles/PMC2652826?pdf=render