Human protein cluster analysis using amino acid frequencies.

The paper focuses on the development of a software tool for protein clustering according to their amino acid content. All known human proteins were clustered according to the relative frequencies of their amino acids starting from the UniProtKB/Swiss-Prot reference database and making use of hierarc...

Full description

Bibliographic Details
Main Authors: Annamaria Vernone, Paola Berchialla, Gianpiero Pescarmona
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3617222?pdf=render
id doaj-6661b48dbb0448699dbbdd91c7fbc1a3
record_format Article
spelling doaj-6661b48dbb0448699dbbdd91c7fbc1a32020-11-24T21:50:56ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-0184e6022010.1371/journal.pone.0060220Human protein cluster analysis using amino acid frequencies.Annamaria VernonePaola BerchiallaGianpiero PescarmonaThe paper focuses on the development of a software tool for protein clustering according to their amino acid content. All known human proteins were clustered according to the relative frequencies of their amino acids starting from the UniProtKB/Swiss-Prot reference database and making use of hierarchical cluster analysis. RESULTS were compared to those based on sequence similarities.Proteins display different clustering patterns according to type. Many extracellular proteins with highly specific and repetitive sequences (keratins, collagens etc.) cluster clearly confirming the accuracy of the clustering method. In our case clustering by sequence and amino acid content overlaps. Proteins with a more complex structure with multiple domains (catalytic, extracellular, transmembrane etc.), even if classified very similar according to sequence similarity and function (aquaporins, cadherins, steroid 5-alpha reductase etc.) showed different clustering according to amino acid content. Availability of essential amino acids according to local conditions (starvation, low or high oxygen, cell cycle phase etc.) may be a limiting factor in protein synthesis, whatever the mRNA level. This type of protein clustering may therefore prove a valuable tool in identifying so far unknown metabolic connections and constraints.http://europepmc.org/articles/PMC3617222?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Annamaria Vernone
Paola Berchialla
Gianpiero Pescarmona
spellingShingle Annamaria Vernone
Paola Berchialla
Gianpiero Pescarmona
Human protein cluster analysis using amino acid frequencies.
PLoS ONE
author_facet Annamaria Vernone
Paola Berchialla
Gianpiero Pescarmona
author_sort Annamaria Vernone
title Human protein cluster analysis using amino acid frequencies.
title_short Human protein cluster analysis using amino acid frequencies.
title_full Human protein cluster analysis using amino acid frequencies.
title_fullStr Human protein cluster analysis using amino acid frequencies.
title_full_unstemmed Human protein cluster analysis using amino acid frequencies.
title_sort human protein cluster analysis using amino acid frequencies.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2013-01-01
description The paper focuses on the development of a software tool for protein clustering according to their amino acid content. All known human proteins were clustered according to the relative frequencies of their amino acids starting from the UniProtKB/Swiss-Prot reference database and making use of hierarchical cluster analysis. RESULTS were compared to those based on sequence similarities.Proteins display different clustering patterns according to type. Many extracellular proteins with highly specific and repetitive sequences (keratins, collagens etc.) cluster clearly confirming the accuracy of the clustering method. In our case clustering by sequence and amino acid content overlaps. Proteins with a more complex structure with multiple domains (catalytic, extracellular, transmembrane etc.), even if classified very similar according to sequence similarity and function (aquaporins, cadherins, steroid 5-alpha reductase etc.) showed different clustering according to amino acid content. Availability of essential amino acids according to local conditions (starvation, low or high oxygen, cell cycle phase etc.) may be a limiting factor in protein synthesis, whatever the mRNA level. This type of protein clustering may therefore prove a valuable tool in identifying so far unknown metabolic connections and constraints.
url http://europepmc.org/articles/PMC3617222?pdf=render
work_keys_str_mv AT annamariavernone humanproteinclusteranalysisusingaminoacidfrequencies
AT paolaberchialla humanproteinclusteranalysisusingaminoacidfrequencies
AT gianpieropescarmona humanproteinclusteranalysisusingaminoacidfrequencies
_version_ 1725881509369348096