Hyperdimensional analysis of amino acid pair distributions in proteins.
Our manuscript presents a novel approach to protein structure analyses. We have organized an 8-dimensional data cube with protein 3D-structural information from 8706 high-resolution non-redundant protein-chains with the aim of identifying packing rules at the amino acid pair level. The cube contains...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2011-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC3235099?pdf=render |
id |
doaj-c82eeee9b8ee49eb940d6f2bc819a4a5 |
---|---|
record_format |
Article |
spelling |
doaj-c82eeee9b8ee49eb940d6f2bc819a4a52020-11-25T01:32:36ZengPublic Library of Science (PLoS)PLoS ONE1932-62032011-01-01612e2563810.1371/journal.pone.0025638Hyperdimensional analysis of amino acid pair distributions in proteins.Svend B HenriksenRasmus J MortensenHenrik M Geertz-HansenMaria Teresa Neves-PetersenOmar ArnasonJón SöringSteffen B PetersenOur manuscript presents a novel approach to protein structure analyses. We have organized an 8-dimensional data cube with protein 3D-structural information from 8706 high-resolution non-redundant protein-chains with the aim of identifying packing rules at the amino acid pair level. The cube contains information about amino acid type, solvent accessibility, spatial and sequence distance, secondary structure and sequence length. We are able to pose structural queries to the data cube using program ProPack. The response is a 1, 2 or 3D graph. Whereas the response is of a statistical nature, the user can obtain an instant list of all PDB-structures where such pair is found. The user may select a particular structure, which is displayed highlighting the pair in question. The user may pose millions of different queries and for each one he will receive the answer in a few seconds. In order to demonstrate the capabilities of the data cube as well as the programs, we have selected well known structural features, disulphide bridges and salt bridges, where we illustrate how the queries are posed, and how answers are given. Motifs involving cysteines such as disulphide bridges, zinc-fingers and iron-sulfur clusters are clearly identified and differentiated. ProPack also reveals that whereas pairs of Lys residues virtually never appear in close spatial proximity, pairs of Arg are abundant and appear at close spatial distance, contrasting the belief that electrostatic repulsion would prevent this juxtaposition and that Arg-Lys is perceived as a conservative mutation. The presented programs can find and visualize novel packing preferences in proteins structures allowing the user to unravel correlations between pairs of amino acids. The new tools allow the user to view statistical information and visualize instantly the structures that underpin the statistical information, which is far from trivial with most other SW tools for protein structure analysis.http://europepmc.org/articles/PMC3235099?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Svend B Henriksen Rasmus J Mortensen Henrik M Geertz-Hansen Maria Teresa Neves-Petersen Omar Arnason Jón Söring Steffen B Petersen |
spellingShingle |
Svend B Henriksen Rasmus J Mortensen Henrik M Geertz-Hansen Maria Teresa Neves-Petersen Omar Arnason Jón Söring Steffen B Petersen Hyperdimensional analysis of amino acid pair distributions in proteins. PLoS ONE |
author_facet |
Svend B Henriksen Rasmus J Mortensen Henrik M Geertz-Hansen Maria Teresa Neves-Petersen Omar Arnason Jón Söring Steffen B Petersen |
author_sort |
Svend B Henriksen |
title |
Hyperdimensional analysis of amino acid pair distributions in proteins. |
title_short |
Hyperdimensional analysis of amino acid pair distributions in proteins. |
title_full |
Hyperdimensional analysis of amino acid pair distributions in proteins. |
title_fullStr |
Hyperdimensional analysis of amino acid pair distributions in proteins. |
title_full_unstemmed |
Hyperdimensional analysis of amino acid pair distributions in proteins. |
title_sort |
hyperdimensional analysis of amino acid pair distributions in proteins. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2011-01-01 |
description |
Our manuscript presents a novel approach to protein structure analyses. We have organized an 8-dimensional data cube with protein 3D-structural information from 8706 high-resolution non-redundant protein-chains with the aim of identifying packing rules at the amino acid pair level. The cube contains information about amino acid type, solvent accessibility, spatial and sequence distance, secondary structure and sequence length. We are able to pose structural queries to the data cube using program ProPack. The response is a 1, 2 or 3D graph. Whereas the response is of a statistical nature, the user can obtain an instant list of all PDB-structures where such pair is found. The user may select a particular structure, which is displayed highlighting the pair in question. The user may pose millions of different queries and for each one he will receive the answer in a few seconds. In order to demonstrate the capabilities of the data cube as well as the programs, we have selected well known structural features, disulphide bridges and salt bridges, where we illustrate how the queries are posed, and how answers are given. Motifs involving cysteines such as disulphide bridges, zinc-fingers and iron-sulfur clusters are clearly identified and differentiated. ProPack also reveals that whereas pairs of Lys residues virtually never appear in close spatial proximity, pairs of Arg are abundant and appear at close spatial distance, contrasting the belief that electrostatic repulsion would prevent this juxtaposition and that Arg-Lys is perceived as a conservative mutation. The presented programs can find and visualize novel packing preferences in proteins structures allowing the user to unravel correlations between pairs of amino acids. The new tools allow the user to view statistical information and visualize instantly the structures that underpin the statistical information, which is far from trivial with most other SW tools for protein structure analysis. |
url |
http://europepmc.org/articles/PMC3235099?pdf=render |
work_keys_str_mv |
AT svendbhenriksen hyperdimensionalanalysisofaminoacidpairdistributionsinproteins AT rasmusjmortensen hyperdimensionalanalysisofaminoacidpairdistributionsinproteins AT henrikmgeertzhansen hyperdimensionalanalysisofaminoacidpairdistributionsinproteins AT mariateresanevespetersen hyperdimensionalanalysisofaminoacidpairdistributionsinproteins AT omararnason hyperdimensionalanalysisofaminoacidpairdistributionsinproteins AT jonsoring hyperdimensionalanalysisofaminoacidpairdistributionsinproteins AT steffenbpetersen hyperdimensionalanalysisofaminoacidpairdistributionsinproteins |
_version_ |
1725081056016596992 |