Hyperdimensional analysis of amino acid pair distributions in proteins.

Our manuscript presents a novel approach to protein structure analyses. We have organized an 8-dimensional data cube with protein 3D-structural information from 8706 high-resolution non-redundant protein-chains with the aim of identifying packing rules at the amino acid pair level. The cube contains...

Full description

Bibliographic Details
Main Authors: Svend B Henriksen, Rasmus J Mortensen, Henrik M Geertz-Hansen, Maria Teresa Neves-Petersen, Omar Arnason, Jón Söring, Steffen B Petersen
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2011-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3235099?pdf=render
id doaj-c82eeee9b8ee49eb940d6f2bc819a4a5
record_format Article
spelling doaj-c82eeee9b8ee49eb940d6f2bc819a4a52020-11-25T01:32:36ZengPublic Library of Science (PLoS)PLoS ONE1932-62032011-01-01612e2563810.1371/journal.pone.0025638Hyperdimensional analysis of amino acid pair distributions in proteins.Svend B HenriksenRasmus J MortensenHenrik M Geertz-HansenMaria Teresa Neves-PetersenOmar ArnasonJón SöringSteffen B PetersenOur manuscript presents a novel approach to protein structure analyses. We have organized an 8-dimensional data cube with protein 3D-structural information from 8706 high-resolution non-redundant protein-chains with the aim of identifying packing rules at the amino acid pair level. The cube contains information about amino acid type, solvent accessibility, spatial and sequence distance, secondary structure and sequence length. We are able to pose structural queries to the data cube using program ProPack. The response is a 1, 2 or 3D graph. Whereas the response is of a statistical nature, the user can obtain an instant list of all PDB-structures where such pair is found. The user may select a particular structure, which is displayed highlighting the pair in question. The user may pose millions of different queries and for each one he will receive the answer in a few seconds. In order to demonstrate the capabilities of the data cube as well as the programs, we have selected well known structural features, disulphide bridges and salt bridges, where we illustrate how the queries are posed, and how answers are given. Motifs involving cysteines such as disulphide bridges, zinc-fingers and iron-sulfur clusters are clearly identified and differentiated. ProPack also reveals that whereas pairs of Lys residues virtually never appear in close spatial proximity, pairs of Arg are abundant and appear at close spatial distance, contrasting the belief that electrostatic repulsion would prevent this juxtaposition and that Arg-Lys is perceived as a conservative mutation. The presented programs can find and visualize novel packing preferences in proteins structures allowing the user to unravel correlations between pairs of amino acids. The new tools allow the user to view statistical information and visualize instantly the structures that underpin the statistical information, which is far from trivial with most other SW tools for protein structure analysis.http://europepmc.org/articles/PMC3235099?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Svend B Henriksen
Rasmus J Mortensen
Henrik M Geertz-Hansen
Maria Teresa Neves-Petersen
Omar Arnason
Jón Söring
Steffen B Petersen
spellingShingle Svend B Henriksen
Rasmus J Mortensen
Henrik M Geertz-Hansen
Maria Teresa Neves-Petersen
Omar Arnason
Jón Söring
Steffen B Petersen
Hyperdimensional analysis of amino acid pair distributions in proteins.
PLoS ONE
author_facet Svend B Henriksen
Rasmus J Mortensen
Henrik M Geertz-Hansen
Maria Teresa Neves-Petersen
Omar Arnason
Jón Söring
Steffen B Petersen
author_sort Svend B Henriksen
title Hyperdimensional analysis of amino acid pair distributions in proteins.
title_short Hyperdimensional analysis of amino acid pair distributions in proteins.
title_full Hyperdimensional analysis of amino acid pair distributions in proteins.
title_fullStr Hyperdimensional analysis of amino acid pair distributions in proteins.
title_full_unstemmed Hyperdimensional analysis of amino acid pair distributions in proteins.
title_sort hyperdimensional analysis of amino acid pair distributions in proteins.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2011-01-01
description Our manuscript presents a novel approach to protein structure analyses. We have organized an 8-dimensional data cube with protein 3D-structural information from 8706 high-resolution non-redundant protein-chains with the aim of identifying packing rules at the amino acid pair level. The cube contains information about amino acid type, solvent accessibility, spatial and sequence distance, secondary structure and sequence length. We are able to pose structural queries to the data cube using program ProPack. The response is a 1, 2 or 3D graph. Whereas the response is of a statistical nature, the user can obtain an instant list of all PDB-structures where such pair is found. The user may select a particular structure, which is displayed highlighting the pair in question. The user may pose millions of different queries and for each one he will receive the answer in a few seconds. In order to demonstrate the capabilities of the data cube as well as the programs, we have selected well known structural features, disulphide bridges and salt bridges, where we illustrate how the queries are posed, and how answers are given. Motifs involving cysteines such as disulphide bridges, zinc-fingers and iron-sulfur clusters are clearly identified and differentiated. ProPack also reveals that whereas pairs of Lys residues virtually never appear in close spatial proximity, pairs of Arg are abundant and appear at close spatial distance, contrasting the belief that electrostatic repulsion would prevent this juxtaposition and that Arg-Lys is perceived as a conservative mutation. The presented programs can find and visualize novel packing preferences in proteins structures allowing the user to unravel correlations between pairs of amino acids. The new tools allow the user to view statistical information and visualize instantly the structures that underpin the statistical information, which is far from trivial with most other SW tools for protein structure analysis.
url http://europepmc.org/articles/PMC3235099?pdf=render
work_keys_str_mv AT svendbhenriksen hyperdimensionalanalysisofaminoacidpairdistributionsinproteins
AT rasmusjmortensen hyperdimensionalanalysisofaminoacidpairdistributionsinproteins
AT henrikmgeertzhansen hyperdimensionalanalysisofaminoacidpairdistributionsinproteins
AT mariateresanevespetersen hyperdimensionalanalysisofaminoacidpairdistributionsinproteins
AT omararnason hyperdimensionalanalysisofaminoacidpairdistributionsinproteins
AT jonsoring hyperdimensionalanalysisofaminoacidpairdistributionsinproteins
AT steffenbpetersen hyperdimensionalanalysisofaminoacidpairdistributionsinproteins
_version_ 1725081056016596992