Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure
<p>Abstract</p> <p>Background</p> <p>Amino acid sequence probability distributions, or profiles, have been used successfully to predict secondary structure and local structure in proteins. Profile models assume the statistical independence of each position in the sequen...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2008-10-01
|
Series: | BMC Bioinformatics |
Online Access: | http://www.biomedcentral.com/1471-2105/9/429 |
id |
doaj-4aecb1f6ca634b7cb26bea649235eb6f |
---|---|
record_format |
Article |
spelling |
doaj-4aecb1f6ca634b7cb26bea649235eb6f2020-11-25T00:46:36ZengBMCBMC Bioinformatics1471-21052008-10-019142910.1186/1471-2105-9-429Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structureWebb-Robertson Bobbie-JoBystroff Christopher<p>Abstract</p> <p>Background</p> <p>Amino acid sequence probability distributions, or profiles, have been used successfully to predict secondary structure and local structure in proteins. Profile models assume the statistical independence of each position in the sequence, but the energetics of protein folding is better captured in a scoring function that is based on pairwise interactions, like a force field.</p> <p>Results</p> <p>I-sites motifs are short sequence/structure motifs that populate the protein structure database due to energy-driven convergent evolution. Here we show that a pairwise covariant sequence model does not predict alpha helix or beta strand significantly better overall than a profile-based model, but it does improve the prediction of certain loop motifs. The finding is best explained by considering secondary structure profiles as multivariant, all-or-none models, which subsume covariant models. Pairwise covariance is nonetheless present and energetically rational. Examples of negative design are present, where the covariances disfavor non-native structures.</p> <p>Conclusion</p> <p>Measured pairwise covariances are shown to be statistically robust in cross-validation tests, as long as the amino acid alphabet is reduced to nine classes. An updated I-sites local structure motif library that provides sequence covariance information for all types of local structure in globular proteins and a web server for local structure prediction are available at <url>http://www.bioinfo.rpi.edu/bystrc/hmmstr/server.php</url>.</p> http://www.biomedcentral.com/1471-2105/9/429 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Webb-Robertson Bobbie-Jo Bystroff Christopher |
spellingShingle |
Webb-Robertson Bobbie-Jo Bystroff Christopher Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure BMC Bioinformatics |
author_facet |
Webb-Robertson Bobbie-Jo Bystroff Christopher |
author_sort |
Webb-Robertson Bobbie-Jo |
title |
Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure |
title_short |
Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure |
title_full |
Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure |
title_fullStr |
Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure |
title_full_unstemmed |
Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure |
title_sort |
pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2008-10-01 |
description |
<p>Abstract</p> <p>Background</p> <p>Amino acid sequence probability distributions, or profiles, have been used successfully to predict secondary structure and local structure in proteins. Profile models assume the statistical independence of each position in the sequence, but the energetics of protein folding is better captured in a scoring function that is based on pairwise interactions, like a force field.</p> <p>Results</p> <p>I-sites motifs are short sequence/structure motifs that populate the protein structure database due to energy-driven convergent evolution. Here we show that a pairwise covariant sequence model does not predict alpha helix or beta strand significantly better overall than a profile-based model, but it does improve the prediction of certain loop motifs. The finding is best explained by considering secondary structure profiles as multivariant, all-or-none models, which subsume covariant models. Pairwise covariance is nonetheless present and energetically rational. Examples of negative design are present, where the covariances disfavor non-native structures.</p> <p>Conclusion</p> <p>Measured pairwise covariances are shown to be statistically robust in cross-validation tests, as long as the amino acid alphabet is reduced to nine classes. An updated I-sites local structure motif library that provides sequence covariance information for all types of local structure in globular proteins and a web server for local structure prediction are available at <url>http://www.bioinfo.rpi.edu/bystrc/hmmstr/server.php</url>.</p> |
url |
http://www.biomedcentral.com/1471-2105/9/429 |
work_keys_str_mv |
AT webbrobertsonbobbiejo pairwisecovarianceaddslittletosecondarystructurepredictionbutimprovesthepredictionofnoncanonicallocalstructure AT bystroffchristopher pairwisecovarianceaddslittletosecondarystructurepredictionbutimprovesthepredictionofnoncanonicallocalstructure |
_version_ |
1725264249961316352 |