Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure

<p>Abstract</p> <p>Background</p> <p>Amino acid sequence probability distributions, or profiles, have been used successfully to predict secondary structure and local structure in proteins. Profile models assume the statistical independence of each position in the sequen...

Full description

Bibliographic Details
Main Authors: Webb-Robertson Bobbie-Jo, Bystroff Christopher
Format: Article
Language:English
Published: BMC 2008-10-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/9/429
id doaj-4aecb1f6ca634b7cb26bea649235eb6f
record_format Article
spelling doaj-4aecb1f6ca634b7cb26bea649235eb6f2020-11-25T00:46:36ZengBMCBMC Bioinformatics1471-21052008-10-019142910.1186/1471-2105-9-429Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structureWebb-Robertson Bobbie-JoBystroff Christopher<p>Abstract</p> <p>Background</p> <p>Amino acid sequence probability distributions, or profiles, have been used successfully to predict secondary structure and local structure in proteins. Profile models assume the statistical independence of each position in the sequence, but the energetics of protein folding is better captured in a scoring function that is based on pairwise interactions, like a force field.</p> <p>Results</p> <p>I-sites motifs are short sequence/structure motifs that populate the protein structure database due to energy-driven convergent evolution. Here we show that a pairwise covariant sequence model does not predict alpha helix or beta strand significantly better overall than a profile-based model, but it does improve the prediction of certain loop motifs. The finding is best explained by considering secondary structure profiles as multivariant, all-or-none models, which subsume covariant models. Pairwise covariance is nonetheless present and energetically rational. Examples of negative design are present, where the covariances disfavor non-native structures.</p> <p>Conclusion</p> <p>Measured pairwise covariances are shown to be statistically robust in cross-validation tests, as long as the amino acid alphabet is reduced to nine classes. An updated I-sites local structure motif library that provides sequence covariance information for all types of local structure in globular proteins and a web server for local structure prediction are available at <url>http://www.bioinfo.rpi.edu/bystrc/hmmstr/server.php</url>.</p> http://www.biomedcentral.com/1471-2105/9/429
collection DOAJ
language English
format Article
sources DOAJ
author Webb-Robertson Bobbie-Jo
Bystroff Christopher
spellingShingle Webb-Robertson Bobbie-Jo
Bystroff Christopher
Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure
BMC Bioinformatics
author_facet Webb-Robertson Bobbie-Jo
Bystroff Christopher
author_sort Webb-Robertson Bobbie-Jo
title Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure
title_short Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure
title_full Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure
title_fullStr Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure
title_full_unstemmed Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure
title_sort pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2008-10-01
description <p>Abstract</p> <p>Background</p> <p>Amino acid sequence probability distributions, or profiles, have been used successfully to predict secondary structure and local structure in proteins. Profile models assume the statistical independence of each position in the sequence, but the energetics of protein folding is better captured in a scoring function that is based on pairwise interactions, like a force field.</p> <p>Results</p> <p>I-sites motifs are short sequence/structure motifs that populate the protein structure database due to energy-driven convergent evolution. Here we show that a pairwise covariant sequence model does not predict alpha helix or beta strand significantly better overall than a profile-based model, but it does improve the prediction of certain loop motifs. The finding is best explained by considering secondary structure profiles as multivariant, all-or-none models, which subsume covariant models. Pairwise covariance is nonetheless present and energetically rational. Examples of negative design are present, where the covariances disfavor non-native structures.</p> <p>Conclusion</p> <p>Measured pairwise covariances are shown to be statistically robust in cross-validation tests, as long as the amino acid alphabet is reduced to nine classes. An updated I-sites local structure motif library that provides sequence covariance information for all types of local structure in globular proteins and a web server for local structure prediction are available at <url>http://www.bioinfo.rpi.edu/bystrc/hmmstr/server.php</url>.</p>
url http://www.biomedcentral.com/1471-2105/9/429
work_keys_str_mv AT webbrobertsonbobbiejo pairwisecovarianceaddslittletosecondarystructurepredictionbutimprovesthepredictionofnoncanonicallocalstructure
AT bystroffchristopher pairwisecovarianceaddslittletosecondarystructurepredictionbutimprovesthepredictionofnoncanonicallocalstructure
_version_ 1725264249961316352