The relationship between protein sequences and their gene ontology functions
<p>Abstract</p> <p>Background</p> <p>One main research challenge in the post-genomic era is to understand the relationship between protein sequences and their biological functions. In recent years, several automated annotation systems have been developed for the functio...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2006-12-01
|
Series: | BMC Bioinformatics |
id |
doaj-937bcc0d9c554de99c01e21cc2ddee77 |
---|---|
record_format |
Article |
spelling |
doaj-937bcc0d9c554de99c01e21cc2ddee772020-11-24T23:33:00ZengBMCBMC Bioinformatics1471-21052006-12-017Suppl 4S1110.1186/1471-2105-7-S4-S11The relationship between protein sequences and their gene ontology functionsPerez Dianne MReichel LotharHughes BrentDuan Zhong-HuiShi Ting<p>Abstract</p> <p>Background</p> <p>One main research challenge in the post-genomic era is to understand the relationship between protein sequences and their biological functions. In recent years, several automated annotation systems have been developed for the functional assignment of uncharacterized proteins. The underlying assumption of these systems is that similar sequences imply similar biological functions. However, it has been noted that matching sequences do not always infer similar functions.</p> <p>Results</p> <p>In this paper, we present the correlation between protein sequences and protein functions for the yeast proteome in the context of gene ontology. A novel measure is introduced to define the overall similarity between two protein sequences. The effects of the level as well as the size of a gene ontology group on the degree of similarity were studied. The similarity distributions at different levels of gene ontology trees are presented. To evaluate the theoretical prediction power of similar sequences, we computed the posterior probability of correct predictions.</p> <p>Conclusion</p> <p>The results indicate that protein pairs of similar biological functions tend to have higher sequence similarity, although the similarity distribution in each functional group is heterogeneous and varies from group to group. We conclude that sequence similarity can serve as a key measure in protein function prediction. However, the resulting annotations must be verified through other means. A method that combines a broader range of measures is more likely to provide more accurate prediction. Our study indicates that the posterior probability of a correct prediction could serve as one of the key measures.</p> |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Perez Dianne M Reichel Lothar Hughes Brent Duan Zhong-Hui Shi Ting |
spellingShingle |
Perez Dianne M Reichel Lothar Hughes Brent Duan Zhong-Hui Shi Ting The relationship between protein sequences and their gene ontology functions BMC Bioinformatics |
author_facet |
Perez Dianne M Reichel Lothar Hughes Brent Duan Zhong-Hui Shi Ting |
author_sort |
Perez Dianne M |
title |
The relationship between protein sequences and their gene ontology functions |
title_short |
The relationship between protein sequences and their gene ontology functions |
title_full |
The relationship between protein sequences and their gene ontology functions |
title_fullStr |
The relationship between protein sequences and their gene ontology functions |
title_full_unstemmed |
The relationship between protein sequences and their gene ontology functions |
title_sort |
relationship between protein sequences and their gene ontology functions |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2006-12-01 |
description |
<p>Abstract</p> <p>Background</p> <p>One main research challenge in the post-genomic era is to understand the relationship between protein sequences and their biological functions. In recent years, several automated annotation systems have been developed for the functional assignment of uncharacterized proteins. The underlying assumption of these systems is that similar sequences imply similar biological functions. However, it has been noted that matching sequences do not always infer similar functions.</p> <p>Results</p> <p>In this paper, we present the correlation between protein sequences and protein functions for the yeast proteome in the context of gene ontology. A novel measure is introduced to define the overall similarity between two protein sequences. The effects of the level as well as the size of a gene ontology group on the degree of similarity were studied. The similarity distributions at different levels of gene ontology trees are presented. To evaluate the theoretical prediction power of similar sequences, we computed the posterior probability of correct predictions.</p> <p>Conclusion</p> <p>The results indicate that protein pairs of similar biological functions tend to have higher sequence similarity, although the similarity distribution in each functional group is heterogeneous and varies from group to group. We conclude that sequence similarity can serve as a key measure in protein function prediction. However, the resulting annotations must be verified through other means. A method that combines a broader range of measures is more likely to provide more accurate prediction. Our study indicates that the posterior probability of a correct prediction could serve as one of the key measures.</p> |
work_keys_str_mv |
AT perezdiannem therelationshipbetweenproteinsequencesandtheirgeneontologyfunctions AT reichellothar therelationshipbetweenproteinsequencesandtheirgeneontologyfunctions AT hughesbrent therelationshipbetweenproteinsequencesandtheirgeneontologyfunctions AT duanzhonghui therelationshipbetweenproteinsequencesandtheirgeneontologyfunctions AT shiting therelationshipbetweenproteinsequencesandtheirgeneontologyfunctions AT perezdiannem relationshipbetweenproteinsequencesandtheirgeneontologyfunctions AT reichellothar relationshipbetweenproteinsequencesandtheirgeneontologyfunctions AT hughesbrent relationshipbetweenproteinsequencesandtheirgeneontologyfunctions AT duanzhonghui relationshipbetweenproteinsequencesandtheirgeneontologyfunctions AT shiting relationshipbetweenproteinsequencesandtheirgeneontologyfunctions |
_version_ |
1725532328334196736 |