Prediction of Disulfide Connectivity Patterns in Proteins

碩士 === 國立臺灣大學 === 資訊工程學研究所 === 91 === In this study, we proposed a novel approach to effectively predict the protein disulfide connectivity pattern directly from its amino acid sequence. In protein structure prediction, the conformation space is extremely large. Constraints such as second...

Full description

Bibliographic Details
Main Authors:	Shih-Chieh Chen, 陳士傑
Other Authors:	Cheng-Yan Kao
Format:	Others
Language:	en_US
Published:	2003
Online Access:	http://ndltd.ncl.edu.tw/handle/78854969599296942141

id	ndltd-TW-091NTU00392060
record_format	oai_dc
spelling	ndltd-TW-091NTU003920602016-06-20T04:15:31Z http://ndltd.ncl.edu.tw/handle/78854969599296942141 Prediction of Disulfide Connectivity Patterns in Proteins 蛋白質雙硫鍵連接樣式預測 Shih-Chieh Chen 陳士傑碩士國立臺灣大學資訊工程學研究所 91 In this study, we proposed a novel approach to effectively predict the protein disulfide connectivity pattern directly from its amino acid sequence. In protein structure prediction, the conformation space is extremely large. Constraints such as secondary structure information and solvent accessibility of residues were applied to reduce the search space and the prediction accuracy could thus be improved. Disulfide bonds, the covalent linkages between two cysteines, are commonly found in extracellular proteins. The correct prediction of disulfide connectivity can strongly reduce the conformation space and may also be useful in predicting protein tertiary structure. Two steps were combined in our approach: 1) Trained a model to predict the bond potential for all pairs of cysteines from the training set; 2) For a given protein, the predicted bond potential was adopted to find the most possible disulfide connectivity pattern. In step 1 each pair of cysteines in the training set, whether formed disulfide bond or not, were fed into the Support Vector Machine to train the bond potential predictor. In step 2, for a target sequence, a weighted complete graph was constructed in which cysteines and the corresponding bond potentials were represented by vertices and the weights of edges, respectively. The Edmonds’ algorithm was applied to find the perfect matching with the maximal weight. According to the matching, a disulfide connectivity pattern was successfully obtained. A four-fold cross-validation procedure on a data set containing 452 proteins was performed in this study to validate the proposed approach. As a result, the proposed approach has an overall accuracy of 44.53%, which is better than that of previous works. In summary, the proposed method is promising to locate the disulfide bridges in proteins Cheng-Yan Kao 高成炎 2003 學位論文 ; thesis 43 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立臺灣大學 === 資訊工程學研究所 === 91 === In this study, we proposed a novel approach to effectively predict the protein disulfide connectivity pattern directly from its amino acid sequence. In protein structure prediction, the conformation space is extremely large. Constraints such as secondary structure information and solvent accessibility of residues were applied to reduce the search space and the prediction accuracy could thus be improved. Disulfide bonds, the covalent linkages between two cysteines, are commonly found in extracellular proteins. The correct prediction of disulfide connectivity can strongly reduce the conformation space and may also be useful in predicting protein tertiary structure. Two steps were combined in our approach: 1) Trained a model to predict the bond potential for all pairs of cysteines from the training set; 2) For a given protein, the predicted bond potential was adopted to find the most possible disulfide connectivity pattern. In step 1 each pair of cysteines in the training set, whether formed disulfide bond or not, were fed into the Support Vector Machine to train the bond potential predictor. In step 2, for a target sequence, a weighted complete graph was constructed in which cysteines and the corresponding bond potentials were represented by vertices and the weights of edges, respectively. The Edmonds’ algorithm was applied to find the perfect matching with the maximal weight. According to the matching, a disulfide connectivity pattern was successfully obtained. A four-fold cross-validation procedure on a data set containing 452 proteins was performed in this study to validate the proposed approach. As a result, the proposed approach has an overall accuracy of 44.53%, which is better than that of previous works. In summary, the proposed method is promising to locate the disulfide bridges in proteins
author2	Cheng-Yan Kao
author_facet	Cheng-Yan Kao Shih-Chieh Chen 陳士傑
author	Shih-Chieh Chen 陳士傑
spellingShingle	Shih-Chieh Chen 陳士傑 Prediction of Disulfide Connectivity Patterns in Proteins
author_sort	Shih-Chieh Chen
title	Prediction of Disulfide Connectivity Patterns in Proteins
title_short	Prediction of Disulfide Connectivity Patterns in Proteins
title_full	Prediction of Disulfide Connectivity Patterns in Proteins
title_fullStr	Prediction of Disulfide Connectivity Patterns in Proteins
title_full_unstemmed	Prediction of Disulfide Connectivity Patterns in Proteins
title_sort	prediction of disulfide connectivity patterns in proteins
publishDate	2003
url	http://ndltd.ncl.edu.tw/handle/78854969599296942141
work_keys_str_mv	AT shihchiehchen predictionofdisulfideconnectivitypatternsinproteins AT chénshìjié predictionofdisulfideconnectivitypatternsinproteins AT shihchiehchen dànbáizhìshuāngliújiànliánjiēyàngshìyùcè AT chénshìjié dànbáizhìshuāngliújiànliánjiēyàngshìyùcè
_version_	1718310306157953024

Prediction of Disulfide Connectivity Patterns in Proteins

Similar Items