Prediction of Disulfide Connectivity Patterns in Proteins

碩士 === 國立臺灣大學 === 資訊工程學研究所 === 91 === In this study, we proposed a novel approach to effectively predict the protein disulfide connectivity pattern directly from its amino acid sequence. In protein structure prediction, the conformation space is extremely large. Constraints such as second...

Full description

Bibliographic Details
Main Authors: Shih-Chieh Chen, 陳士傑
Other Authors: Cheng-Yan Kao
Format: Others
Language:en_US
Published: 2003
Online Access:http://ndltd.ncl.edu.tw/handle/78854969599296942141
id ndltd-TW-091NTU00392060
record_format oai_dc
spelling ndltd-TW-091NTU003920602016-06-20T04:15:31Z http://ndltd.ncl.edu.tw/handle/78854969599296942141 Prediction of Disulfide Connectivity Patterns in Proteins 蛋白質雙硫鍵連接樣式預測 Shih-Chieh Chen 陳士傑 碩士 國立臺灣大學 資訊工程學研究所 91 In this study, we proposed a novel approach to effectively predict the protein disulfide connectivity pattern directly from its amino acid sequence. In protein structure prediction, the conformation space is extremely large. Constraints such as secondary structure information and solvent accessibility of residues were applied to reduce the search space and the prediction accuracy could thus be improved. Disulfide bonds, the covalent linkages between two cysteines, are commonly found in extracellular proteins. The correct prediction of disulfide connectivity can strongly reduce the conformation space and may also be useful in predicting protein tertiary structure. Two steps were combined in our approach: 1) Trained a model to predict the bond potential for all pairs of cysteines from the training set; 2) For a given protein, the predicted bond potential was adopted to find the most possible disulfide connectivity pattern. In step 1 each pair of cysteines in the training set, whether formed disulfide bond or not, were fed into the Support Vector Machine to train the bond potential predictor. In step 2, for a target sequence, a weighted complete graph was constructed in which cysteines and the corresponding bond potentials were represented by vertices and the weights of edges, respectively. The Edmonds’ algorithm was applied to find the perfect matching with the maximal weight. According to the matching, a disulfide connectivity pattern was successfully obtained. A four-fold cross-validation procedure on a data set containing 452 proteins was performed in this study to validate the proposed approach. As a result, the proposed approach has an overall accuracy of 44.53%, which is better than that of previous works. In summary, the proposed method is promising to locate the disulfide bridges in proteins Cheng-Yan Kao 高成炎 2003 學位論文 ; thesis 43 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣大學 === 資訊工程學研究所 === 91 === In this study, we proposed a novel approach to effectively predict the protein disulfide connectivity pattern directly from its amino acid sequence. In protein structure prediction, the conformation space is extremely large. Constraints such as secondary structure information and solvent accessibility of residues were applied to reduce the search space and the prediction accuracy could thus be improved. Disulfide bonds, the covalent linkages between two cysteines, are commonly found in extracellular proteins. The correct prediction of disulfide connectivity can strongly reduce the conformation space and may also be useful in predicting protein tertiary structure. Two steps were combined in our approach: 1) Trained a model to predict the bond potential for all pairs of cysteines from the training set; 2) For a given protein, the predicted bond potential was adopted to find the most possible disulfide connectivity pattern. In step 1 each pair of cysteines in the training set, whether formed disulfide bond or not, were fed into the Support Vector Machine to train the bond potential predictor. In step 2, for a target sequence, a weighted complete graph was constructed in which cysteines and the corresponding bond potentials were represented by vertices and the weights of edges, respectively. The Edmonds’ algorithm was applied to find the perfect matching with the maximal weight. According to the matching, a disulfide connectivity pattern was successfully obtained. A four-fold cross-validation procedure on a data set containing 452 proteins was performed in this study to validate the proposed approach. As a result, the proposed approach has an overall accuracy of 44.53%, which is better than that of previous works. In summary, the proposed method is promising to locate the disulfide bridges in proteins
author2 Cheng-Yan Kao
author_facet Cheng-Yan Kao
Shih-Chieh Chen
陳士傑
author Shih-Chieh Chen
陳士傑
spellingShingle Shih-Chieh Chen
陳士傑
Prediction of Disulfide Connectivity Patterns in Proteins
author_sort Shih-Chieh Chen
title Prediction of Disulfide Connectivity Patterns in Proteins
title_short Prediction of Disulfide Connectivity Patterns in Proteins
title_full Prediction of Disulfide Connectivity Patterns in Proteins
title_fullStr Prediction of Disulfide Connectivity Patterns in Proteins
title_full_unstemmed Prediction of Disulfide Connectivity Patterns in Proteins
title_sort prediction of disulfide connectivity patterns in proteins
publishDate 2003
url http://ndltd.ncl.edu.tw/handle/78854969599296942141
work_keys_str_mv AT shihchiehchen predictionofdisulfideconnectivitypatternsinproteins
AT chénshìjié predictionofdisulfideconnectivitypatternsinproteins
AT shihchiehchen dànbáizhìshuāngliújiànliánjiēyàngshìyùcè
AT chénshìjié dànbáizhìshuāngliújiànliánjiēyàngshìyùcè
_version_ 1718310306157953024