Protein-Protein Recognition Classification by Genetic Programming

碩士 === 國立嘉義大學 === 資訊工程學系研究所 === 98 === With the development of bioinformatics, discovering amino acid patterns on protein binding sites has recently become a popular issue. A protein interacts with another protein through the binding sites, which contain a lot of information about physicochemical pr...

Full description

Bibliographic Details
Main Authors: Kuan-Yu Su, 蘇冠宇
Other Authors: Huang-Cheng Kuo
Format: Others
Language:en_US
Published: 2010
Online Access:http://ndltd.ncl.edu.tw/handle/46469651675379281342
id ndltd-TW-098NCYU5392016
record_format oai_dc
spelling ndltd-TW-098NCYU53920162015-10-13T18:35:12Z http://ndltd.ncl.edu.tw/handle/46469651675379281342 Protein-Protein Recognition Classification by Genetic Programming 以基因規劃法預測蛋白質辨識分類 Kuan-Yu Su 蘇冠宇 碩士 國立嘉義大學 資訊工程學系研究所 98 With the development of bioinformatics, discovering amino acid patterns on protein binding sites has recently become a popular issue. A protein interacts with another protein through the binding sites, which contain a lot of information about physicochemical properties. And most of properties are obtained from the composition of amino acids. Hence, the compositions of amino acids or characteristics of lead proteins interacting with each other, is what we are curious about. Protein-protein interaction represents the relationship of proteins. The interaction network can reflect which proteins belong to what kind of functions and roles. Among the interactions, there is an interaction case is if a protein interacts with another protein transiently and they will separate, the interaction is called protein-protein recognition or a transient protein complex. Genetic programming is a prominent technique of evolutionary computation. It mimics the evolution mechanism of biological environment to determine optimal solutions. Classification problems play an important role in the development of knowledge engineering. Thus, many machine learning algorithms have arisen to solve such problems. In this thesis, we focus on a proposed genetic programming method for feature selection and feature construction and combine SVM and Neural Network to solve classification problem on protein-protein recognition. Experimental results show that the proposed methods are accurate and effective. The experiment shows an acceptable prediction of recognition proteins with an average accuracy of 80% with ten-fold cross validation. We used that the constructed features can significantly improve the prediction accuracy in SVM and Neural Network. This satisfies the biologist’s efforts by saving time and money. Huang-Cheng Kuo 郭煌政 2010 學位論文 ; thesis 67 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立嘉義大學 === 資訊工程學系研究所 === 98 === With the development of bioinformatics, discovering amino acid patterns on protein binding sites has recently become a popular issue. A protein interacts with another protein through the binding sites, which contain a lot of information about physicochemical properties. And most of properties are obtained from the composition of amino acids. Hence, the compositions of amino acids or characteristics of lead proteins interacting with each other, is what we are curious about. Protein-protein interaction represents the relationship of proteins. The interaction network can reflect which proteins belong to what kind of functions and roles. Among the interactions, there is an interaction case is if a protein interacts with another protein transiently and they will separate, the interaction is called protein-protein recognition or a transient protein complex. Genetic programming is a prominent technique of evolutionary computation. It mimics the evolution mechanism of biological environment to determine optimal solutions. Classification problems play an important role in the development of knowledge engineering. Thus, many machine learning algorithms have arisen to solve such problems. In this thesis, we focus on a proposed genetic programming method for feature selection and feature construction and combine SVM and Neural Network to solve classification problem on protein-protein recognition. Experimental results show that the proposed methods are accurate and effective. The experiment shows an acceptable prediction of recognition proteins with an average accuracy of 80% with ten-fold cross validation. We used that the constructed features can significantly improve the prediction accuracy in SVM and Neural Network. This satisfies the biologist’s efforts by saving time and money.
author2 Huang-Cheng Kuo
author_facet Huang-Cheng Kuo
Kuan-Yu Su
蘇冠宇
author Kuan-Yu Su
蘇冠宇
spellingShingle Kuan-Yu Su
蘇冠宇
Protein-Protein Recognition Classification by Genetic Programming
author_sort Kuan-Yu Su
title Protein-Protein Recognition Classification by Genetic Programming
title_short Protein-Protein Recognition Classification by Genetic Programming
title_full Protein-Protein Recognition Classification by Genetic Programming
title_fullStr Protein-Protein Recognition Classification by Genetic Programming
title_full_unstemmed Protein-Protein Recognition Classification by Genetic Programming
title_sort protein-protein recognition classification by genetic programming
publishDate 2010
url http://ndltd.ncl.edu.tw/handle/46469651675379281342
work_keys_str_mv AT kuanyusu proteinproteinrecognitionclassificationbygeneticprogramming
AT sūguānyǔ proteinproteinrecognitionclassificationbygeneticprogramming
AT kuanyusu yǐjīyīnguīhuàfǎyùcèdànbáizhìbiànshífēnlèi
AT sūguānyǔ yǐjīyīnguīhuàfǎyùcèdànbáizhìbiànshífēnlèi
_version_ 1718034694632636416