Support vector machines to predict DNA-binding proteins

碩士 === 中華大學 === 資訊工程學系碩士班 === 92 === In this work, we have developed a prototype tool for binary prediction of DNA-binding proteins. We combine segmentation of protein sequences using Poisson generator and extraction of distance features which are used as input for classification by support vector m...

Full description

Bibliographic Details
Main Author: 劉家輝
Other Authors: 吳哲賢
Format: Others
Language:zh-TW
Published: 2004
Online Access:http://ndltd.ncl.edu.tw/handle/89198735918439732887
id ndltd-TW-092CHPI0392011
record_format oai_dc
spelling ndltd-TW-092CHPI03920112016-01-04T04:08:39Z http://ndltd.ncl.edu.tw/handle/89198735918439732887 Support vector machines to predict DNA-binding proteins 支援向量機器理論預測DNA結合蛋白 劉家輝 碩士 中華大學 資訊工程學系碩士班 92 In this work, we have developed a prototype tool for binary prediction of DNA-binding proteins. We combine segmentation of protein sequences using Poisson generator and extraction of distance features which are used as input for classification by support vector machine (SVM) which is first shaped by the training set to predict the DNA-binding proteins. To extract the distance features, 40-dimensional vectors together with the limited range correlation of charge, hydrophobic, and accessible surface area are used as input to represent the amino acid composition, and each of the tools predicts whether the protein belongs to DNA-binding proteins or Non-binding proteins. The measures have facilitated the success of learning and prediction that the results have consistently achieved approximate 78.4% accuracy. 吳哲賢 2004 學位論文 ; thesis 0 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 中華大學 === 資訊工程學系碩士班 === 92 === In this work, we have developed a prototype tool for binary prediction of DNA-binding proteins. We combine segmentation of protein sequences using Poisson generator and extraction of distance features which are used as input for classification by support vector machine (SVM) which is first shaped by the training set to predict the DNA-binding proteins. To extract the distance features, 40-dimensional vectors together with the limited range correlation of charge, hydrophobic, and accessible surface area are used as input to represent the amino acid composition, and each of the tools predicts whether the protein belongs to DNA-binding proteins or Non-binding proteins. The measures have facilitated the success of learning and prediction that the results have consistently achieved approximate 78.4% accuracy.
author2 吳哲賢
author_facet 吳哲賢
劉家輝
author 劉家輝
spellingShingle 劉家輝
Support vector machines to predict DNA-binding proteins
author_sort 劉家輝
title Support vector machines to predict DNA-binding proteins
title_short Support vector machines to predict DNA-binding proteins
title_full Support vector machines to predict DNA-binding proteins
title_fullStr Support vector machines to predict DNA-binding proteins
title_full_unstemmed Support vector machines to predict DNA-binding proteins
title_sort support vector machines to predict dna-binding proteins
publishDate 2004
url http://ndltd.ncl.edu.tw/handle/89198735918439732887
work_keys_str_mv AT liújiāhuī supportvectormachinestopredictdnabindingproteins
AT liújiāhuī zhīyuánxiàngliàngjīqìlǐlùnyùcèdnajiéhédànbái
_version_ 1718158823226605568