Two-stage individual gene selection methods based on misclassification cost and accuracy

Master's === National Cheng Kung University === Institute of Information Management === 98 === With the continuous improvement and innovation of bioscience and medical science technologies, scientists have developed a series of techniques for gene microarray data to find the relations between diseases and genes. The number of instances in a microarray data set...


Bibliographic Details
Main Authors: Jui-Yang Hsu, 許瑞洋
Other Authors: Tzu-Tsung Wong
Format: Others
Language: zh-TW
Published: 2010
Online Access: http://ndltd.ncl.edu.tw/handle/16808233634669487228
id ndltd-TW-098NCKU5396009
record_format oai_dc
spelling ndltd-TW-098NCKU5396009 2015-10-13T18:26:17Z http://ndltd.ncl.edu.tw/handle/16808233634669487228 Two-stage individual gene selection methods based on misclassification cost and accuracy 同時考量分類錯誤成本及正確率之二階段個別基因選取法 Jui-Yang Hsu 許瑞洋 Master's National Cheng Kung University Institute of Information Management 98 With the continuous improvement and innovation of bioscience and medical science technologies, scientists have developed a series of techniques for gene microarray data to find the relations between diseases and genes. The number of instances in a microarray data set is far smaller than the number of genes in an instance, and many genes are irrelevant to a specific disease. Therefore, gene selection is essential for reducing the dimensionality of a microarray data set. The misclassification costs of different classes generally differ. A previous study performed cost-sensitive gene selection, but at the price of greatly reduced classification accuracy on microarray data. To compensate for this deficiency, this study considers both misclassification cost and prediction accuracy and proposes 18 two-stage individual gene selection methods for microarray data. The experimental results on eight microarray data sets show that the method adopting probability ranking for misclassification cost in the first stage and t-value ranking for prediction accuracy in the second stage performs best, as evaluated by the area under the cost curve and the area under the accuracy curve. Tzu-Tsung Wong 翁慈宗 2010 學位論文 ; thesis 55 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === National Cheng Kung University === Institute of Information Management === 98 === With the continuous improvement and innovation of bioscience and medical science technologies, scientists have developed a series of techniques for gene microarray data to find the relations between diseases and genes. The number of instances in a microarray data set is far smaller than the number of genes in an instance, and many genes are irrelevant to a specific disease. Therefore, gene selection is essential for reducing the dimensionality of a microarray data set. The misclassification costs of different classes generally differ. A previous study performed cost-sensitive gene selection, but at the price of greatly reduced classification accuracy on microarray data. To compensate for this deficiency, this study considers both misclassification cost and prediction accuracy and proposes 18 two-stage individual gene selection methods for microarray data. The experimental results on eight microarray data sets show that the method adopting probability ranking for misclassification cost in the first stage and t-value ranking for prediction accuracy in the second stage performs best, as evaluated by the area under the cost curve and the area under the accuracy curve.
author2 Tzu-Tsung Wong
author_facet Tzu-Tsung Wong
Jui-Yang Hsu
許瑞洋
author Jui-Yang Hsu
許瑞洋
spellingShingle Jui-Yang Hsu
許瑞洋
Two-stage individual gene selection methods based on misclassification cost and accuracy
author_sort Jui-Yang Hsu
title Two-stage individual gene selection methods based on misclassification cost and accuracy
title_short Two-stage individual gene selection methods based on misclassification cost and accuracy
title_full Two-stage individual gene selection methods based on misclassification cost and accuracy
title_fullStr Two-stage individual gene selection methods based on misclassification cost and accuracy
title_full_unstemmed Two-stage individual gene selection methods based on misclassification cost and accuracy
title_sort two-stage individual gene selection methods based on misclassification cost and accuracy
publishDate 2010
url http://ndltd.ncl.edu.tw/handle/16808233634669487228
work_keys_str_mv AT juiyanghsu twostageindividualgeneselectionmethodsbasedonmisclassificationcostandaccuracy
AT xǔruìyáng twostageindividualgeneselectionmethodsbasedonmisclassificationcostandaccuracy
AT juiyanghsu tóngshíkǎoliàngfēnlèicuòwùchéngběnjízhèngquèlǜzhīèrjiēduàngèbiéjīyīnxuǎnqǔfǎ
AT xǔruìyáng tóngshíkǎoliàngfēnlèicuòwùchéngběnjízhèngquèlǜzhīèrjiēduàngèbiéjīyīnxuǎnqǔfǎ
_version_ 1718033546851909632
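
The two-stage idea in the abstract (filter genes by a cost-oriented ranking first, then rank the survivors by t-value) can be illustrated with a minimal Python sketch. This is not the thesis's implementation: the record does not define its "probability ranking", so the stage-1 score below (`min_threshold_cost`, the lowest total misclassification cost a single-gene threshold rule can reach) is a hypothetical stand-in, and the function names, the parameters `k1` and `k2`, and the default costs are all assumptions made for illustration. Stage 2 uses Welch's t statistic, which may differ from the t-value used in the thesis.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(a, b):
    """Welch's t statistic for two samples (each needs at least two values)."""
    se = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / se if se > 0 else 0.0


def min_threshold_cost(values, labels, cost_fn, cost_fp):
    """Hypothetical stage-1 score (an assumption, not the thesis's ranking):
    lowest total misclassification cost of any one-gene threshold rule."""
    best = float("inf")
    # -inf covers the two trivial rules "predict all 1" and "predict all 0".
    for t in [float("-inf")] + sorted(set(values)):
        for sign in (1, -1):  # try both directions of the rule
            cost = 0.0
            for v, y in zip(values, labels):
                pred = 1 if sign * (v - t) > 0 else 0
                if y == 1 and pred == 0:
                    cost += cost_fn  # missed positive
                elif y == 0 and pred == 1:
                    cost += cost_fp  # false alarm
            best = min(best, cost)
    return best


def two_stage_select(X, labels, k1, k2, cost_fn=5.0, cost_fp=1.0):
    """Stage 1: keep the k1 genes with the lowest cost score.
    Stage 2: return the k2 of those with the largest |t|."""
    n_genes = len(X[0])
    columns = [[row[g] for row in X] for g in range(n_genes)]

    costs = [min_threshold_cost(col, labels, cost_fn, cost_fp) for col in columns]
    stage1 = sorted(range(n_genes), key=lambda g: costs[g])[:k1]

    def abs_t(g):
        pos = [v for v, y in zip(columns[g], labels) if y == 1]
        neg = [v for v, y in zip(columns[g], labels) if y == 0]
        return abs(welch_t(pos, neg))

    return sorted(stage1, key=abs_t, reverse=True)[:k2]
```

Here `X` is a list of instances (rows) with one expression value per gene, and `labels` holds 0/1 class labels with at least two instances per class. Calling `two_stage_select(X, labels, k1, k2)` returns the indices of the selected genes, best first.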