Mining Gene Expression Data for Identifying Cancer Subtypes via Supervised Clustering

碩士 === 長庚大學 === 資訊管理研究所 === 93 === With the successful decoding of human genome, it was proved that many diseases such as cancers are closely related the expression of genes. Therefore, how to increase the survival rate of cancer patients by taking advantage of gene expression data becomes one of ur...

Full description

Bibliographic Details
Main Authors: Wen-Tzu Lee, 李文慈
Other Authors: 陳春賢
Format: Others
Language:zh-TW
Published: 2005
Online Access:http://ndltd.ncl.edu.tw/handle/35345797152213077210
id ndltd-TW-093CGU00396032
record_format oai_dc
spelling ndltd-TW-093CGU003960322015-10-13T15:29:17Z http://ndltd.ncl.edu.tw/handle/35345797152213077210 Mining Gene Expression Data for Identifying Cancer Subtypes via Supervised Clustering 以監督式叢集分析方式使用基因表現資料尋找癌症子型 Wen-Tzu Lee 李文慈 碩士 長庚大學 資訊管理研究所 93 With the successful decoding of human genome, it was proved that many diseases such as cancers are closely related the expression of genes. Therefore, how to increase the survival rate of cancer patients by taking advantage of gene expression data becomes one of urgent research topics. Data mining is a very popular technique for the analysis of large amount of gene expression data. Data mining technique can be used to extract potentially useful information or knowledge from complicated data sets. In this thesis, a supervised clustering method is proposed to mine gene expression data of cancer patients for the analysis of cancer subtypes. Each subtype is represented by a cluster of gene expression samples from patients of certain cancer. The clusters are further verified by using various cluster validity indices. In the verification, the optimal clustering result is derived by the ensemble of the various cluster validity indices. The proposed ensemble approach is validated by using a simulation data initially. Then gene expression data of leukemia patients are used to find the subtypes of leukemia. This thesis proposes a method for the identification of gene expression profiles of cancer subtypes. The identified subtypes of certain cancer can serve as important information to help physicians treat patients of the cancer patients affectively and appropriately. 陳春賢 2005 學位論文 ; thesis 78 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 長庚大學 === 資訊管理研究所 === 93 === With the successful decoding of human genome, it was proved that many diseases such as cancers are closely related the expression of genes. Therefore, how to increase the survival rate of cancer patients by taking advantage of gene expression data becomes one of urgent research topics. Data mining is a very popular technique for the analysis of large amount of gene expression data. Data mining technique can be used to extract potentially useful information or knowledge from complicated data sets. In this thesis, a supervised clustering method is proposed to mine gene expression data of cancer patients for the analysis of cancer subtypes. Each subtype is represented by a cluster of gene expression samples from patients of certain cancer. The clusters are further verified by using various cluster validity indices. In the verification, the optimal clustering result is derived by the ensemble of the various cluster validity indices. The proposed ensemble approach is validated by using a simulation data initially. Then gene expression data of leukemia patients are used to find the subtypes of leukemia. This thesis proposes a method for the identification of gene expression profiles of cancer subtypes. The identified subtypes of certain cancer can serve as important information to help physicians treat patients of the cancer patients affectively and appropriately.
author2 陳春賢
author_facet 陳春賢
Wen-Tzu Lee
李文慈
author Wen-Tzu Lee
李文慈
spellingShingle Wen-Tzu Lee
李文慈
Mining Gene Expression Data for Identifying Cancer Subtypes via Supervised Clustering
author_sort Wen-Tzu Lee
title Mining Gene Expression Data for Identifying Cancer Subtypes via Supervised Clustering
title_short Mining Gene Expression Data for Identifying Cancer Subtypes via Supervised Clustering
title_full Mining Gene Expression Data for Identifying Cancer Subtypes via Supervised Clustering
title_fullStr Mining Gene Expression Data for Identifying Cancer Subtypes via Supervised Clustering
title_full_unstemmed Mining Gene Expression Data for Identifying Cancer Subtypes via Supervised Clustering
title_sort mining gene expression data for identifying cancer subtypes via supervised clustering
publishDate 2005
url http://ndltd.ncl.edu.tw/handle/35345797152213077210
work_keys_str_mv AT wentzulee mininggeneexpressiondataforidentifyingcancersubtypesviasupervisedclustering
AT lǐwéncí mininggeneexpressiondataforidentifyingcancersubtypesviasupervisedclustering
AT wentzulee yǐjiāndūshìcóngjífēnxīfāngshìshǐyòngjīyīnbiǎoxiànzīliàoxúnzhǎoáizhèngzixíng
AT lǐwéncí yǐjiāndūshìcóngjífēnxīfāngshìshǐyòngjīyīnbiǎoxiànzīliàoxúnzhǎoáizhèngzixíng
_version_ 1717764973946470400