Application of Optimization on Exploring Biomarkers and Improving the Classification Performance in Lung Cancer
碩士 === 慈濟大學 === 醫學資訊學系碩士班 === 102 === Lung cancer is the leading cause of death from cancer in the Taiwan and in the world. The high mortality rate (80–85% within 5 years) results, in part, from a lack of effective tools that can diagnose the disease at an early stage. The mortality could be gre...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2014
|
Online Access: | http://ndltd.ncl.edu.tw/handle/23735356179549213772 |
id |
ndltd-TW-102TCU00604012 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-102TCU006040122015-10-13T23:23:01Z http://ndltd.ncl.edu.tw/handle/23735356179549213772 Application of Optimization on Exploring Biomarkers and Improving the Classification Performance in Lung Cancer 探討肺癌基因生物標誌與應用最佳化方法改善分類器效能 Chih-Hung Cheng 鄭智鴻 碩士 慈濟大學 醫學資訊學系碩士班 102 Lung cancer is the leading cause of death from cancer in the Taiwan and in the world. The high mortality rate (80–85% within 5 years) results, in part, from a lack of effective tools that can diagnose the disease at an early stage. The mortality could be greatly reduced through the development of sensitive molecular markers, if these biomarkers are detectable in the earliest stages. The goals of this thesis, therefore, are to identify the gene biomarkers of lung cancer and to find the best classifier by optimizing the SVM parameters. In this study, three gene selection methods: ANOVA, q-value, and Random Forests, are used to identify the potential gene biomarkers. Furthermore, we also develop three automatically optimization methods in order to have the better classification performance. These three methods are: Genetic Algorithm- Support Vector Machine (GA-SVM), Particle Swarm Optimization- Support Vector Machine (PSO-SVM), and Ant Colony Optimization- Support Vector Machine (ACO-SVM). In summary, we found twelve important genes. Among these genes, CAV1、SFTPC、EPAS1、PECAM1、EDNRB and FHL1 had been identified to have the relationship with the lung cancer; the other six genes: ZFP106、TNXB、CD36、VWF、NPR1 and LDB2 could have strong relationship with the lung cancer but still need further validation. All our three optimization methods have better classification accuracy when compared to the traditional SVM. The average accuracy of these methods can reach around 95% that is almost 6% higher than the traditional method. In these methods, ACO-SVM method is the best classifier based on our study. Austin H Chen 陳信志 2014 學位論文 ; thesis 60 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 慈濟大學 === 醫學資訊學系碩士班 === 102 === Lung cancer is the leading cause of death from cancer in the Taiwan and in the world. The high mortality rate (80–85% within 5 years) results, in part, from a lack of effective tools that can diagnose the disease at an early stage. The mortality could be greatly reduced through the development of sensitive molecular markers, if these biomarkers are detectable in the earliest stages. The goals of this thesis, therefore, are to identify the gene biomarkers of lung cancer and to find the best classifier by optimizing the SVM parameters.
In this study, three gene selection methods: ANOVA, q-value, and Random Forests, are used to identify the potential gene biomarkers. Furthermore, we also develop three automatically optimization methods in order to have the better classification performance. These three methods are: Genetic Algorithm- Support Vector Machine (GA-SVM), Particle Swarm Optimization- Support Vector Machine (PSO-SVM), and Ant Colony Optimization- Support Vector Machine (ACO-SVM).
In summary, we found twelve important genes. Among these genes, CAV1、SFTPC、EPAS1、PECAM1、EDNRB and FHL1 had been identified to have the relationship with the lung cancer; the other six genes: ZFP106、TNXB、CD36、VWF、NPR1 and LDB2 could have strong relationship with the lung cancer but still need further validation. All our three optimization methods have better classification accuracy when compared to the traditional SVM. The average accuracy of these methods can reach around 95% that is almost 6% higher than the traditional method. In these methods, ACO-SVM method is the best classifier based on our study.
|
author2 |
Austin H Chen |
author_facet |
Austin H Chen Chih-Hung Cheng 鄭智鴻 |
author |
Chih-Hung Cheng 鄭智鴻 |
spellingShingle |
Chih-Hung Cheng 鄭智鴻 Application of Optimization on Exploring Biomarkers and Improving the Classification Performance in Lung Cancer |
author_sort |
Chih-Hung Cheng |
title |
Application of Optimization on Exploring Biomarkers and Improving the Classification Performance in Lung Cancer |
title_short |
Application of Optimization on Exploring Biomarkers and Improving the Classification Performance in Lung Cancer |
title_full |
Application of Optimization on Exploring Biomarkers and Improving the Classification Performance in Lung Cancer |
title_fullStr |
Application of Optimization on Exploring Biomarkers and Improving the Classification Performance in Lung Cancer |
title_full_unstemmed |
Application of Optimization on Exploring Biomarkers and Improving the Classification Performance in Lung Cancer |
title_sort |
application of optimization on exploring biomarkers and improving the classification performance in lung cancer |
publishDate |
2014 |
url |
http://ndltd.ncl.edu.tw/handle/23735356179549213772 |
work_keys_str_mv |
AT chihhungcheng applicationofoptimizationonexploringbiomarkersandimprovingtheclassificationperformanceinlungcancer AT zhèngzhìhóng applicationofoptimizationonexploringbiomarkersandimprovingtheclassificationperformanceinlungcancer AT chihhungcheng tàntǎofèiáijīyīnshēngwùbiāozhìyǔyīngyòngzuìjiāhuàfāngfǎgǎishànfēnlèiqìxiàonéng AT zhèngzhìhóng tàntǎofèiáijīyīnshēngwùbiāozhìyǔyīngyòngzuìjiāhuàfāngfǎgǎishànfēnlèiqìxiàonéng |
_version_ |
1718086185137471488 |