Variable Selection in Boosting

碩士 === 國立東華大學 === 應用數學系 === 92 === Boosting is one of the successful ensemble classifiers. It attracts much attention recently because its impressive empirical performances and less understood theoretical properties. In this study, we consider the issue of variable selection under the boos...

Full description

Bibliographic Details
Main Authors: Yi-mo Tasi, 蔡易牟
Other Authors: Chen-Hai Andy Tsao
Format: Others
Language:zh-TW
Published: 2004
Online Access:http://ndltd.ncl.edu.tw/handle/47748853554906680421
id ndltd-TW-092NDHU5507009
record_format oai_dc
spelling ndltd-TW-092NDHU55070092016-06-17T04:16:18Z http://ndltd.ncl.edu.tw/handle/47748853554906680421 Variable Selection in Boosting 普適提之變數選擇 Yi-mo Tasi 蔡易牟 碩士 國立東華大學 應用數學系 92 Boosting is one of the successful ensemble classifiers. It attracts much attention recently because its impressive empirical performances and less understood theoretical properties. In this study, we consider the issue of variable selection under the boosting scheme. We study the effectiveness of principle component analysis (PCA), the significance-based selection method and their hybrids as methods of selection. Under multivariate normal model, we found that PCA-SIG (PCA then Significance-based) methods outperform PCA methods and are comparable with SIG-PCA methods. Similar yet more marked phenomena are observed for NBA 2002-2003 (Spurs vs. Nets) box-score data analysis. Using the box-scores from 'Spurs-like' and 'Nets-like' teams, we consider the winner prediction as a (supervised) learning problem. We note that boosting with PCA-SIG achieves satisfactory error rates and outperforms other methods. Chen-Hai Andy Tsao 曹振海 2004 學位論文 ; thesis 51 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立東華大學 === 應用數學系 === 92 === Boosting is one of the successful ensemble classifiers. It attracts much attention recently because its impressive empirical performances and less understood theoretical properties. In this study, we consider the issue of variable selection under the boosting scheme. We study the effectiveness of principle component analysis (PCA), the significance-based selection method and their hybrids as methods of selection. Under multivariate normal model, we found that PCA-SIG (PCA then Significance-based) methods outperform PCA methods and are comparable with SIG-PCA methods. Similar yet more marked phenomena are observed for NBA 2002-2003 (Spurs vs. Nets) box-score data analysis. Using the box-scores from 'Spurs-like' and 'Nets-like' teams, we consider the winner prediction as a (supervised) learning problem. We note that boosting with PCA-SIG achieves satisfactory error rates and outperforms other methods.
author2 Chen-Hai Andy Tsao
author_facet Chen-Hai Andy Tsao
Yi-mo Tasi
蔡易牟
author Yi-mo Tasi
蔡易牟
spellingShingle Yi-mo Tasi
蔡易牟
Variable Selection in Boosting
author_sort Yi-mo Tasi
title Variable Selection in Boosting
title_short Variable Selection in Boosting
title_full Variable Selection in Boosting
title_fullStr Variable Selection in Boosting
title_full_unstemmed Variable Selection in Boosting
title_sort variable selection in boosting
publishDate 2004
url http://ndltd.ncl.edu.tw/handle/47748853554906680421
work_keys_str_mv AT yimotasi variableselectioninboosting
AT càiyìmóu variableselectioninboosting
AT yimotasi pǔshìtízhībiànshùxuǎnzé
AT càiyìmóu pǔshìtízhībiànshùxuǎnzé
_version_ 1718307425825587200