Variable Selection in Boosting
碩士 === 國立東華大學 === 應用數學系 === 92 === Boosting is one of the successful ensemble classifiers. It attracts much attention recently because its impressive empirical performances and less understood theoretical properties. In this study, we consider the issue of variable selection under the boos...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2004
|
Online Access: | http://ndltd.ncl.edu.tw/handle/47748853554906680421 |
id |
ndltd-TW-092NDHU5507009 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-092NDHU55070092016-06-17T04:16:18Z http://ndltd.ncl.edu.tw/handle/47748853554906680421 Variable Selection in Boosting 普適提之變數選擇 Yi-mo Tasi 蔡易牟 碩士 國立東華大學 應用數學系 92 Boosting is one of the successful ensemble classifiers. It attracts much attention recently because its impressive empirical performances and less understood theoretical properties. In this study, we consider the issue of variable selection under the boosting scheme. We study the effectiveness of principle component analysis (PCA), the significance-based selection method and their hybrids as methods of selection. Under multivariate normal model, we found that PCA-SIG (PCA then Significance-based) methods outperform PCA methods and are comparable with SIG-PCA methods. Similar yet more marked phenomena are observed for NBA 2002-2003 (Spurs vs. Nets) box-score data analysis. Using the box-scores from 'Spurs-like' and 'Nets-like' teams, we consider the winner prediction as a (supervised) learning problem. We note that boosting with PCA-SIG achieves satisfactory error rates and outperforms other methods. Chen-Hai Andy Tsao 曹振海 2004 學位論文 ; thesis 51 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立東華大學 === 應用數學系 === 92 ===
Boosting is one of the successful ensemble classifiers. It attracts much attention
recently because its impressive empirical performances and less understood
theoretical properties. In this study, we consider the issue of variable selection
under the boosting scheme. We study the effectiveness of principle component analysis (PCA),
the significance-based selection method and their hybrids as methods of selection.
Under multivariate normal model, we found that PCA-SIG (PCA then Significance-based) methods
outperform PCA methods and are comparable with SIG-PCA methods.
Similar yet more marked phenomena are observed for NBA 2002-2003
(Spurs vs. Nets) box-score data analysis. Using the box-scores from
'Spurs-like' and 'Nets-like' teams, we consider the winner prediction
as a (supervised) learning problem. We note that boosting with PCA-SIG achieves satisfactory
error rates and outperforms other methods.
|
author2 |
Chen-Hai Andy Tsao |
author_facet |
Chen-Hai Andy Tsao Yi-mo Tasi 蔡易牟 |
author |
Yi-mo Tasi 蔡易牟 |
spellingShingle |
Yi-mo Tasi 蔡易牟 Variable Selection in Boosting |
author_sort |
Yi-mo Tasi |
title |
Variable Selection in Boosting |
title_short |
Variable Selection in Boosting |
title_full |
Variable Selection in Boosting |
title_fullStr |
Variable Selection in Boosting |
title_full_unstemmed |
Variable Selection in Boosting |
title_sort |
variable selection in boosting |
publishDate |
2004 |
url |
http://ndltd.ncl.edu.tw/handle/47748853554906680421 |
work_keys_str_mv |
AT yimotasi variableselectioninboosting AT càiyìmóu variableselectioninboosting AT yimotasi pǔshìtízhībiànshùxuǎnzé AT càiyìmóu pǔshìtízhībiànshùxuǎnzé |
_version_ |
1718307425825587200 |