Classification on the Imbalanced Data Stream with Concept Drifts Using a G-means Update Ensemble Approach

Master's === National Taiwan University of Science and Technology === Department of Computer Science and Information Engineering === 104 === Concept drift has become an important issue in data stream analysis. Data streams can also have skewed class distributions, a problem known as class imbalance. In practice, it is likely that a data stream simultaneously has multiple concept d...

Bibliographic Details
Main Authors: Sin-Kai Wang, 王信凱
Other Authors: Bi-Ru Dai
Format: Others
Language: en_US
Published: 2016
Online Access: http://ndltd.ncl.edu.tw/handle/uhhp44
id ndltd-TW-104NTUS5392069
record_format oai_dc
spelling ndltd-TW-104NTUS5392069 2019-05-15T23:01:18Z http://ndltd.ncl.edu.tw/handle/uhhp44 Classification on the Imbalanced Data Stream with Concept Drifts Using a G-means Update Ensemble Approach 在類別不平衡的資料串流上針對概念漂移問題的幾何平均更新集成式學習方法 Sin-Kai Wang 王信凱 Master's National Taiwan University of Science and Technology Department of Computer Science and Information Engineering 104 Bi-Ru Dai 戴碧如 2016 thesis 41 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description Master's === National Taiwan University of Science and Technology === Department of Computer Science and Information Engineering === 104 === Concept drift has become an important issue in data stream analysis. Data streams can also have skewed class distributions, a problem known as class imbalance. In practice, it is likely that a data stream simultaneously has multiple concept drifts and an imbalanced class distribution. However, because most existing approaches do not consider class imbalance and concept drift at the same time, they may achieve good overall average accuracy while the accuracy on the minority class is very poor. To address these challenges, this paper proposes a new weighting method that further improves the accuracy of the minority class on imbalanced data streams with concept drifts. The experimental results confirm that the method not only achieves impressive average accuracy but also improves the accuracy of the minority class on imbalanced data streams.
author2 Bi-Ru Dai
author_facet Bi-Ru Dai
Sin-Kai Wang
王信凱
author Sin-Kai Wang
王信凱
spellingShingle Sin-Kai Wang
王信凱
Classification on the Imbalanced Data Stream with Concept Drifts Using a G-means Update Ensemble Approach
author_sort Sin-Kai Wang
title Classification on the Imbalanced Data Stream with Concept Drifts Using a G-means Update Ensemble Approach
publishDate 2016
url http://ndltd.ncl.edu.tw/handle/uhhp44
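
The abstract observes that a classifier can score well on overall average accuracy while doing very poorly on the minority class. The sketch below illustrates that point with the G-mean, commonly defined as the geometric mean of the per-class recalls. It is not the weighting method proposed in the thesis, which this record does not describe. The data, the label convention (0 = majority, 1 = minority), and all function names are hypothetical.

```python
import math


def accuracy(y_true, y_pred):
    """Fraction of all examples that were predicted correctly."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def recall(y_true, y_pred, label):
    """Fraction of examples whose true class is `label` that were predicted as `label`."""
    hits = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
    total = sum(1 for t in y_true if t == label)
    return hits / total


def g_mean(y_true, y_pred, labels=(0, 1)):
    """Geometric mean of the per-class recalls (assumed definition of G-mean)."""
    recalls = [recall(y_true, y_pred, c) for c in labels]
    return math.prod(recalls) ** (1.0 / len(recalls))


# Hypothetical chunk of a stream: 95 majority examples (0), 5 minority examples (1).
y_true = [0] * 95 + [1] * 5

# Classifier A always predicts the majority class.
always_majority = [0] * 100

# Classifier B gets 85 of 95 majority and 4 of 5 minority examples right.
mixed = [0] * 85 + [1] * 10 + [1] * 4 + [0] * 1

for name, y_pred in [("always_majority", always_majority), ("mixed", mixed)]:
    print(
        name,
        "accuracy=%.3f" % accuracy(y_true, y_pred),
        "minority_recall=%.3f" % recall(y_true, y_pred, 1),
        "g_mean=%.3f" % g_mean(y_true, y_pred),
    )
```

Classifier A has the higher overall accuracy but never detects the minority class, so its G-mean is zero. Classifier B has lower overall accuracy but a much higher G-mean, which is why a G-mean-based score is a natural candidate when minority-class accuracy matters.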