Fast Reaction to Sudden Concept Drift in the Absence of Class Labels

A data stream can be considered as a sequence of examples that arrive continuously and are potentially unbounded, such as web page visits, sensor readings and call records. One of the serious and challenging problems that appears in a data stream is concept drift. This problem occurs when the relati...

Full description

Bibliographic Details
Main Authors: Osama A. Mahdi, Eric Pardede, Nawfal Ali, Jinli Cao
Format: Article
Language:English
Published: MDPI AG 2020-01-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/10/2/606
id doaj-c291229acb6e44b4bc78923b301c20f2
record_format Article
spelling doaj-c291229acb6e44b4bc78923b301c20f22020-11-25T03:30:24ZengMDPI AGApplied Sciences2076-34172020-01-0110260610.3390/app10020606app10020606Fast Reaction to Sudden Concept Drift in the Absence of Class LabelsOsama A. Mahdi0Eric Pardede1Nawfal Ali2Jinli Cao3Computer Science and Information Technology, La Trobe University, Bundoora VIC 3086, AustraliaComputer Science and Information Technology, La Trobe University, Bundoora VIC 3086, AustraliaFaculty of Information Technology, Monash University, Clayton 3800, AustraliaComputer Science and Information Technology, La Trobe University, Bundoora VIC 3086, AustraliaA data stream can be considered as a sequence of examples that arrive continuously and are potentially unbounded, such as web page visits, sensor readings and call records. One of the serious and challenging problems that appears in a data stream is concept drift. This problem occurs when the relation between the input data and the target variable changes over time. Most existing works make an optimistic assumption that all incoming data are labelled and the class labels are available immediately. However, such an assumption is not always valid. Therefore, a lack of class labels aggravates the problem of concept drift detection. With this motivation, we propose a drift detector that reacts naturally to sudden drifts in the absence of class labels. In a novel way, the proposed detector reacts to concept drift in the absence of class labels, where the true label of an example is not necessary. Instead of monitoring the error estimates, the proposed detector monitors the diversity of a pair of classifiers, where the true label of an example is not necessary to determine whether components disagree. Using several datasets, an experimental evaluation and comparison is conducted against several existing detectors. The experiment results show that the proposed detector can detect drifts with less delay, runtime and memory usage.https://www.mdpi.com/2076-3417/10/2/606concept driftdata stream miningsemisupervised environment
collection DOAJ
language English
format Article
sources DOAJ
author Osama A. Mahdi
Eric Pardede
Nawfal Ali
Jinli Cao
spellingShingle Osama A. Mahdi
Eric Pardede
Nawfal Ali
Jinli Cao
Fast Reaction to Sudden Concept Drift in the Absence of Class Labels
Applied Sciences
concept drift
data stream mining
semisupervised environment
author_facet Osama A. Mahdi
Eric Pardede
Nawfal Ali
Jinli Cao
author_sort Osama A. Mahdi
title Fast Reaction to Sudden Concept Drift in the Absence of Class Labels
title_short Fast Reaction to Sudden Concept Drift in the Absence of Class Labels
title_full Fast Reaction to Sudden Concept Drift in the Absence of Class Labels
title_fullStr Fast Reaction to Sudden Concept Drift in the Absence of Class Labels
title_full_unstemmed Fast Reaction to Sudden Concept Drift in the Absence of Class Labels
title_sort fast reaction to sudden concept drift in the absence of class labels
publisher MDPI AG
series Applied Sciences
issn 2076-3417
publishDate 2020-01-01
description A data stream can be considered as a sequence of examples that arrive continuously and are potentially unbounded, such as web page visits, sensor readings and call records. One of the serious and challenging problems that appears in a data stream is concept drift. This problem occurs when the relation between the input data and the target variable changes over time. Most existing works make an optimistic assumption that all incoming data are labelled and the class labels are available immediately. However, such an assumption is not always valid. Therefore, a lack of class labels aggravates the problem of concept drift detection. With this motivation, we propose a drift detector that reacts naturally to sudden drifts in the absence of class labels. In a novel way, the proposed detector reacts to concept drift in the absence of class labels, where the true label of an example is not necessary. Instead of monitoring the error estimates, the proposed detector monitors the diversity of a pair of classifiers, where the true label of an example is not necessary to determine whether components disagree. Using several datasets, an experimental evaluation and comparison is conducted against several existing detectors. The experiment results show that the proposed detector can detect drifts with less delay, runtime and memory usage.
topic concept drift
data stream mining
semisupervised environment
url https://www.mdpi.com/2076-3417/10/2/606
work_keys_str_mv AT osamaamahdi fastreactiontosuddenconceptdriftintheabsenceofclasslabels
AT ericpardede fastreactiontosuddenconceptdriftintheabsenceofclasslabels
AT nawfalali fastreactiontosuddenconceptdriftintheabsenceofclasslabels
AT jinlicao fastreactiontosuddenconceptdriftintheabsenceofclasslabels
_version_ 1724575800155439104