An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning

Supervised learning has tremendous applications in cancer prediction, patient treatment, business, and engineering. Datasets for supervised learning are often corrupted by noise and non-biological effects, leading to model-overfitting and performance degradation in current methods of binary classifi...

Full description

Bibliographic Details
Main Authors: Anuchate Pattanateepapon, Watcharapan Suwansantisuk, Pinit Kumhom
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8963690/
id doaj-0b0a919a0b7c45d99df2e4d271574a6d
record_format Article
spelling doaj-0b0a919a0b7c45d99df2e4d271574a6d2021-03-30T01:10:10ZengIEEEIEEE Access2169-35362020-01-018220482207110.1109/ACCESS.2020.29678418963690An Approach to Supervised Learning: Dynamic Multi-Hyperplane PartitioningAnuchate Pattanateepapon0https://orcid.org/0000-0003-1246-9482Watcharapan Suwansantisuk1https://orcid.org/0000-0003-0195-7616Pinit Kumhom2https://orcid.org/0000-0003-3059-4877Department of Electronic and Telecommunication Engineering, Faculty of Engineering, King Mongkut’s University of Technology Thonburi, Bangkok, ThailandDepartment of Electronic and Telecommunication Engineering, Faculty of Engineering, King Mongkut’s University of Technology Thonburi, Bangkok, ThailandDepartment of Electronic and Telecommunication Engineering, Faculty of Engineering, King Mongkut’s University of Technology Thonburi, Bangkok, ThailandSupervised learning has tremendous applications in cancer prediction, patient treatment, business, and engineering. Datasets for supervised learning are often corrupted by noise and non-biological effects, leading to model-overfitting and performance degradation in current methods of binary classification. In this research, we develop a new classification method that fully exploits unique characteristics, such as persistent outliers, anomalies from a batch effect, and hidden relationships between features and their classes in the datasets, hence improving classification performance of current methods. The proposed method, called dynamic multi-hyperplane partitioning (DMP), learns the model by using subclassifiers, which are random in number and each of which uses multiple hyperplanes for decision boundaries. We also develop a method to transform samples to improve classification performance of DMP. We prove that, under a mild condition, accuracy of DMP is as good as or supersedes that of support vector machine (SVM). We test DMP on comprehensive datasets, which span diverse fields of applications, and compare accuracy, sensitivity, specificity, F-measure, and the receiver operating characteristic of DMP to those of competitive baselines, including SVM, random forest, Bayes classifier, gradient boosting tree, and deep-belief nets and neural nets. From the comparison, the proposed method is most accurate in nine out of eleven datasets, when using the mean values alone for comparison. DMP achieves 100% accuracy, 100% sensitivity, and 100% specificity in three datasets. As a generalization, we perform statistical test of difference, at significance levels of 0.05, 0.01 and 0.001. From statistical tests, DMP is the most accurate or one of the most accurate classifiers in nine out of eleven benchmark datasets., and is not the most accurate classifier in the remaining two datasets. The DMP learning method is accurate, simple to implement, and does not require fine-tuning of parameters, making it attractive for binary classification. This research has practical applications and leads to a timely and accurate approach to binary classification in diverse fields.https://ieeexplore.ieee.org/document/8963690/Binary classificationsupervised learningmultiple hyperplanesperformance evaluation
collection DOAJ
language English
format Article
sources DOAJ
author Anuchate Pattanateepapon
Watcharapan Suwansantisuk
Pinit Kumhom
spellingShingle Anuchate Pattanateepapon
Watcharapan Suwansantisuk
Pinit Kumhom
An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning
IEEE Access
Binary classification
supervised learning
multiple hyperplanes
performance evaluation
author_facet Anuchate Pattanateepapon
Watcharapan Suwansantisuk
Pinit Kumhom
author_sort Anuchate Pattanateepapon
title An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning
title_short An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning
title_full An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning
title_fullStr An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning
title_full_unstemmed An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning
title_sort approach to supervised learning: dynamic multi-hyperplane partitioning
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description Supervised learning has tremendous applications in cancer prediction, patient treatment, business, and engineering. Datasets for supervised learning are often corrupted by noise and non-biological effects, leading to model-overfitting and performance degradation in current methods of binary classification. In this research, we develop a new classification method that fully exploits unique characteristics, such as persistent outliers, anomalies from a batch effect, and hidden relationships between features and their classes in the datasets, hence improving classification performance of current methods. The proposed method, called dynamic multi-hyperplane partitioning (DMP), learns the model by using subclassifiers, which are random in number and each of which uses multiple hyperplanes for decision boundaries. We also develop a method to transform samples to improve classification performance of DMP. We prove that, under a mild condition, accuracy of DMP is as good as or supersedes that of support vector machine (SVM). We test DMP on comprehensive datasets, which span diverse fields of applications, and compare accuracy, sensitivity, specificity, F-measure, and the receiver operating characteristic of DMP to those of competitive baselines, including SVM, random forest, Bayes classifier, gradient boosting tree, and deep-belief nets and neural nets. From the comparison, the proposed method is most accurate in nine out of eleven datasets, when using the mean values alone for comparison. DMP achieves 100% accuracy, 100% sensitivity, and 100% specificity in three datasets. As a generalization, we perform statistical test of difference, at significance levels of 0.05, 0.01 and 0.001. From statistical tests, DMP is the most accurate or one of the most accurate classifiers in nine out of eleven benchmark datasets., and is not the most accurate classifier in the remaining two datasets. The DMP learning method is accurate, simple to implement, and does not require fine-tuning of parameters, making it attractive for binary classification. This research has practical applications and leads to a timely and accurate approach to binary classification in diverse fields.
topic Binary classification
supervised learning
multiple hyperplanes
performance evaluation
url https://ieeexplore.ieee.org/document/8963690/
work_keys_str_mv AT anuchatepattanateepapon anapproachtosupervisedlearningdynamicmultihyperplanepartitioning
AT watcharapansuwansantisuk anapproachtosupervisedlearningdynamicmultihyperplanepartitioning
AT pinitkumhom anapproachtosupervisedlearningdynamicmultihyperplanepartitioning
AT anuchatepattanateepapon approachtosupervisedlearningdynamicmultihyperplanepartitioning
AT watcharapansuwansantisuk approachtosupervisedlearningdynamicmultihyperplanepartitioning
AT pinitkumhom approachtosupervisedlearningdynamicmultihyperplanepartitioning
_version_ 1724187552896778240