An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning

Supervised learning has tremendous applications in cancer prediction, patient treatment, business, and engineering. Datasets for supervised learning are often corrupted by noise and non-biological effects, leading to model-overfitting and performance degradation in current methods of binary classifi...

Full description

Bibliographic Details
Main Authors:	Anuchate Pattanateepapon, Watcharapan Suwansantisuk, Pinit Kumhom
Format:	Article
Language:	English
Published:	IEEE 2020-01-01
Series:	IEEE Access
Subjects:	Binary classification supervised learning multiple hyperplanes performance evaluation
Online Access:	https://ieeexplore.ieee.org/document/8963690/

id	doaj-0b0a919a0b7c45d99df2e4d271574a6d
record_format	Article
spelling	doaj-0b0a919a0b7c45d99df2e4d271574a6d2021-03-30T01:10:10ZengIEEEIEEE Access2169-35362020-01-018220482207110.1109/ACCESS.2020.29678418963690An Approach to Supervised Learning: Dynamic Multi-Hyperplane PartitioningAnuchate Pattanateepapon0https://orcid.org/0000-0003-1246-9482Watcharapan Suwansantisuk1https://orcid.org/0000-0003-0195-7616Pinit Kumhom2https://orcid.org/0000-0003-3059-4877Department of Electronic and Telecommunication Engineering, Faculty of Engineering, King Mongkut’s University of Technology Thonburi, Bangkok, ThailandDepartment of Electronic and Telecommunication Engineering, Faculty of Engineering, King Mongkut’s University of Technology Thonburi, Bangkok, ThailandDepartment of Electronic and Telecommunication Engineering, Faculty of Engineering, King Mongkut’s University of Technology Thonburi, Bangkok, ThailandSupervised learning has tremendous applications in cancer prediction, patient treatment, business, and engineering. Datasets for supervised learning are often corrupted by noise and non-biological effects, leading to model-overfitting and performance degradation in current methods of binary classification. In this research, we develop a new classification method that fully exploits unique characteristics, such as persistent outliers, anomalies from a batch effect, and hidden relationships between features and their classes in the datasets, hence improving classification performance of current methods. The proposed method, called dynamic multi-hyperplane partitioning (DMP), learns the model by using subclassifiers, which are random in number and each of which uses multiple hyperplanes for decision boundaries. We also develop a method to transform samples to improve classification performance of DMP. We prove that, under a mild condition, accuracy of DMP is as good as or supersedes that of support vector machine (SVM). We test DMP on comprehensive datasets, which span diverse fields of applications, and compare accuracy, sensitivity, specificity, F-measure, and the receiver operating characteristic of DMP to those of competitive baselines, including SVM, random forest, Bayes classifier, gradient boosting tree, and deep-belief nets and neural nets. From the comparison, the proposed method is most accurate in nine out of eleven datasets, when using the mean values alone for comparison. DMP achieves 100% accuracy, 100% sensitivity, and 100% specificity in three datasets. As a generalization, we perform statistical test of difference, at significance levels of 0.05, 0.01 and 0.001. From statistical tests, DMP is the most accurate or one of the most accurate classifiers in nine out of eleven benchmark datasets., and is not the most accurate classifier in the remaining two datasets. The DMP learning method is accurate, simple to implement, and does not require fine-tuning of parameters, making it attractive for binary classification. This research has practical applications and leads to a timely and accurate approach to binary classification in diverse fields.https://ieeexplore.ieee.org/document/8963690/Binary classificationsupervised learningmultiple hyperplanesperformance evaluation
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Anuchate Pattanateepapon Watcharapan Suwansantisuk Pinit Kumhom
spellingShingle	Anuchate Pattanateepapon Watcharapan Suwansantisuk Pinit Kumhom An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning IEEE Access Binary classification supervised learning multiple hyperplanes performance evaluation
author_facet	Anuchate Pattanateepapon Watcharapan Suwansantisuk Pinit Kumhom
author_sort	Anuchate Pattanateepapon
title	An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning
title_short	An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning
title_full	An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning
title_fullStr	An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning
title_full_unstemmed	An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning
title_sort	approach to supervised learning: dynamic multi-hyperplane partitioning
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2020-01-01
description	Supervised learning has tremendous applications in cancer prediction, patient treatment, business, and engineering. Datasets for supervised learning are often corrupted by noise and non-biological effects, leading to model-overfitting and performance degradation in current methods of binary classification. In this research, we develop a new classification method that fully exploits unique characteristics, such as persistent outliers, anomalies from a batch effect, and hidden relationships between features and their classes in the datasets, hence improving classification performance of current methods. The proposed method, called dynamic multi-hyperplane partitioning (DMP), learns the model by using subclassifiers, which are random in number and each of which uses multiple hyperplanes for decision boundaries. We also develop a method to transform samples to improve classification performance of DMP. We prove that, under a mild condition, accuracy of DMP is as good as or supersedes that of support vector machine (SVM). We test DMP on comprehensive datasets, which span diverse fields of applications, and compare accuracy, sensitivity, specificity, F-measure, and the receiver operating characteristic of DMP to those of competitive baselines, including SVM, random forest, Bayes classifier, gradient boosting tree, and deep-belief nets and neural nets. From the comparison, the proposed method is most accurate in nine out of eleven datasets, when using the mean values alone for comparison. DMP achieves 100% accuracy, 100% sensitivity, and 100% specificity in three datasets. As a generalization, we perform statistical test of difference, at significance levels of 0.05, 0.01 and 0.001. From statistical tests, DMP is the most accurate or one of the most accurate classifiers in nine out of eleven benchmark datasets., and is not the most accurate classifier in the remaining two datasets. The DMP learning method is accurate, simple to implement, and does not require fine-tuning of parameters, making it attractive for binary classification. This research has practical applications and leads to a timely and accurate approach to binary classification in diverse fields.
topic	Binary classification supervised learning multiple hyperplanes performance evaluation
url	https://ieeexplore.ieee.org/document/8963690/
work_keys_str_mv	AT anuchatepattanateepapon anapproachtosupervisedlearningdynamicmultihyperplanepartitioning AT watcharapansuwansantisuk anapproachtosupervisedlearningdynamicmultihyperplanepartitioning AT pinitkumhom anapproachtosupervisedlearningdynamicmultihyperplanepartitioning AT anuchatepattanateepapon approachtosupervisedlearningdynamicmultihyperplanepartitioning AT watcharapansuwansantisuk approachtosupervisedlearningdynamicmultihyperplanepartitioning AT pinitkumhom approachtosupervisedlearningdynamicmultihyperplanepartitioning
_version_	1724187552896778240

An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning

Similar Items