An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning
Supervised learning has tremendous applications in cancer prediction, patient treatment, business, and engineering. Datasets for supervised learning are often corrupted by noise and non-biological effects, leading to model-overfitting and performance degradation in current methods of binary classifi...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8963690/ |
id |
doaj-0b0a919a0b7c45d99df2e4d271574a6d |
---|---|
record_format |
Article |
spelling |
doaj-0b0a919a0b7c45d99df2e4d271574a6d2021-03-30T01:10:10ZengIEEEIEEE Access2169-35362020-01-018220482207110.1109/ACCESS.2020.29678418963690An Approach to Supervised Learning: Dynamic Multi-Hyperplane PartitioningAnuchate Pattanateepapon0https://orcid.org/0000-0003-1246-9482Watcharapan Suwansantisuk1https://orcid.org/0000-0003-0195-7616Pinit Kumhom2https://orcid.org/0000-0003-3059-4877Department of Electronic and Telecommunication Engineering, Faculty of Engineering, King Mongkut’s University of Technology Thonburi, Bangkok, ThailandDepartment of Electronic and Telecommunication Engineering, Faculty of Engineering, King Mongkut’s University of Technology Thonburi, Bangkok, ThailandDepartment of Electronic and Telecommunication Engineering, Faculty of Engineering, King Mongkut’s University of Technology Thonburi, Bangkok, ThailandSupervised learning has tremendous applications in cancer prediction, patient treatment, business, and engineering. Datasets for supervised learning are often corrupted by noise and non-biological effects, leading to model-overfitting and performance degradation in current methods of binary classification. In this research, we develop a new classification method that fully exploits unique characteristics, such as persistent outliers, anomalies from a batch effect, and hidden relationships between features and their classes in the datasets, hence improving classification performance of current methods. The proposed method, called dynamic multi-hyperplane partitioning (DMP), learns the model by using subclassifiers, which are random in number and each of which uses multiple hyperplanes for decision boundaries. We also develop a method to transform samples to improve classification performance of DMP. We prove that, under a mild condition, accuracy of DMP is as good as or supersedes that of support vector machine (SVM). We test DMP on comprehensive datasets, which span diverse fields of applications, and compare accuracy, sensitivity, specificity, F-measure, and the receiver operating characteristic of DMP to those of competitive baselines, including SVM, random forest, Bayes classifier, gradient boosting tree, and deep-belief nets and neural nets. From the comparison, the proposed method is most accurate in nine out of eleven datasets, when using the mean values alone for comparison. DMP achieves 100% accuracy, 100% sensitivity, and 100% specificity in three datasets. As a generalization, we perform statistical test of difference, at significance levels of 0.05, 0.01 and 0.001. From statistical tests, DMP is the most accurate or one of the most accurate classifiers in nine out of eleven benchmark datasets., and is not the most accurate classifier in the remaining two datasets. The DMP learning method is accurate, simple to implement, and does not require fine-tuning of parameters, making it attractive for binary classification. This research has practical applications and leads to a timely and accurate approach to binary classification in diverse fields.https://ieeexplore.ieee.org/document/8963690/Binary classificationsupervised learningmultiple hyperplanesperformance evaluation |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Anuchate Pattanateepapon Watcharapan Suwansantisuk Pinit Kumhom |
spellingShingle |
Anuchate Pattanateepapon Watcharapan Suwansantisuk Pinit Kumhom An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning IEEE Access Binary classification supervised learning multiple hyperplanes performance evaluation |
author_facet |
Anuchate Pattanateepapon Watcharapan Suwansantisuk Pinit Kumhom |
author_sort |
Anuchate Pattanateepapon |
title |
An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning |
title_short |
An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning |
title_full |
An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning |
title_fullStr |
An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning |
title_full_unstemmed |
An Approach to Supervised Learning: Dynamic Multi-Hyperplane Partitioning |
title_sort |
approach to supervised learning: dynamic multi-hyperplane partitioning |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
Supervised learning has tremendous applications in cancer prediction, patient treatment, business, and engineering. Datasets for supervised learning are often corrupted by noise and non-biological effects, leading to model-overfitting and performance degradation in current methods of binary classification. In this research, we develop a new classification method that fully exploits unique characteristics, such as persistent outliers, anomalies from a batch effect, and hidden relationships between features and their classes in the datasets, hence improving classification performance of current methods. The proposed method, called dynamic multi-hyperplane partitioning (DMP), learns the model by using subclassifiers, which are random in number and each of which uses multiple hyperplanes for decision boundaries. We also develop a method to transform samples to improve classification performance of DMP. We prove that, under a mild condition, accuracy of DMP is as good as or supersedes that of support vector machine (SVM). We test DMP on comprehensive datasets, which span diverse fields of applications, and compare accuracy, sensitivity, specificity, F-measure, and the receiver operating characteristic of DMP to those of competitive baselines, including SVM, random forest, Bayes classifier, gradient boosting tree, and deep-belief nets and neural nets. From the comparison, the proposed method is most accurate in nine out of eleven datasets, when using the mean values alone for comparison. DMP achieves 100% accuracy, 100% sensitivity, and 100% specificity in three datasets. As a generalization, we perform statistical test of difference, at significance levels of 0.05, 0.01 and 0.001. From statistical tests, DMP is the most accurate or one of the most accurate classifiers in nine out of eleven benchmark datasets., and is not the most accurate classifier in the remaining two datasets. The DMP learning method is accurate, simple to implement, and does not require fine-tuning of parameters, making it attractive for binary classification. This research has practical applications and leads to a timely and accurate approach to binary classification in diverse fields. |
topic |
Binary classification supervised learning multiple hyperplanes performance evaluation |
url |
https://ieeexplore.ieee.org/document/8963690/ |
work_keys_str_mv |
AT anuchatepattanateepapon anapproachtosupervisedlearningdynamicmultihyperplanepartitioning AT watcharapansuwansantisuk anapproachtosupervisedlearningdynamicmultihyperplanepartitioning AT pinitkumhom anapproachtosupervisedlearningdynamicmultihyperplanepartitioning AT anuchatepattanateepapon approachtosupervisedlearningdynamicmultihyperplanepartitioning AT watcharapansuwansantisuk approachtosupervisedlearningdynamicmultihyperplanepartitioning AT pinitkumhom approachtosupervisedlearningdynamicmultihyperplanepartitioning |
_version_ |
1724187552896778240 |