An Optimal Random Projection <italic>k</italic> Nearest Neighbors Ensemble via Extended Neighborhood Rule for Binary Classification

This paper presents an ensemble method for binary classification, where each base model is based on an extended neighbourhood rule (ExNRule). The ExNRule identifies the neighbours of an unseen observation in a stepwise manner. This rule first selects the sample point closest to the experimental obse...

Full description

Bibliographic Details
Published in:IEEE Access
Main Authors: Amjad Ali, Zardad Khan, Dost Muhammad Khan, Saeed Aldahmani
Format: Article
Language:English
Published: IEEE 2024-01-01
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10506906/
_version_ 1849862401600520192
author Amjad Ali
Zardad Khan
Dost Muhammad Khan
Saeed Aldahmani
author_facet Amjad Ali
Zardad Khan
Dost Muhammad Khan
Saeed Aldahmani
author_sort Amjad Ali
collection DOAJ
container_title IEEE Access
description This paper presents an ensemble method for binary classification, where each base model is based on an extended neighbourhood rule (ExNRule). The ExNRule identifies the neighbours of an unseen observation in a stepwise manner. This rule first selects the sample point closest to the experimental observation and then selects the second observation nearest to the previously chosen one. To find the required data points in the neighbourhood, this search is repeated up to k steps. The test sample point is predicted using majority voting in the class labels of the k chosen neighbours. In the proposed method, a large number of ExNRule based models are constructed on randomly projected bootstrap samples. The error rates of these models are computed using out-of-bag data points. The models are then ranked according to their out-of-bag errors, and a proportion of the most accurate models are selected. The final ensemble is constructed by combining the selected models. The proposed method is compared with other classical procedures on 15 benchmark datasets in terms of classification accuracy, Kohen&#x2019;s kappa and Brier score (BS) as performance metrics. Boxplots of the results are also constructed. The proposed ensemble is outperforming the existing methods on almost all the benchmark datasets. For further evaluation, the proposed method is compared with other kNN based classifiers on 3 datasets using different k values. Furthermore, the performance of the proposed method is also evaluated using simulated data under different scenarios.
format Article
id doaj-art-3a2ffefcb669447fa2b0750fc8132e28
institution Directory of Open Access Journals
issn 2169-3536
language English
publishDate 2024-01-01
publisher IEEE
record_format Article
spelling doaj-art-3a2ffefcb669447fa2b0750fc8132e282025-08-20T01:18:27ZengIEEEIEEE Access2169-35362024-01-0112614016140910.1109/ACCESS.2024.339272910506906An Optimal Random Projection <italic>k</italic> Nearest Neighbors Ensemble via Extended Neighborhood Rule for Binary ClassificationAmjad Ali0https://orcid.org/0000-0003-1411-954XZardad Khan1https://orcid.org/0000-0003-3933-9143Dost Muhammad Khan2Saeed Aldahmani3https://orcid.org/0000-0002-0826-4540Department of Statistics, Abdul Wali Khan University Mardan, Mardan, PakistanDepartment of Statistics and Business Analytics, United Arab Emirates University, Al Ain, United Arab EmiratesDepartment of Statistics, Abdul Wali Khan University Mardan, Mardan, PakistanDepartment of Statistics and Business Analytics, United Arab Emirates University, Al Ain, United Arab EmiratesThis paper presents an ensemble method for binary classification, where each base model is based on an extended neighbourhood rule (ExNRule). The ExNRule identifies the neighbours of an unseen observation in a stepwise manner. This rule first selects the sample point closest to the experimental observation and then selects the second observation nearest to the previously chosen one. To find the required data points in the neighbourhood, this search is repeated up to k steps. The test sample point is predicted using majority voting in the class labels of the k chosen neighbours. In the proposed method, a large number of ExNRule based models are constructed on randomly projected bootstrap samples. The error rates of these models are computed using out-of-bag data points. The models are then ranked according to their out-of-bag errors, and a proportion of the most accurate models are selected. The final ensemble is constructed by combining the selected models. The proposed method is compared with other classical procedures on 15 benchmark datasets in terms of classification accuracy, Kohen&#x2019;s kappa and Brier score (BS) as performance metrics. Boxplots of the results are also constructed. The proposed ensemble is outperforming the existing methods on almost all the benchmark datasets. For further evaluation, the proposed method is compared with other kNN based classifiers on 3 datasets using different k values. Furthermore, the performance of the proposed method is also evaluated using simulated data under different scenarios.https://ieeexplore.ieee.org/document/10506906/ClassificationkNNextended neighborhood ruleensemble learningbootstrappingrandom projection
spellingShingle Amjad Ali
Zardad Khan
Dost Muhammad Khan
Saeed Aldahmani
An Optimal Random Projection <italic>k</italic> Nearest Neighbors Ensemble via Extended Neighborhood Rule for Binary Classification
Classification
kNN
extended neighborhood rule
ensemble learning
bootstrapping
random projection
title An Optimal Random Projection <italic>k</italic> Nearest Neighbors Ensemble via Extended Neighborhood Rule for Binary Classification
title_full An Optimal Random Projection <italic>k</italic> Nearest Neighbors Ensemble via Extended Neighborhood Rule for Binary Classification
title_fullStr An Optimal Random Projection <italic>k</italic> Nearest Neighbors Ensemble via Extended Neighborhood Rule for Binary Classification
title_full_unstemmed An Optimal Random Projection <italic>k</italic> Nearest Neighbors Ensemble via Extended Neighborhood Rule for Binary Classification
title_short An Optimal Random Projection <italic>k</italic> Nearest Neighbors Ensemble via Extended Neighborhood Rule for Binary Classification
title_sort optimal random projection italic k italic nearest neighbors ensemble via extended neighborhood rule for binary classification
topic Classification
kNN
extended neighborhood rule
ensemble learning
bootstrapping
random projection
url https://ieeexplore.ieee.org/document/10506906/
work_keys_str_mv AT amjadali anoptimalrandomprojectionitalickitalicnearestneighborsensembleviaextendedneighborhoodruleforbinaryclassification
AT zardadkhan anoptimalrandomprojectionitalickitalicnearestneighborsensembleviaextendedneighborhoodruleforbinaryclassification
AT dostmuhammadkhan anoptimalrandomprojectionitalickitalicnearestneighborsensembleviaextendedneighborhoodruleforbinaryclassification
AT saeedaldahmani anoptimalrandomprojectionitalickitalicnearestneighborsensembleviaextendedneighborhoodruleforbinaryclassification
AT amjadali optimalrandomprojectionitalickitalicnearestneighborsensembleviaextendedneighborhoodruleforbinaryclassification
AT zardadkhan optimalrandomprojectionitalickitalicnearestneighborsensembleviaextendedneighborhoodruleforbinaryclassification
AT dostmuhammadkhan optimalrandomprojectionitalickitalicnearestneighborsensembleviaextendedneighborhoodruleforbinaryclassification
AT saeedaldahmani optimalrandomprojectionitalickitalicnearestneighborsensembleviaextendedneighborhoodruleforbinaryclassification