Medical Health Big Data Classification Based on KNN Classification Algorithm

The rapid development of information technology has led to the development of medical informatization in the direction of intelligence. Medical health big data provides a basic data resource guarantee for medical service intelligence and smart healthcare. The classification of medical health big dat...

Full description

Bibliographic Details
Main Authors: Wenchao Xing, Yilin Bei
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8911389/
id doaj-640533c6957248b68eb33edecadd2af8
record_format Article
spelling doaj-640533c6957248b68eb33edecadd2af82021-03-30T02:10:30ZengIEEEIEEE Access2169-35362020-01-018288082881910.1109/ACCESS.2019.29557548911389Medical Health Big Data Classification Based on KNN Classification AlgorithmWenchao Xing0https://orcid.org/0000-0001-9965-9937Yilin Bei1https://orcid.org/0000-0002-8006-5108School of Primary Education, Jining University, Qufu, ChinaSchool of Information Science and Technology, Taishan University, Tai’an, ChinaThe rapid development of information technology has led to the development of medical informatization in the direction of intelligence. Medical health big data provides a basic data resource guarantee for medical service intelligence and smart healthcare. The classification of medical health big data is of great significance for the intelligentization of medical information. Due to the simplicity of KNN (K-Nearest Neighbor) classification algorithm, it has been widely used in many fields. However, when the sample size is large and the feature attributes are large, the efficiency of the KNN algorithm classification will be greatly reduced. This paper proposes an improved KNN algorithm and compares it with the traditional KNN algorithm. The classification is performed in the query instance neighborhood of the conventional KNN classifier, and weights are assigned to each class. The algorithm considers the class distribution around the query instance to ensure that the assigned weight does not adversely affect the outliers. Aiming at the shortcomings of traditional KNN algorithm in processing large data sets, this paper proposes an improved KNN algorithm based on cluster denoising and density cropping. The algorithm performs denoising processing by clustering, and improves the classification efficiency of KNN algorithm by speeding up the search speed of K-nearest neighbors, while maintaining the classification accuracy of KNN algorithm. The experimental results show that the proposed algorithm can effectively improve the classification efficiency of KNN algorithm in processing large data sets, and maintain the classification accuracy of KNN algorithm well, and has good classification performance.https://ieeexplore.ieee.org/document/8911389/Improved KNN classifierweighted KNN algorithmcluster denoisingdensity cropping
collection DOAJ
language English
format Article
sources DOAJ
author Wenchao Xing
Yilin Bei
spellingShingle Wenchao Xing
Yilin Bei
Medical Health Big Data Classification Based on KNN Classification Algorithm
IEEE Access
Improved KNN classifier
weighted KNN algorithm
cluster denoising
density cropping
author_facet Wenchao Xing
Yilin Bei
author_sort Wenchao Xing
title Medical Health Big Data Classification Based on KNN Classification Algorithm
title_short Medical Health Big Data Classification Based on KNN Classification Algorithm
title_full Medical Health Big Data Classification Based on KNN Classification Algorithm
title_fullStr Medical Health Big Data Classification Based on KNN Classification Algorithm
title_full_unstemmed Medical Health Big Data Classification Based on KNN Classification Algorithm
title_sort medical health big data classification based on knn classification algorithm
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description The rapid development of information technology has led to the development of medical informatization in the direction of intelligence. Medical health big data provides a basic data resource guarantee for medical service intelligence and smart healthcare. The classification of medical health big data is of great significance for the intelligentization of medical information. Due to the simplicity of KNN (K-Nearest Neighbor) classification algorithm, it has been widely used in many fields. However, when the sample size is large and the feature attributes are large, the efficiency of the KNN algorithm classification will be greatly reduced. This paper proposes an improved KNN algorithm and compares it with the traditional KNN algorithm. The classification is performed in the query instance neighborhood of the conventional KNN classifier, and weights are assigned to each class. The algorithm considers the class distribution around the query instance to ensure that the assigned weight does not adversely affect the outliers. Aiming at the shortcomings of traditional KNN algorithm in processing large data sets, this paper proposes an improved KNN algorithm based on cluster denoising and density cropping. The algorithm performs denoising processing by clustering, and improves the classification efficiency of KNN algorithm by speeding up the search speed of K-nearest neighbors, while maintaining the classification accuracy of KNN algorithm. The experimental results show that the proposed algorithm can effectively improve the classification efficiency of KNN algorithm in processing large data sets, and maintain the classification accuracy of KNN algorithm well, and has good classification performance.
topic Improved KNN classifier
weighted KNN algorithm
cluster denoising
density cropping
url https://ieeexplore.ieee.org/document/8911389/
work_keys_str_mv AT wenchaoxing medicalhealthbigdataclassificationbasedonknnclassificationalgorithm
AT yilinbei medicalhealthbigdataclassificationbasedonknnclassificationalgorithm
_version_ 1724185631878283264