Improved Density Peak Clustering Based on Information Entropy for Ancient Character Images

A large number of IoT applications require the use of supervised machine learning, a type of machine learning algorithm that requires data to be labeled before the model can be trained. Because manually labeling large datasets is a time-consuming, error-prone, and expensive task, automated machine l...

Full description

Bibliographic Details
Main Authors: Yu Weng, Ning Zhang, Xiaoxian Yang
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8741008/
id doaj-08d27f781a054cf881368d040adbccff
record_format Article
spelling doaj-08d27f781a054cf881368d040adbccff2021-03-30T00:12:16ZengIEEEIEEE Access2169-35362019-01-017816918170010.1109/ACCESS.2019.29236948741008Improved Density Peak Clustering Based on Information Entropy for Ancient Character ImagesYu Weng0https://orcid.org/0000-0002-0787-550XNing Zhang1https://orcid.org/0000-0002-0580-1769Xiaoxian Yang2School of Information Engineering, Minzu University of China, Beijing, ChinaSchool of Information Engineering, Minzu University of China, Beijing, ChinaSchool of Computer and Information Engineering, Shanghai Polytechnic University, Shanghai, ChinaA large number of IoT applications require the use of supervised machine learning, a type of machine learning algorithm that requires data to be labeled before the model can be trained. Because manually labeling large datasets is a time-consuming, error-prone, and expensive task, automated machine learning methods can be used. To tackle the challenge in which an ancient character image needs to be manually labeled, this paper explores the classification method of ancient Chinese character images based on density peak clustering. We design a metric function of density peak clustering and propose an improved density peak clustering method based on information entropy for ancient book image classification. The method enumerates the distance threshold of clustering, then calculates the information entropy of the clustering result, and determines the class distance threshold by analyzing the attenuation of the information entropy, thereby completing the image clustering process. The improved metric function is used to calculate the similarity between images. A greedy strategy is used as the basis of the merging operation of the class members to achieve the purpose of increasing the degree of information entropy attenuation. The experimental results on the dataset of the Yi character images prove that the method can accurately classify unknown character images of ancient books.https://ieeexplore.ieee.org/document/8741008/Density peak clusteringinformation entropyancient character image
collection DOAJ
language English
format Article
sources DOAJ
author Yu Weng
Ning Zhang
Xiaoxian Yang
spellingShingle Yu Weng
Ning Zhang
Xiaoxian Yang
Improved Density Peak Clustering Based on Information Entropy for Ancient Character Images
IEEE Access
Density peak clustering
information entropy
ancient character image
author_facet Yu Weng
Ning Zhang
Xiaoxian Yang
author_sort Yu Weng
title Improved Density Peak Clustering Based on Information Entropy for Ancient Character Images
title_short Improved Density Peak Clustering Based on Information Entropy for Ancient Character Images
title_full Improved Density Peak Clustering Based on Information Entropy for Ancient Character Images
title_fullStr Improved Density Peak Clustering Based on Information Entropy for Ancient Character Images
title_full_unstemmed Improved Density Peak Clustering Based on Information Entropy for Ancient Character Images
title_sort improved density peak clustering based on information entropy for ancient character images
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description A large number of IoT applications require the use of supervised machine learning, a type of machine learning algorithm that requires data to be labeled before the model can be trained. Because manually labeling large datasets is a time-consuming, error-prone, and expensive task, automated machine learning methods can be used. To tackle the challenge in which an ancient character image needs to be manually labeled, this paper explores the classification method of ancient Chinese character images based on density peak clustering. We design a metric function of density peak clustering and propose an improved density peak clustering method based on information entropy for ancient book image classification. The method enumerates the distance threshold of clustering, then calculates the information entropy of the clustering result, and determines the class distance threshold by analyzing the attenuation of the information entropy, thereby completing the image clustering process. The improved metric function is used to calculate the similarity between images. A greedy strategy is used as the basis of the merging operation of the class members to achieve the purpose of increasing the degree of information entropy attenuation. The experimental results on the dataset of the Yi character images prove that the method can accurately classify unknown character images of ancient books.
topic Density peak clustering
information entropy
ancient character image
url https://ieeexplore.ieee.org/document/8741008/
work_keys_str_mv AT yuweng improveddensitypeakclusteringbasedoninformationentropyforancientcharacterimages
AT ningzhang improveddensitypeakclusteringbasedoninformationentropyforancientcharacterimages
AT xiaoxianyang improveddensitypeakclusteringbasedoninformationentropyforancientcharacterimages
_version_ 1724188511803801600