Improved Density Peak Clustering Based on Information Entropy for Ancient Character Images
A large number of IoT applications require the use of supervised machine learning, a type of machine learning algorithm that requires data to be labeled before the model can be trained. Because manually labeling large datasets is a time-consuming, error-prone, and expensive task, automated machine l...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8741008/ |
id |
doaj-08d27f781a054cf881368d040adbccff |
---|---|
record_format |
Article |
spelling |
doaj-08d27f781a054cf881368d040adbccff2021-03-30T00:12:16ZengIEEEIEEE Access2169-35362019-01-017816918170010.1109/ACCESS.2019.29236948741008Improved Density Peak Clustering Based on Information Entropy for Ancient Character ImagesYu Weng0https://orcid.org/0000-0002-0787-550XNing Zhang1https://orcid.org/0000-0002-0580-1769Xiaoxian Yang2School of Information Engineering, Minzu University of China, Beijing, ChinaSchool of Information Engineering, Minzu University of China, Beijing, ChinaSchool of Computer and Information Engineering, Shanghai Polytechnic University, Shanghai, ChinaA large number of IoT applications require the use of supervised machine learning, a type of machine learning algorithm that requires data to be labeled before the model can be trained. Because manually labeling large datasets is a time-consuming, error-prone, and expensive task, automated machine learning methods can be used. To tackle the challenge in which an ancient character image needs to be manually labeled, this paper explores the classification method of ancient Chinese character images based on density peak clustering. We design a metric function of density peak clustering and propose an improved density peak clustering method based on information entropy for ancient book image classification. The method enumerates the distance threshold of clustering, then calculates the information entropy of the clustering result, and determines the class distance threshold by analyzing the attenuation of the information entropy, thereby completing the image clustering process. The improved metric function is used to calculate the similarity between images. A greedy strategy is used as the basis of the merging operation of the class members to achieve the purpose of increasing the degree of information entropy attenuation. The experimental results on the dataset of the Yi character images prove that the method can accurately classify unknown character images of ancient books.https://ieeexplore.ieee.org/document/8741008/Density peak clusteringinformation entropyancient character image |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yu Weng Ning Zhang Xiaoxian Yang |
spellingShingle |
Yu Weng Ning Zhang Xiaoxian Yang Improved Density Peak Clustering Based on Information Entropy for Ancient Character Images IEEE Access Density peak clustering information entropy ancient character image |
author_facet |
Yu Weng Ning Zhang Xiaoxian Yang |
author_sort |
Yu Weng |
title |
Improved Density Peak Clustering Based on Information Entropy for Ancient Character Images |
title_short |
Improved Density Peak Clustering Based on Information Entropy for Ancient Character Images |
title_full |
Improved Density Peak Clustering Based on Information Entropy for Ancient Character Images |
title_fullStr |
Improved Density Peak Clustering Based on Information Entropy for Ancient Character Images |
title_full_unstemmed |
Improved Density Peak Clustering Based on Information Entropy for Ancient Character Images |
title_sort |
improved density peak clustering based on information entropy for ancient character images |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
A large number of IoT applications require the use of supervised machine learning, a type of machine learning algorithm that requires data to be labeled before the model can be trained. Because manually labeling large datasets is a time-consuming, error-prone, and expensive task, automated machine learning methods can be used. To tackle the challenge in which an ancient character image needs to be manually labeled, this paper explores the classification method of ancient Chinese character images based on density peak clustering. We design a metric function of density peak clustering and propose an improved density peak clustering method based on information entropy for ancient book image classification. The method enumerates the distance threshold of clustering, then calculates the information entropy of the clustering result, and determines the class distance threshold by analyzing the attenuation of the information entropy, thereby completing the image clustering process. The improved metric function is used to calculate the similarity between images. A greedy strategy is used as the basis of the merging operation of the class members to achieve the purpose of increasing the degree of information entropy attenuation. The experimental results on the dataset of the Yi character images prove that the method can accurately classify unknown character images of ancient books. |
topic |
Density peak clustering information entropy ancient character image |
url |
https://ieeexplore.ieee.org/document/8741008/ |
work_keys_str_mv |
AT yuweng improveddensitypeakclusteringbasedoninformationentropyforancientcharacterimages AT ningzhang improveddensitypeakclusteringbasedoninformationentropyforancientcharacterimages AT xiaoxianyang improveddensitypeakclusteringbasedoninformationentropyforancientcharacterimages |
_version_ |
1724188511803801600 |