Radical extraction of off-line chinese characters by stroke clustering method

碩士 === 國立中央大學 === 資訊及電子工程研究所 === 82 === In this thesis, a novel radical extraction scheme based on stroke clustering methodology is proposed to identify the type of Chinese characters together with the extraction of the corresponding radica...

Full description

Bibliographic Details
Main Authors: Tseng Yao Lung, 曾耀隆
Other Authors: Fan Kuo Chin
Format: Others
Language:en_US
Published: 1994
Online Access:http://ndltd.ncl.edu.tw/handle/25847138665265755816
id ndltd-TW-082NCU00393004
record_format oai_dc
spelling ndltd-TW-082NCU003930042016-07-18T04:09:42Z http://ndltd.ncl.edu.tw/handle/25847138665265755816 Radical extraction of off-line chinese characters by stroke clustering method 利用筆劃群聚法做離線中文字的字根抽取 Tseng Yao Lung 曾耀隆 碩士 國立中央大學 資訊及電子工程研究所 82 In this thesis, a novel radical extraction scheme based on stroke clustering methodology is proposed to identify the type of Chinese characters together with the extraction of the corresponding radicals automatically. The K-means clustering algorithm and rule-based modification method are adopted in our proposed approach to achieve the aforementioned goals. Ten character types are defined for Chinese characters in this thesis. The proposed approach consists of three main modules which are stroke clustering module, rule-based modification module, and character type decision module. In stroke clustering module, K-means clustering algorithm is employed to cluster the strokes of a character into two clusters. Each cluster represents a radical. Since the clustering result may be erroneous. The rule-based modification module is thereby developed to rearrange the mis-clustered strokes. Finally, character type decision module calculating the evaluation distances via dividing paths for each character type is issued to determine which character type the input character is. 2500 most frequently used Chinese characters are tested in our system. Five kinds of fonts are considered in our experiment. The average accurate rate of radical extraction and type decision is 92.57%. The experimental results verify the validity and the feasibility of our proposed approach. Fan Kuo Chin 范國清 1994 學位論文 ; thesis 59 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立中央大學 === 資訊及電子工程研究所 === 82 === In this thesis, a novel radical extraction scheme based on stroke clustering methodology is proposed to identify the type of Chinese characters together with the extraction of the corresponding radicals automatically. The K-means clustering algorithm and rule-based modification method are adopted in our proposed approach to achieve the aforementioned goals. Ten character types are defined for Chinese characters in this thesis. The proposed approach consists of three main modules which are stroke clustering module, rule-based modification module, and character type decision module. In stroke clustering module, K-means clustering algorithm is employed to cluster the strokes of a character into two clusters. Each cluster represents a radical. Since the clustering result may be erroneous. The rule-based modification module is thereby developed to rearrange the mis-clustered strokes. Finally, character type decision module calculating the evaluation distances via dividing paths for each character type is issued to determine which character type the input character is. 2500 most frequently used Chinese characters are tested in our system. Five kinds of fonts are considered in our experiment. The average accurate rate of radical extraction and type decision is 92.57%. The experimental results verify the validity and the feasibility of our proposed approach.
author2 Fan Kuo Chin
author_facet Fan Kuo Chin
Tseng Yao Lung
曾耀隆
author Tseng Yao Lung
曾耀隆
spellingShingle Tseng Yao Lung
曾耀隆
Radical extraction of off-line chinese characters by stroke clustering method
author_sort Tseng Yao Lung
title Radical extraction of off-line chinese characters by stroke clustering method
title_short Radical extraction of off-line chinese characters by stroke clustering method
title_full Radical extraction of off-line chinese characters by stroke clustering method
title_fullStr Radical extraction of off-line chinese characters by stroke clustering method
title_full_unstemmed Radical extraction of off-line chinese characters by stroke clustering method
title_sort radical extraction of off-line chinese characters by stroke clustering method
publishDate 1994
url http://ndltd.ncl.edu.tw/handle/25847138665265755816
work_keys_str_mv AT tsengyaolung radicalextractionofofflinechinesecharactersbystrokeclusteringmethod
AT céngyàolóng radicalextractionofofflinechinesecharactersbystrokeclusteringmethod
AT tsengyaolung lìyòngbǐhuàqúnjùfǎzuòlíxiànzhōngwénzìdezìgēnchōuqǔ
AT céngyàolóng lìyòngbǐhuàqúnjùfǎzuòlíxiànzhōngwénzìdezìgēnchōuqǔ
_version_ 1718352133469765632