A Study on Automatic Chinese Keyword Extraction Based on Search Engines and Internet Encyclopedias

碩士 === 國立雲林科技大學 === 資訊管理系 === 103 === Keywords are a subset of words or phrases from a document those can describe the meaning of the document. The major methods for Chinese keyword extraction are keyword lexicons approaches, statistics approaches, linguistics approaches, etc. Among these methods, k...

Full description

Bibliographic Details
Main Authors: ZENG,YU-HONG, 曾郁閎
Other Authors: HUANG,CHIN-FA
Format: Others
Language:zh-TW
Published: 2015
Online Access:http://ndltd.ncl.edu.tw/handle/75916377647627372221
id ndltd-TW-102YUNT0396066
record_format oai_dc
spelling ndltd-TW-102YUNT03960662016-07-02T04:21:20Z http://ndltd.ncl.edu.tw/handle/75916377647627372221 A Study on Automatic Chinese Keyword Extraction Based on Search Engines and Internet Encyclopedias 利用搜尋引擎與網路百科全書輔助中文關鍵字自動擷取之研究 ZENG,YU-HONG 曾郁閎 碩士 國立雲林科技大學 資訊管理系 103 Keywords are a subset of words or phrases from a document those can describe the meaning of the document. The major methods for Chinese keyword extraction are keyword lexicons approaches, statistics approaches, linguistics approaches, etc. Among these methods, keyword lexicons approaches make keyword extraction high precision and high efficient, but building keyword lexicons spends a lot of time and the maintenance of keyword lexicons is manual. This research presents a Chinese keyword extraction system based on CKIP Chinese word segmentation system. This system provides the recombination of words by using part of speech (POS) combination and automatic words combination via search engine (Google Search) and internet encyclopedia (Wikipedia). This system also focuses on building a keyword lexicon that can update its keywords automatically. The system can improve the disadvantages of keyword lexicons approaches. The results of experiments show that using the CKIP Chinese word segmentation system, POS combination and automatic words combination gains higher precision and the number of documents does not affect the performance of the keyword extraction system. Keywords: Keyword Extraction, Keyword Lexicon, Search Engine, Internet Encyclopedia HUANG,CHIN-FA 黃錦法 2015 學位論文 ; thesis 42 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立雲林科技大學 === 資訊管理系 === 103 === Keywords are a subset of words or phrases from a document those can describe the meaning of the document. The major methods for Chinese keyword extraction are keyword lexicons approaches, statistics approaches, linguistics approaches, etc. Among these methods, keyword lexicons approaches make keyword extraction high precision and high efficient, but building keyword lexicons spends a lot of time and the maintenance of keyword lexicons is manual. This research presents a Chinese keyword extraction system based on CKIP Chinese word segmentation system. This system provides the recombination of words by using part of speech (POS) combination and automatic words combination via search engine (Google Search) and internet encyclopedia (Wikipedia). This system also focuses on building a keyword lexicon that can update its keywords automatically. The system can improve the disadvantages of keyword lexicons approaches. The results of experiments show that using the CKIP Chinese word segmentation system, POS combination and automatic words combination gains higher precision and the number of documents does not affect the performance of the keyword extraction system. Keywords: Keyword Extraction, Keyword Lexicon, Search Engine, Internet Encyclopedia
author2 HUANG,CHIN-FA
author_facet HUANG,CHIN-FA
ZENG,YU-HONG
曾郁閎
author ZENG,YU-HONG
曾郁閎
spellingShingle ZENG,YU-HONG
曾郁閎
A Study on Automatic Chinese Keyword Extraction Based on Search Engines and Internet Encyclopedias
author_sort ZENG,YU-HONG
title A Study on Automatic Chinese Keyword Extraction Based on Search Engines and Internet Encyclopedias
title_short A Study on Automatic Chinese Keyword Extraction Based on Search Engines and Internet Encyclopedias
title_full A Study on Automatic Chinese Keyword Extraction Based on Search Engines and Internet Encyclopedias
title_fullStr A Study on Automatic Chinese Keyword Extraction Based on Search Engines and Internet Encyclopedias
title_full_unstemmed A Study on Automatic Chinese Keyword Extraction Based on Search Engines and Internet Encyclopedias
title_sort study on automatic chinese keyword extraction based on search engines and internet encyclopedias
publishDate 2015
url http://ndltd.ncl.edu.tw/handle/75916377647627372221
work_keys_str_mv AT zengyuhong astudyonautomaticchinesekeywordextractionbasedonsearchenginesandinternetencyclopedias
AT céngyùhóng astudyonautomaticchinesekeywordextractionbasedonsearchenginesandinternetencyclopedias
AT zengyuhong lìyòngsōuxúnyǐnqíngyǔwǎnglùbǎikēquánshūfǔzhùzhōngwénguānjiànzìzìdòngxiéqǔzhīyánjiū
AT céngyùhóng lìyòngsōuxúnyǐnqíngyǔwǎnglùbǎikēquánshūfǔzhùzhōngwénguānjiànzìzìdòngxiéqǔzhīyánjiū
AT zengyuhong studyonautomaticchinesekeywordextractionbasedonsearchenginesandinternetencyclopedias
AT céngyùhóng studyonautomaticchinesekeywordextractionbasedonsearchenginesandinternetencyclopedias
_version_ 1718332891164835840