A Study on Automatic Chinese Keyword Extraction Based on Search Engines and Internet Encyclopedias
Master's === National Yunlin University of Science and Technology === Department of Information Management === 103 === Keywords are a subset of the words or phrases in a document that describe its meaning. The major methods for Chinese keyword extraction include keyword-lexicon approaches, statistical approaches, and linguistic approaches. Among these methods, k...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: | 2015 |
Online Access: | http://ndltd.ncl.edu.tw/handle/75916377647627372221 |
id |
ndltd-TW-102YUNT0396066 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-102YUNT03960662016-07-02T04:21:20Z http://ndltd.ncl.edu.tw/handle/75916377647627372221 A Study on Automatic Chinese Keyword Extraction Based on Search Engines and Internet Encyclopedias 利用搜尋引擎與網路百科全書輔助中文關鍵字自動擷取之研究 ZENG,YU-HONG 曾郁閎 Master's National Yunlin University of Science and Technology Department of Information Management 103 Keywords are a subset of the words or phrases in a document that describe its meaning. The major methods for Chinese keyword extraction include keyword-lexicon approaches, statistical approaches, and linguistic approaches. Among these, keyword-lexicon approaches extract keywords with high precision and high efficiency, but building a keyword lexicon takes a lot of time and the lexicon must be maintained manually. This research presents a Chinese keyword extraction system based on the CKIP Chinese word segmentation system. The system recombines segmented words using part-of-speech (POS) combination and automatic word combination via a search engine (Google Search) and an internet encyclopedia (Wikipedia). It also builds a keyword lexicon that updates its keywords automatically, which addresses the drawbacks of keyword-lexicon approaches. Experimental results show that combining the CKIP Chinese word segmentation system, POS combination, and automatic word combination yields higher precision, and that the number of documents does not affect the performance of the keyword extraction system. Keywords: Keyword Extraction, Keyword Lexicon, Search Engine, Internet Encyclopedia HUANG,CHIN-FA 黃錦法 2015 Degree thesis ; thesis 42 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others |
sources |
NDLTD |
description |
Master's === National Yunlin University of Science and Technology === Department of Information Management === 103 === Keywords are a subset of the words or phrases in a document that describe its meaning. The major methods for Chinese keyword extraction include keyword-lexicon approaches, statistical approaches, and linguistic approaches. Among these, keyword-lexicon approaches extract keywords with high precision and high efficiency, but building a keyword lexicon takes a lot of time and the lexicon must be maintained manually.
This research presents a Chinese keyword extraction system based on the CKIP Chinese word segmentation system. The system recombines segmented words using part-of-speech (POS) combination and automatic word combination via a search engine (Google Search) and an internet encyclopedia (Wikipedia). It also builds a keyword lexicon that updates its keywords automatically, which addresses the drawbacks of keyword-lexicon approaches. Experimental results show that combining the CKIP Chinese word segmentation system, POS combination, and automatic word combination yields higher precision, and that the number of documents does not affect the performance of the keyword extraction system.
Keywords: Keyword Extraction, Keyword Lexicon, Search Engine, Internet Encyclopedia
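The recombination step described in the abstract can be illustrated with a minimal sketch. It assumes the text has already been segmented and POS-tagged (for example by CKIP) into `(word, pos)` pairs. The merge patterns in `MERGE_PATTERNS`, the coarse tag names, the function names, and the `is_known_term` callback (a stand-in for a search-engine or encyclopedia lookup) are all hypothetical; this record does not give the thesis's actual rules or thresholds.

```python
from typing import Callable, List, Set, Tuple

Token = Tuple[str, str]  # (word, coarse POS tag)

# Hypothetical POS pairs that may be merged into one candidate term.
MERGE_PATTERNS = {("N", "N"), ("A", "N"), ("V", "N")}


def recombine(tokens: List[Token], lexicon: Set[str],
              is_known_term: Callable[[str], bool]) -> List[Token]:
    """Greedily merge adjacent tokens whose POS pair is allowed and whose
    concatenation is already in the lexicon or confirmed by is_known_term."""
    merged: List[Token] = []
    for word, pos in tokens:
        if merged:
            prev_word, prev_pos = merged[-1]
            candidate = prev_word + word
            if (prev_pos, pos) in MERGE_PATTERNS and (
                candidate in lexicon or is_known_term(candidate)
            ):
                # Assumption: a merged term is treated as a noun.
                merged[-1] = (candidate, "N")
                continue
        merged.append((word, pos))
    return merged


def extract_keywords(tokens: List[Token], lexicon: Set[str],
                     is_known_term: Callable[[str], bool]) -> List[str]:
    """Return noun terms after recombination, in order of first appearance,
    and add every extracted keyword to the lexicon."""
    keywords: List[str] = []
    for word, pos in recombine(tokens, lexicon, is_known_term):
        if pos == "N" and word not in keywords:
            keywords.append(word)
            lexicon.add(word)
    return keywords


# Usage: the lookup is faked with a local set instead of a real web query.
tokens = [("搜尋", "V"), ("引擎", "N"), ("的", "DE"), ("關鍵字", "N"), ("擷取", "V")]
lexicon: Set[str] = set()
print(extract_keywords(tokens, lexicon, {"搜尋引擎"}.__contains__))
```

Passing the lookup as a callback keeps the sketch runnable offline; a real system would replace it with a query against a search engine or Wikipedia, and the growing `lexicon` would let later documents skip that query for terms already confirmed.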
|
author2 |
HUANG,CHIN-FA |
author_facet |
HUANG,CHIN-FA ZENG,YU-HONG 曾郁閎 |
author |
ZENG,YU-HONG 曾郁閎 |
spellingShingle |
ZENG,YU-HONG 曾郁閎 A Study on Automatic Chinese Keyword Extraction Based on Search Engines and Internet Encyclopedias |
author_sort |
ZENG,YU-HONG |
title |
A Study on Automatic Chinese Keyword Extraction Based on Search Engines and Internet Encyclopedias |
title_short |
A Study on Automatic Chinese Keyword Extraction Based on Search Engines and Internet Encyclopedias |
title_full |
A Study on Automatic Chinese Keyword Extraction Based on Search Engines and Internet Encyclopedias |
title_fullStr |
A Study on Automatic Chinese Keyword Extraction Based on Search Engines and Internet Encyclopedias |
title_full_unstemmed |
A Study on Automatic Chinese Keyword Extraction Based on Search Engines and Internet Encyclopedias |
title_sort |
study on automatic chinese keyword extraction based on search engines and internet encyclopedias |
publishDate |
2015 |
url |
http://ndltd.ncl.edu.tw/handle/75916377647627372221 |
work_keys_str_mv |
AT zengyuhong astudyonautomaticchinesekeywordextractionbasedonsearchenginesandinternetencyclopedias AT céngyùhóng astudyonautomaticchinesekeywordextractionbasedonsearchenginesandinternetencyclopedias AT zengyuhong lìyòngsōuxúnyǐnqíngyǔwǎnglùbǎikēquánshūfǔzhùzhōngwénguānjiànzìzìdòngxiéqǔzhīyánjiū AT céngyùhóng lìyòngsōuxúnyǐnqíngyǔwǎnglùbǎikēquánshūfǔzhùzhōngwénguānjiànzìzìdòngxiéqǔzhīyánjiū AT zengyuhong studyonautomaticchinesekeywordextractionbasedonsearchenginesandinternetencyclopedias AT céngyùhóng studyonautomaticchinesekeywordextractionbasedonsearchenginesandinternetencyclopedias |
_version_ |
1718332891164835840 |