Developing a sense-tagged corpus of Chinese

Master's === Soochow University === Department of Information Science === 96 === Word sense disambiguation (WSD) is the process of tagging the specific meaning of a polysemous word in a given sentence. Many scholars have devoted themselves to WSD research, and WSD plays an important role in natural language processing. Sense tagge...


Bibliographic Details
Main Authors: Shih-yin Liu, 劉詩音
Other Authors: S. J. Ker
Format: Others
Language: zh-TW
Published: 2008
Online Access: http://ndltd.ncl.edu.tw/handle/xr497g
id ndltd-TW-096SCU05394020
record_format oai_dc
spelling ndltd-TW-096SCU05394020 2019-05-15T19:28:27Z http://ndltd.ncl.edu.tw/handle/xr497g Developing a sense-tagged corpus of Chinese 中文詞義標示集之設計與製作 Shih-yin Liu 劉詩音 Master's Soochow University Department of Information Science 96 Word sense disambiguation (WSD) is the process of tagging the specific meaning of a polysemous word in a given sentence. Many scholars have devoted themselves to WSD research, and WSD plays an important role in natural language processing. Sense-tagged corpora are very important to natural language processing, but few Chinese sense-tagged corpora exist at present. We therefore designed an all-words Chinese sense-tagged corpus containing more than 110 thousand words. We selected 56 articles from the Sinica Corpus and tagged the polysemous words automatically by combining an n-gram method with a probability method; to achieve high accuracy, we then checked the tagged senses manually. S. J. Ker 柯淑津 2008 thesis 67 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === Soochow University === Department of Information Science === 96 === Word sense disambiguation (WSD) is the process of tagging the specific meaning of a polysemous word in a given sentence. Many scholars have devoted themselves to WSD research, and WSD plays an important role in natural language processing. Sense-tagged corpora are very important to natural language processing, but few Chinese sense-tagged corpora exist at present. We therefore designed an all-words Chinese sense-tagged corpus containing more than 110 thousand words. We selected 56 articles from the Sinica Corpus and tagged the polysemous words automatically by combining an n-gram method with a probability method; to achieve high accuracy, we then checked the tagged senses manually.
author2 S. J. Ker
author_facet S. J. Ker
Shih-yin Liu
劉詩音
author Shih-yin Liu
劉詩音
spellingShingle Shih-yin Liu
劉詩音
Developing a sense-tagged corpus of Chinese
author_sort Shih-yin Liu
title Developing a sense-tagged corpus of Chinese
title_short Developing a sense-tagged corpus of Chinese
title_full Developing a sense-tagged corpus of Chinese
title_fullStr Developing a sense-tagged corpus of Chinese
title_full_unstemmed Developing a sense-tagged corpus of Chinese
title_sort developing a sense-tagged corpus of chinese
publishDate 2008
url http://ndltd.ncl.edu.tw/handle/xr497g
work_keys_str_mv AT shihyinliu developingasensetaggedcorpusofchinese
AT liúshīyīn developingasensetaggedcorpusofchinese
AT shihyinliu zhōngwéncíyìbiāoshìjízhīshèjìyǔzhìzuò
AT liúshīyīn zhōngwéncíyìbiāoshìjízhīshèjìyǔzhìzuò
_version_ 1719090358412902400