Developing a sense-tagged corpus of Chinese
碩士 === 東吳大學 === 資訊科學系 === 96 === Word sense disambiguation (WSD) is a process that tags the specific meaning of a polysemy in a given sentence. Already a lot of scholars have been devoted to the research of WSD at present, and WSD has been an important role in nature language processing. Sense tagge...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2008
|
Online Access: | http://ndltd.ncl.edu.tw/handle/xr497g |
id |
ndltd-TW-096SCU05394020 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-096SCU053940202019-05-15T19:28:27Z http://ndltd.ncl.edu.tw/handle/xr497g Developing a sense-tagged corpus of Chinese 中文詞義標示集之設計與製作 Shih-yin Liu 劉詩音 碩士 東吳大學 資訊科學系 96 Word sense disambiguation (WSD) is a process that tags the specific meaning of a polysemy in a given sentence. Already a lot of scholars have been devoted to the research of WSD at present, and WSD has been an important role in nature language processing. Sense tagged corpus occupies very important position to natural language processing, but there are few Chinese sense tagged corpus at present. So we designed a all-word Chinese sense tagged corpus which contained more than 110 thousand word. We selected 56 articles from Sinica Corpus and tagged the polysemy combining the n-gram method and probability method automatically, for reach high result of accuracy, we checked the tagged sense manually. S. J. Ker 柯淑津 2008 學位論文 ; thesis 67 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 東吳大學 === 資訊科學系 === 96 === Word sense disambiguation (WSD) is a process that tags the specific meaning of a polysemy in a given sentence. Already a lot of scholars have been devoted to the research of WSD at present, and WSD has been an important role in nature language processing. Sense tagged corpus occupies very important position to natural language processing, but there are few Chinese sense tagged corpus at present. So we designed a all-word Chinese sense tagged corpus which contained more than 110 thousand word. We selected 56 articles from Sinica Corpus and tagged the polysemy combining the n-gram method and probability method automatically, for reach high result of accuracy, we checked the tagged sense manually.
|
author2 |
S. J. Ker |
author_facet |
S. J. Ker Shih-yin Liu 劉詩音 |
author |
Shih-yin Liu 劉詩音 |
spellingShingle |
Shih-yin Liu 劉詩音 Developing a sense-tagged corpus of Chinese |
author_sort |
Shih-yin Liu |
title |
Developing a sense-tagged corpus of Chinese |
title_short |
Developing a sense-tagged corpus of Chinese |
title_full |
Developing a sense-tagged corpus of Chinese |
title_fullStr |
Developing a sense-tagged corpus of Chinese |
title_full_unstemmed |
Developing a sense-tagged corpus of Chinese |
title_sort |
developing a sense-tagged corpus of chinese |
publishDate |
2008 |
url |
http://ndltd.ncl.edu.tw/handle/xr497g |
work_keys_str_mv |
AT shihyinliu developingasensetaggedcorpusofchinese AT liúshīyīn developingasensetaggedcorpusofchinese AT shihyinliu zhōngwéncíyìbiāoshìjízhīshèjìyǔzhìzuò AT liúshīyīn zhōngwéncíyìbiāoshìjízhīshèjìyǔzhìzuò |
_version_ |
1719090358412902400 |