Extraction of Chinese Organization and Company Names

Bibliographic Details
Main Authors: Ren-Bin Shih, 施仁斌
Other Authors: Min-Shing Yu
Format: Others
Language: zh-TW
Published: 2008
Online Access: http://ndltd.ncl.edu.tw/handle/49380041480868849854
Description
Summary: Master's === National Chung Hsing University === Department of Computer Science and Engineering === 96 === Proper nouns fall into many categories. Among them, Chinese organization and company names (abbreviated as OCN) are not fully covered by dictionaries, so extracting the name of an organization or a company is a difficult problem in Natural Language Processing (NLP). This thesis discusses recognizing Chinese organization and company names using three kinds of evidence: (1) the combination of parts of speech within an OCN, (2) the parts of speech surrounding an OCN, and (3) the structure of an OCN in terms of Chinese frequent strings. Each analysis yields probability tables that can be used to estimate the probability that a candidate string is an OCN. Using the probability tables from all three methods together can give better predictions. Extraction of Chinese organization and company names can therefore benefit from estimating how likely a candidate OCN is to be a real organization or company name.
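
The summary does not say how the three probability tables are combined. As a rough illustration only, the sketch below assumes the three kinds of evidence are treated as independent and their log probabilities are summed. The tables, tag labels, values, smoothing floor, and the names `lookup` and `ocn_score` are all hypothetical and are not taken from the thesis.

```python
import math

# Hypothetical toy tables. The thesis's real tables are not given in this record.
# (1) probability of a part-of-speech sequence inside a candidate OCN
POS_PATTERN_PROB = {("N", "N", "N"): 0.30, ("N", "N"): 0.20}
# (2) probability of the (left, right) parts of speech around a candidate
CONTEXT_PROB = {("P", "V"): 0.25, ("C", "V"): 0.10}
# (3) probability of a structure expressed as frequent-string slots
STRUCTURE_PROB = {("head", "suffix"): 0.40, ("head",): 0.05}

FLOOR = 1e-4  # assumed smoothing value for keys missing from a table


def lookup(table, key):
    """Return the table probability for key, or FLOOR if the key is unseen."""
    return table.get(key, FLOOR)


def ocn_score(pos_pattern, context, structure):
    """Sum the log probabilities from the three tables.

    This assumes the three kinds of evidence are independent, which is a
    simplification made for this sketch and not a claim about the thesis.
    """
    return (
        math.log(lookup(POS_PATTERN_PROB, pos_pattern))
        + math.log(lookup(CONTEXT_PROB, context))
        + math.log(lookup(STRUCTURE_PROB, structure))
    )


# A candidate supported by all three tables versus one with an unseen context.
strong = ocn_score(("N", "N", "N"), ("P", "V"), ("head", "suffix"))
weak = ocn_score(("N", "N", "N"), ("X", "X"), ("head", "suffix"))
```

Under these assumptions a candidate that matches entries in all three tables scores higher than one that relies on the smoothing floor, so candidates can be ranked or filtered with a threshold chosen on held-out data.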