Mining Relevant Syntactic Patterns for Chinese Text Extraction

Master's === National Central University === Graduate Institute of Computer Science and Information Engineering === 90 === Information extraction (IE) is a research topic related to TREC (Text Retrieval Conference) and MUC (Message Understanding Conference). The goal of IE is to extract specific types of information from text. IE systems for free text written in English...


Bibliographic Details
Main Authors: Dong-shun Wu, 吳東軒
Other Authors: Chia-Hui Chang
Format: Others
Language: zh-TW
Published: 2002
Online Access: http://ndltd.ncl.edu.tw/handle/30689959598695641672
id ndltd-TW-090NCU05392044
record_format oai_dc
spelling ndltd-TW-090NCU05392044 2015-10-13T12:46:50Z http://ndltd.ncl.edu.tw/handle/30689959598695641672 Mining Relevant Syntactic Patterns for Chinese Text Extraction 中文資料擷取系統之設計與研究 Dong-shun Wu 吳東軒 Master's National Central University Graduate Institute of Computer Science and Information Engineering 90 Information extraction (IE) is a research topic related to TREC (Text Retrieval Conference) and MUC (Message Understanding Conference). The goal of IE is to extract specific types of information from text. IE systems for free text written in English differ from those for Chinese. In this paper we propose a simple method for extracting information from free text written in Chinese. We take training examples and encode them with their corresponding targets, then find the repeated substrings within the encoded text. In our Chinese IE system, these repeated substrings play a role similar to that of the sentence analyzers in some IE systems for English free text. In the extraction phase, we first encode the test data and then extract the targets of interest using the previously found repeated substrings. Chia-Hui Chang 張嘉惠 2002 學位論文 ; thesis 39 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === National Central University === Graduate Institute of Computer Science and Information Engineering === 90 === Information extraction (IE) is a research topic related to TREC (Text Retrieval Conference) and MUC (Message Understanding Conference). The goal of IE is to extract specific types of information from text. IE systems for free text written in English differ from those for Chinese. In this paper we propose a simple method for extracting information from free text written in Chinese. We take training examples and encode them with their corresponding targets, then find the repeated substrings within the encoded text. In our Chinese IE system, these repeated substrings play a role similar to that of the sentence analyzers in some IE systems for English free text. In the extraction phase, we first encode the test data and then extract the targets of interest using the previously found repeated substrings.
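The abstract names three steps (encode training examples with their targets, find repeated substrings, reuse them for extraction) but does not give the encoding scheme or the substring algorithm. The following is only a minimal sketch of that general idea under assumptions of our own: the target is replaced by a single placeholder character, a "pattern" is a pair of left and right character contexts around the placeholder, and a pattern counts as repeated if it occurs in at least two training examples. The names SLOT, encode, mine_patterns, and extract, and the sample sentences, are hypothetical and do not come from the thesis.

```python
import re
from collections import Counter

# Assumption: a private-use character that never occurs in the input text.
SLOT = "\uE000"


def encode(text, target):
    """Replace the annotated target string with the placeholder."""
    return text.replace(target, SLOT)


def mine_patterns(examples, max_context=3, min_count=2):
    """Return (left, right) contexts around SLOT seen in >= min_count examples."""
    counts = Counter()
    for text, target in examples:
        enc = encode(text, target)
        seen = set()  # count each pattern once per example
        for m in re.finditer(SLOT, enc):
            i = m.start()
            for l in range(1, max_context + 1):
                for r in range(1, max_context + 1):
                    if i - l < 0 or i + 1 + r > len(enc):
                        continue
                    left, right = enc[i - l:i], enc[i + 1:i + 1 + r]
                    if SLOT not in left and SLOT not in right:
                        seen.add((left, right))
        counts.update(seen)
    repeated = [p for p, c in counts.items() if c >= min_count]
    # Try longer (more specific) contexts first.
    return sorted(repeated, key=lambda p: -(len(p[0]) + len(p[1])))


def extract(text, patterns, max_target_len=10):
    """Return the strings found between a mined left and right context."""
    results = []
    for left, right in patterns:
        regex = re.escape(left) + "(.{1,%d}?)" % max_target_len + re.escape(right)
        for m in re.finditer(regex, text):
            if m.group(1) not in results:
                results.append(m.group(1))
    return results


# Hypothetical training data: (sentence, target) pairs.
examples = [("會議在台北舉行", "台北"), ("比賽在高雄舉行", "高雄")]
patterns = mine_patterns(examples)
print(extract("展覽在台中舉行", patterns))  # prints ['台中']
```

Working on characters rather than words avoids the need for a Chinese word segmenter in this sketch; whether the thesis makes the same choice is not stated in the abstract.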
author2 Chia-Hui Chang
author_facet Chia-Hui Chang
Dong-shun Wu
吳東軒
author Dong-shun Wu
吳東軒
spellingShingle Dong-shun Wu
吳東軒
Mining Relevant Syntactic Patterns for Chinese Text Extraction
author_sort Dong-shun Wu
title Mining Relevant Syntactic Patterns for Chinese Text Extraction
title_short Mining Relevant Syntactic Patterns for Chinese Text Extraction
title_full Mining Relevant Syntactic Patterns for Chinese Text Extraction
title_fullStr Mining Relevant Syntactic Patterns for Chinese Text Extraction
title_full_unstemmed Mining Relevant Syntactic Patterns for Chinese Text Extraction
title_sort mining relevant syntactic patterns for chinese text extraction
publishDate 2002
url http://ndltd.ncl.edu.tw/handle/30689959598695641672