Mining Relevant Syntactic Patterns for Chinese Text Extraction
Master's === National Central University === Graduate Institute of Computer Science and Information Engineering === 90 === Information extraction (IE) is a research topic related to TREC (Text Retrieval Conference) and MUC (Message Understanding Conference). The goal of IE is to extract specific types of information from text. IE systems for free text written in English...
Main Authors: | Dong-shun Wu, 吳東軒 |
---|---|
Other Authors: | Chia-Hui Chang |
Format: | Others |
Language: | zh-TW |
Published: | 2002 |
Online Access: | http://ndltd.ncl.edu.tw/handle/30689959598695641672 |
id |
ndltd-TW-090NCU05392044 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-090NCU05392044 2015-10-13T12:46:50Z http://ndltd.ncl.edu.tw/handle/30689959598695641672 Mining Relevant Syntactic Patterns for Chinese Text Extraction 中文資料擷取系統之設計與研究 Dong-shun Wu 吳東軒 Master's National Central University Graduate Institute of Computer Science and Information Engineering 90 Information extraction (IE) is a research topic related to TREC (Text Retrieval Conference) and MUC (Message Understanding Conference). The goal of IE is to extract specific types of information from text. IE systems for free text written in English differ from those for Chinese. In this paper we propose a simple method for extracting information from free text written in Chinese. We take training examples and encode them with their corresponding targets, then find the repeated substrings within the encoded text. In our Chinese IE system, these repeated substrings play a role similar to that of the sentence analyzers in some IE systems for English free text. In the extraction phase, we first encode the test data and then extract the targets of interest using the repeated substrings found previously. Chia-Hui Chang 張嘉惠 2002 thesis 39 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others |
sources |
NDLTD |
description |
Master's === National Central University === Graduate Institute of Computer Science and Information Engineering === 90 ===
Information extraction (IE) is a research topic related to TREC (Text Retrieval Conference) and MUC (Message Understanding Conference). The goal of IE is to extract specific types of information from text. IE systems for free text written in English differ from those for Chinese.
In this paper we propose a simple method for extracting information from free text written in Chinese. We take training examples and encode them with their corresponding targets, then find the repeated substrings within the encoded text. In our Chinese IE system, these repeated substrings play a role similar to that of the sentence analyzers in some IE systems for English free text. In the extraction phase, we first encode the test data and then extract the targets of interest using the repeated substrings found previously.
|
author2 |
Chia-Hui Chang |
author_facet |
Chia-Hui Chang Dong-shun Wu 吳東軒 |
author |
Dong-shun Wu 吳東軒 |
spellingShingle |
Dong-shun Wu 吳東軒 Mining Relevant Syntactic Patterns for Chinese Text Extraction |
author_sort |
Dong-shun Wu |
title |
Mining Relevant Syntactic Patterns for Chinese Text Extraction |
title_sort |
mining relevant syntactic patterns for chinese text extraction |
publishDate |
2002 |
url |
http://ndltd.ncl.edu.tw/handle/30689959598695641672 |
work_keys_str_mv |
AT dongshunwu miningrelevantsyntacticpatternsforchinesetextextraction AT wúdōngxuān miningrelevantsyntacticpatternsforchinesetextextraction AT dongshunwu zhōngwénzīliàoxiéqǔxìtǒngzhīshèjìyǔyánjiū AT wúdōngxuān zhōngwénzīliàoxiéqǔxìtǒngzhīshèjìyǔyánjiū |
_version_ |
1716865536063700992 |
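
The abstract above outlines three steps: encode training examples with their targets, find repeated substrings in the encoded text, and use those substrings to extract targets from test data. The record gives no further detail, so the following is only a minimal sketch of one possible reading, not the thesis's actual method. It assumes that "encoding" means replacing the target with a placeholder symbol, and that a "repeated substring" is a left/right character context around that placeholder seen in at least two examples. The names `SLOT`, `encode`, `repeated_patterns`, and `extract`, the parameters `min_count` and `max_context`, and the sample sentences are all hypothetical.

```python
from collections import Counter

# Hypothetical placeholder marking where a target occurred;
# assumed never to appear in the raw text.
SLOT = "\u25a1"


def encode(sentence, target):
    """Replace the first occurrence of the target with the slot symbol."""
    return sentence.replace(target, SLOT, 1)


def repeated_patterns(encoded, min_count=2, max_context=4):
    """Collect (left, right) contexts around the slot that recur across examples."""
    counts = Counter()
    for text in encoded:
        pos = text.find(SLOT)
        if pos < 0:
            continue
        seen = set()  # count each context at most once per example
        for l in range(1, max_context + 1):
            for r in range(1, max_context + 1):
                left = text[max(0, pos - l):pos]
                right = text[pos + 1:pos + 1 + r]
                if len(left) == l and len(right) == r:
                    seen.add((left, right))
        counts.update(seen)
    return [p for p, c in counts.items() if c >= min_count]


def extract(sentence, patterns):
    """Return the text between the longest matching left/right context, or None."""
    for left, right in sorted(patterns, key=lambda p: -(len(p[0]) + len(p[1]))):
        start = sentence.find(left)
        if start < 0:
            continue
        start += len(left)
        end = sentence.find(right, start + 1)
        if end > start:
            return sentence[start:end]
    return None


# Made-up training pairs: (sentence, target to extract).
training = [("會議地點在台北舉行", "台北"), ("會議地點在高雄舉行", "高雄")]
patterns = repeated_patterns([encode(s, t) for s, t in training])
print(extract("明年會議地點在台中舉行", patterns))  # prints 台中
```

Working at the character level avoids a word-segmentation step, which is one plausible reason repeated substrings could stand in for a sentence analyzer on Chinese text; whether the thesis works this way is not stated in the record.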