Learning Bilingual Parsing from Parallel Corpus and Monolingual Treebank
碩士 === 國立清華大學 === 資訊工程學系 === 94 === We present a new method for learning to parse a bilingual sentence using Inversion Transduction Grammar trained on a parallel corpus and a monolingual treebank. The method produces a parse tree for a bilingual sentence, showing the shared syntactic structures of i...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2006
|
Online Access: | http://ndltd.ncl.edu.tw/handle/75066565675156337686 |
id |
ndltd-TW-094NTHU5392036 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-094NTHU53920362016-06-01T04:14:42Z http://ndltd.ncl.edu.tw/handle/75066565675156337686 Learning Bilingual Parsing from Parallel Corpus and Monolingual Treebank 利用平行語料庫與單語樹庫之雙語剖析研究 Chung-Chi Huang 黃仲淇 碩士 國立清華大學 資訊工程學系 94 We present a new method for learning to parse a bilingual sentence using Inversion Transduction Grammar trained on a parallel corpus and a monolingual treebank. The method produces a parse tree for a bilingual sentence, showing the shared syntactic structures of indivisual sentence and the differences of word order within a syntactic structure. The method involves estimating lexical translation probability based on an existing word alignment system, and inferring probability of ITG rules. At runtime, a CYK-styled bottom-up parser is employed to construct the most probable bilingual parse tree for any given sentene pair. We also describe an implementation of the proposed method. The experimental results indicate the proposed model produces word alignments better than those produced by Giza++, a state-of-the-art word alignment system, in terms of alignment error rate and F-measure. The bilingual parse trees produced for the parallel corpus can be exploited to refine the initial ITG rules and train a decoder for statistical machine translation. Jason S. Chang 張俊盛 2006 學位論文 ; thesis 62 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立清華大學 === 資訊工程學系 === 94 === We present a new method for learning to parse a bilingual sentence using Inversion Transduction Grammar trained on a parallel corpus and a monolingual treebank. The method produces a parse tree for a bilingual sentence, showing the shared syntactic structures of indivisual sentence and the differences of word order within a syntactic structure. The method involves estimating lexical translation probability based on an existing word alignment system, and inferring probability of ITG rules. At runtime, a CYK-styled bottom-up parser is employed to construct the most probable bilingual parse tree for any given sentene pair. We also describe an implementation of the proposed method. The experimental results indicate the proposed model produces word alignments better than those produced by Giza++, a state-of-the-art word alignment system, in terms of alignment error rate and F-measure. The bilingual parse trees produced for the parallel corpus can be exploited to refine the initial ITG rules and train a decoder for statistical machine translation.
|
author2 |
Jason S. Chang |
author_facet |
Jason S. Chang Chung-Chi Huang 黃仲淇 |
author |
Chung-Chi Huang 黃仲淇 |
spellingShingle |
Chung-Chi Huang 黃仲淇 Learning Bilingual Parsing from Parallel Corpus and Monolingual Treebank |
author_sort |
Chung-Chi Huang |
title |
Learning Bilingual Parsing from Parallel Corpus and Monolingual Treebank |
title_short |
Learning Bilingual Parsing from Parallel Corpus and Monolingual Treebank |
title_full |
Learning Bilingual Parsing from Parallel Corpus and Monolingual Treebank |
title_fullStr |
Learning Bilingual Parsing from Parallel Corpus and Monolingual Treebank |
title_full_unstemmed |
Learning Bilingual Parsing from Parallel Corpus and Monolingual Treebank |
title_sort |
learning bilingual parsing from parallel corpus and monolingual treebank |
publishDate |
2006 |
url |
http://ndltd.ncl.edu.tw/handle/75066565675156337686 |
work_keys_str_mv |
AT chungchihuang learningbilingualparsingfromparallelcorpusandmonolingualtreebank AT huángzhòngqí learningbilingualparsingfromparallelcorpusandmonolingualtreebank AT chungchihuang lìyòngpíngxíngyǔliàokùyǔdānyǔshùkùzhīshuāngyǔpōuxīyánjiū AT huángzhòngqí lìyòngpíngxíngyǔliàokùyǔdānyǔshùkùzhīshuāngyǔpōuxīyánjiū |
_version_ |
1718287358231576576 |