Learning Bilingual Parsing from Parallel Corpus and Monolingual Treebank

Master's === National Tsing Hua University === Department of Computer Science === 94 === We present a new method for learning to parse a bilingual sentence using an Inversion Transduction Grammar (ITG) trained on a parallel corpus and a monolingual treebank. The method produces a parse tree for a bilingual sentence, showing the shared syntactic structures of i...

Bibliographic Details
Main Authors: Chung-Chi Huang, 黃仲淇
Other Authors: Jason S. Chang
Format: Others
Language: en_US
Published: 2006
Online Access: http://ndltd.ncl.edu.tw/handle/75066565675156337686
id ndltd-TW-094NTHU5392036
record_format oai_dc
spelling ndltd-TW-094NTHU5392036 2016-06-01T04:14:42Z http://ndltd.ncl.edu.tw/handle/75066565675156337686 Learning Bilingual Parsing from Parallel Corpus and Monolingual Treebank 利用平行語料庫與單語樹庫之雙語剖析研究 Chung-Chi Huang 黃仲淇 Master's National Tsing Hua University Department of Computer Science 94 We present a new method for learning to parse a bilingual sentence using an Inversion Transduction Grammar (ITG) trained on a parallel corpus and a monolingual treebank. The method produces a parse tree for a bilingual sentence, showing the syntactic structure shared by the individual sentences and the differences in word order within each syntactic structure. The method involves estimating lexical translation probabilities from the output of an existing word alignment system and inferring the probabilities of ITG rules. At runtime, a CYK-style bottom-up parser constructs the most probable bilingual parse tree for any given sentence pair. We also describe an implementation of the proposed method. The experimental results indicate that the proposed model produces better word alignments than Giza++, a state-of-the-art word alignment system, in terms of alignment error rate and F-measure. The bilingual parse trees produced for the parallel corpus can be exploited to refine the initial ITG rules and to train a decoder for statistical machine translation. Jason S. Chang 張俊盛 2006 thesis 62 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description Master's === National Tsing Hua University === Department of Computer Science === 94 === We present a new method for learning to parse a bilingual sentence using an Inversion Transduction Grammar (ITG) trained on a parallel corpus and a monolingual treebank. The method produces a parse tree for a bilingual sentence, showing the syntactic structure shared by the individual sentences and the differences in word order within each syntactic structure. The method involves estimating lexical translation probabilities from the output of an existing word alignment system and inferring the probabilities of ITG rules. At runtime, a CYK-style bottom-up parser constructs the most probable bilingual parse tree for any given sentence pair. We also describe an implementation of the proposed method. The experimental results indicate that the proposed model produces better word alignments than Giza++, a state-of-the-art word alignment system, in terms of alignment error rate and F-measure. The bilingual parse trees produced for the parallel corpus can be exploited to refine the initial ITG rules and to train a decoder for statistical machine translation.
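The abstract reports results in terms of alignment error rate (AER) and F-measure but does not spell out the formulas. The following is a minimal sketch that assumes the commonly used definitions over sure (S) and possible (P) gold links; the thesis may define the metrics differently. The function name `alignment_scores` and the toy link sets are hypothetical and are not taken from the thesis.

```python
def alignment_scores(predicted, sure, possible):
    """Return (precision, recall, f_measure, aer) for one set of alignment links.

    Each link is a (source_index, target_index) pair. This follows the
    commonly used sure/possible convention, which is an assumption here.
    """
    a = set(predicted)
    s = set(sure)
    p = set(possible) | s  # by convention, every sure link is also possible
    if not a or not s:
        raise ValueError("predicted and sure link sets must be non-empty")

    precision = len(a & p) / len(a)
    recall = len(a & s) / len(s)
    if precision + recall > 0:
        f_measure = 2 * precision * recall / (precision + recall)
    else:
        f_measure = 0.0
    aer = 1.0 - (len(a & s) + len(a & p)) / (len(a) + len(s))
    return precision, recall, f_measure, aer


# Toy example (hypothetical data): four predicted links, three sure gold
# links, and one additional possible gold link.
predicted = {(0, 0), (1, 2), (2, 1), (3, 3)}
sure = {(0, 0), (1, 2), (3, 4)}
possible = {(2, 1)}

precision, recall, f_measure, aer = alignment_scores(predicted, sure, possible)
print(f"P={precision:.3f} R={recall:.3f} F={f_measure:.3f} AER={aer:.3f}")
```

Under these definitions a lower AER and a higher F-measure both indicate better agreement with the gold alignment, which is the sense in which the abstract compares the proposed model with Giza++.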
author2 Jason S. Chang
author_facet Jason S. Chang
Chung-Chi Huang
黃仲淇
author Chung-Chi Huang
黃仲淇
spellingShingle Chung-Chi Huang
黃仲淇
Learning Bilingual Parsing from Parallel Corpus and Monolingual Treebank
author_sort Chung-Chi Huang
title Learning Bilingual Parsing from Parallel Corpus and Monolingual Treebank
title_short Learning Bilingual Parsing from Parallel Corpus and Monolingual Treebank
title_full Learning Bilingual Parsing from Parallel Corpus and Monolingual Treebank
title_fullStr Learning Bilingual Parsing from Parallel Corpus and Monolingual Treebank
title_full_unstemmed Learning Bilingual Parsing from Parallel Corpus and Monolingual Treebank
title_sort learning bilingual parsing from parallel corpus and monolingual treebank
publishDate 2006
url http://ndltd.ncl.edu.tw/handle/75066565675156337686
work_keys_str_mv AT chungchihuang learningbilingualparsingfromparallelcorpusandmonolingualtreebank
AT huángzhòngqí learningbilingualparsingfromparallelcorpusandmonolingualtreebank
AT chungchihuang lìyòngpíngxíngyǔliàokùyǔdānyǔshùkùzhīshuāngyǔpōuxīyánjiū
AT huángzhòngqí lìyòngpíngxíngyǔliàokùyǔdānyǔshùkùzhīshuāngyǔpōuxīyánjiū
_version_ 1718287358231576576