Joshua 6: A phrase-based and hierarchical statistical machine translation system

We describe the version six release of Joshua, an open-source statistical machine translation toolkit. The main difference from release five is the introduction of a simple, unlexicalized, phrase-based stack decoder. This phrase-based decoder shares a hypergraph format with the syntax-based systems,...

Full description

Bibliographic Details
Main Authors: Post Matt, Cao Yuan, Kumar Gaurav
Format: Article
Language:English
Published: Sciendo 2015-10-01
Series:Prague Bulletin of Mathematical Linguistics
Online Access:https://doi.org/10.1515/pralin-2015-0009
id doaj-ae83b5284ddb44a9b3043b4713a60441
record_format Article
spelling doaj-ae83b5284ddb44a9b3043b4713a604412021-09-05T13:59:53ZengSciendoPrague Bulletin of Mathematical Linguistics 1804-04622015-10-01104151610.1515/pralin-2015-0009pralin-2015-0009Joshua 6: A phrase-based and hierarchical statistical machine translation systemPost Matt0Cao Yuan1Kumar Gaurav2 Human Language Technology Center of Excellence, Johns Hopkins University Center for Language and Speech Processing, Johns Hopkins University Center for Language and Speech Processing, Johns Hopkins UniversityWe describe the version six release of Joshua, an open-source statistical machine translation toolkit. The main difference from release five is the introduction of a simple, unlexicalized, phrase-based stack decoder. This phrase-based decoder shares a hypergraph format with the syntax-based systems, permitting a tight coupling with the existing codebase of feature functions and hypergraph tools. Joshua 6 also includes a number of large-scale discriminative tuners and a simplified sparse feature function interface with reflection-based loading, which allows new features to be used by writing a single function. Finally, Joshua includes a number of simplifications and improvements focused on usability for both researchers and end-users, including the release of language packs — precompiled models that can be run as black boxes.https://doi.org/10.1515/pralin-2015-0009
collection DOAJ
language English
format Article
sources DOAJ
author Post Matt
Cao Yuan
Kumar Gaurav
spellingShingle Post Matt
Cao Yuan
Kumar Gaurav
Joshua 6: A phrase-based and hierarchical statistical machine translation system
Prague Bulletin of Mathematical Linguistics
author_facet Post Matt
Cao Yuan
Kumar Gaurav
author_sort Post Matt
title Joshua 6: A phrase-based and hierarchical statistical machine translation system
title_short Joshua 6: A phrase-based and hierarchical statistical machine translation system
title_full Joshua 6: A phrase-based and hierarchical statistical machine translation system
title_fullStr Joshua 6: A phrase-based and hierarchical statistical machine translation system
title_full_unstemmed Joshua 6: A phrase-based and hierarchical statistical machine translation system
title_sort joshua 6: a phrase-based and hierarchical statistical machine translation system
publisher Sciendo
series Prague Bulletin of Mathematical Linguistics
issn 1804-0462
publishDate 2015-10-01
description We describe the version six release of Joshua, an open-source statistical machine translation toolkit. The main difference from release five is the introduction of a simple, unlexicalized, phrase-based stack decoder. This phrase-based decoder shares a hypergraph format with the syntax-based systems, permitting a tight coupling with the existing codebase of feature functions and hypergraph tools. Joshua 6 also includes a number of large-scale discriminative tuners and a simplified sparse feature function interface with reflection-based loading, which allows new features to be used by writing a single function. Finally, Joshua includes a number of simplifications and improvements focused on usability for both researchers and end-users, including the release of language packs — precompiled models that can be run as black boxes.
url https://doi.org/10.1515/pralin-2015-0009
work_keys_str_mv AT postmatt joshua6aphrasebasedandhierarchicalstatisticalmachinetranslationsystem
AT caoyuan joshua6aphrasebasedandhierarchicalstatisticalmachinetranslationsystem
AT kumargaurav joshua6aphrasebasedandhierarchicalstatisticalmachinetranslationsystem
_version_ 1717812843000102912