Joshua 6: A phrase-based and hierarchical statistical machine translation system
We describe the version six release of Joshua, an open-source statistical machine translation toolkit. The main difference from release five is the introduction of a simple, unlexicalized, phrase-based stack decoder. This phrase-based decoder shares a hypergraph format with the syntax-based systems,...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Sciendo
2015-10-01
|
Series: | Prague Bulletin of Mathematical Linguistics |
Online Access: | https://doi.org/10.1515/pralin-2015-0009 |
id |
doaj-ae83b5284ddb44a9b3043b4713a60441 |
---|---|
record_format |
Article |
spelling |
doaj-ae83b5284ddb44a9b3043b4713a604412021-09-05T13:59:53ZengSciendoPrague Bulletin of Mathematical Linguistics 1804-04622015-10-01104151610.1515/pralin-2015-0009pralin-2015-0009Joshua 6: A phrase-based and hierarchical statistical machine translation systemPost Matt0Cao Yuan1Kumar Gaurav2 Human Language Technology Center of Excellence, Johns Hopkins University Center for Language and Speech Processing, Johns Hopkins University Center for Language and Speech Processing, Johns Hopkins UniversityWe describe the version six release of Joshua, an open-source statistical machine translation toolkit. The main difference from release five is the introduction of a simple, unlexicalized, phrase-based stack decoder. This phrase-based decoder shares a hypergraph format with the syntax-based systems, permitting a tight coupling with the existing codebase of feature functions and hypergraph tools. Joshua 6 also includes a number of large-scale discriminative tuners and a simplified sparse feature function interface with reflection-based loading, which allows new features to be used by writing a single function. Finally, Joshua includes a number of simplifications and improvements focused on usability for both researchers and end-users, including the release of language packs — precompiled models that can be run as black boxes.https://doi.org/10.1515/pralin-2015-0009 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Post Matt Cao Yuan Kumar Gaurav |
spellingShingle |
Post Matt Cao Yuan Kumar Gaurav Joshua 6: A phrase-based and hierarchical statistical machine translation system Prague Bulletin of Mathematical Linguistics |
author_facet |
Post Matt Cao Yuan Kumar Gaurav |
author_sort |
Post Matt |
title |
Joshua 6: A phrase-based and hierarchical statistical machine translation system |
title_short |
Joshua 6: A phrase-based and hierarchical statistical machine translation system |
title_full |
Joshua 6: A phrase-based and hierarchical statistical machine translation system |
title_fullStr |
Joshua 6: A phrase-based and hierarchical statistical machine translation system |
title_full_unstemmed |
Joshua 6: A phrase-based and hierarchical statistical machine translation system |
title_sort |
joshua 6: a phrase-based and hierarchical statistical machine translation system |
publisher |
Sciendo |
series |
Prague Bulletin of Mathematical Linguistics |
issn |
1804-0462 |
publishDate |
2015-10-01 |
description |
We describe the version six release of Joshua, an open-source statistical machine translation toolkit. The main difference from release five is the introduction of a simple, unlexicalized, phrase-based stack decoder. This phrase-based decoder shares a hypergraph format with the syntax-based systems, permitting a tight coupling with the existing codebase of feature functions and hypergraph tools. Joshua 6 also includes a number of large-scale discriminative tuners and a simplified sparse feature function interface with reflection-based loading, which allows new features to be used by writing a single function. Finally, Joshua includes a number of simplifications and improvements focused on usability for both researchers and end-users, including the release of language packs — precompiled models that can be run as black boxes. |
url |
https://doi.org/10.1515/pralin-2015-0009 |
work_keys_str_mv |
AT postmatt joshua6aphrasebasedandhierarchicalstatisticalmachinetranslationsystem AT caoyuan joshua6aphrasebasedandhierarchicalstatisticalmachinetranslationsystem AT kumargaurav joshua6aphrasebasedandhierarchicalstatisticalmachinetranslationsystem |
_version_ |
1717812843000102912 |