An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences

Sign language translation (SLT) is an important application to bridge the communication gap between deaf and hearing people. In recent years, the research on the SLT based on neural translation frameworks has attracted wide attention. Despite the progress, current SLT research is still in the initia...

Full description

Bibliographic Details
Main Authors: Jiangbin Zheng, Zheng Zhao, Min Chen, Jing Chen, Chong Wu, Yidong Chen, Xiaodong Shi, Yiqi Tong
Format: Article
Language:English
Published: Hindawi Limited 2020-01-01
Series:Computational Intelligence and Neuroscience
Online Access:http://dx.doi.org/10.1155/2020/8816125
id doaj-79c1524018b64eafbc371a039e6054b6
record_format Article
spelling doaj-79c1524018b64eafbc371a039e6054b62020-11-25T04:04:43ZengHindawi LimitedComputational Intelligence and Neuroscience1687-52651687-52732020-01-01202010.1155/2020/88161258816125An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign SentencesJiangbin Zheng0Zheng Zhao1Min Chen2Jing Chen3Chong Wu4Yidong Chen5Xiaodong Shi6Yiqi Tong7Department of Artificial Intelligence, School of Informatics, Xiamen University, Xiamen 361005, ChinaChina Mobile (Suzhou) Software Technology Co., LTD, Suzhou 215000, ChinaChina Mobile (Suzhou) Software Technology Co., LTD, Suzhou 215000, ChinaChina Mobile (Suzhou) Software Technology Co., LTD, Suzhou 215000, ChinaDepartment of Electrical Engineering, City University of Hong Kong, Kowloon, Hong KongDepartment of Artificial Intelligence, School of Informatics, Xiamen University, Xiamen 361005, ChinaDepartment of Artificial Intelligence, School of Informatics, Xiamen University, Xiamen 361005, ChinaDepartment of Artificial Intelligence, School of Informatics, Xiamen University, Xiamen 361005, ChinaSign language translation (SLT) is an important application to bridge the communication gap between deaf and hearing people. In recent years, the research on the SLT based on neural translation frameworks has attracted wide attention. Despite the progress, current SLT research is still in the initial stage. In fact, current systems perform poorly in processing long sign sentences, which often involve long-distance dependencies and require large resource consumption. To tackle this problem, we propose two explainable adaptations to the traditional neural SLT models using optimized tokenization-related modules. First, we introduce a frame stream density compression (FSDC) algorithm for detecting and reducing the redundant similar frames, which effectively shortens the long sign sentences without losing information. Then, we replace the traditional encoder in a neural machine translation (NMT) module with an improved architecture, which incorporates a temporal convolution (T-Conv) unit and a dynamic hierarchical bidirectional GRU (DH-BiGRU) unit sequentially. The improved component takes the temporal tokenization information into consideration to extract deeper information with reasonable resource consumption. Our experiments on the RWTH-PHOENIX-Weather 2014T dataset show that the proposed model outperforms the state-of-the-art baseline up to about 1.5+ BLEU-4 score gains.http://dx.doi.org/10.1155/2020/8816125
collection DOAJ
language English
format Article
sources DOAJ
author Jiangbin Zheng
Zheng Zhao
Min Chen
Jing Chen
Chong Wu
Yidong Chen
Xiaodong Shi
Yiqi Tong
spellingShingle Jiangbin Zheng
Zheng Zhao
Min Chen
Jing Chen
Chong Wu
Yidong Chen
Xiaodong Shi
Yiqi Tong
An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences
Computational Intelligence and Neuroscience
author_facet Jiangbin Zheng
Zheng Zhao
Min Chen
Jing Chen
Chong Wu
Yidong Chen
Xiaodong Shi
Yiqi Tong
author_sort Jiangbin Zheng
title An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences
title_short An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences
title_full An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences
title_fullStr An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences
title_full_unstemmed An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences
title_sort improved sign language translation model with explainable adaptations for processing long sign sentences
publisher Hindawi Limited
series Computational Intelligence and Neuroscience
issn 1687-5265
1687-5273
publishDate 2020-01-01
description Sign language translation (SLT) is an important application to bridge the communication gap between deaf and hearing people. In recent years, the research on the SLT based on neural translation frameworks has attracted wide attention. Despite the progress, current SLT research is still in the initial stage. In fact, current systems perform poorly in processing long sign sentences, which often involve long-distance dependencies and require large resource consumption. To tackle this problem, we propose two explainable adaptations to the traditional neural SLT models using optimized tokenization-related modules. First, we introduce a frame stream density compression (FSDC) algorithm for detecting and reducing the redundant similar frames, which effectively shortens the long sign sentences without losing information. Then, we replace the traditional encoder in a neural machine translation (NMT) module with an improved architecture, which incorporates a temporal convolution (T-Conv) unit and a dynamic hierarchical bidirectional GRU (DH-BiGRU) unit sequentially. The improved component takes the temporal tokenization information into consideration to extract deeper information with reasonable resource consumption. Our experiments on the RWTH-PHOENIX-Weather 2014T dataset show that the proposed model outperforms the state-of-the-art baseline up to about 1.5+ BLEU-4 score gains.
url http://dx.doi.org/10.1155/2020/8816125
work_keys_str_mv AT jiangbinzheng animprovedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences
AT zhengzhao animprovedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences
AT minchen animprovedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences
AT jingchen animprovedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences
AT chongwu animprovedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences
AT yidongchen animprovedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences
AT xiaodongshi animprovedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences
AT yiqitong animprovedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences
AT jiangbinzheng improvedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences
AT zhengzhao improvedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences
AT minchen improvedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences
AT jingchen improvedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences
AT chongwu improvedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences
AT yidongchen improvedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences
AT xiaodongshi improvedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences
AT yiqitong improvedsignlanguagetranslationmodelwithexplainableadaptationsforprocessinglongsignsentences
_version_ 1715053413785403392