End-to-end audiovisual speech recognition based on attention fusion of SDBN and BLSTM

An end-to-end audiovisual speech recognition algorithm was proposed.In algorithm,a sparse DBN was constructed by introducing mixed l<sub>1/2</sub>norm and l<sub>1</sub>norm into Deep Belief Network with bottleneck structure to extract the spars...

全面介紹

書目詳細資料
發表在:Dianxin kexue
Main Authors: Yiming WANG, Ken CHEN, Aihaiti ABUDUSALAMU
格式: Article
語言:中文
出版: Beijing Xintong Media Co., Ltd 2019-12-01
主題:
在線閱讀:http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2019290/