Dynamic Bayesian Networks for Audio-Visual Speech Recognition

<p/> <p>The use of visual features in audio-visual speech recognition (AVSR) is justified by both the speech generation mechanism, which is essentially bimodal in audio and visual representation, and by the need for features that are invariant to acoustic noise perturbation. As a result,...

詳細記述

書誌詳細
出版年:EURASIP Journal on Advances in Signal Processing
主要な著者: Liang Luhong, Pi Xiaobo, Liu Xiaoxing, Murphy Kevin, Nefian Ara V
フォーマット: 論文
言語:英語
出版事項: SpringerOpen 2002-01-01
主題:
オンライン・アクセス:http://dx.doi.org/10.1155/S1110865702206083