Multi-Angle Lipreading with Angle Classification-Based Feature Extraction and Its Application to Audio-Visual Speech Recognition

Recently, automatic speech recognition (ASR) and visual speech recognition (VSR) have been widely researched owing to the development in deep learning. Most VSR research works focus only on frontal face images. However, assuming real scenes, it is obvious that a VSR system should correctly recognize...

Full description

Bibliographic Details
Main Authors: Shinnosuke Isobe, Satoshi Tamura, Satoru Hayamizu, Yuuto Gotoh, Masaki Nose
Format: Article
Language:English
Published: MDPI AG 2021-07-01
Series:Future Internet
Subjects:
Online Access:https://www.mdpi.com/1999-5903/13/7/182