A 2D Convolutional Gating Mechanism for Mandarin Streaming Speech Recognition

Recent research shows recurrent neural network-Transducer (RNN-T) architecture has become a mainstream approach for streaming speech recognition. In this work, we investigate the VGG2 network as the input layer to the RNN-T in streaming speech recognition. Specifically, before the input feature is p...

Full description

Bibliographic Details
Main Authors: Xintong Wang, Chuangang Zhao
Format: Article
Language:English
Published: MDPI AG 2021-04-01
Series:Information
Subjects:
VGG
Online Access:https://www.mdpi.com/2078-2489/12/4/165