Acoustic Data-Driven Subword Units Obtained through Segment Embedding and Clustering for Spontaneous Speech Recognition

We propose a method to extend a phoneme set by using a large amount of broadcast data to improve the performance of Korean spontaneous speech recognition. In the proposed method, we first extract variable-length phoneme-level segments from broadcast data and then convert them into fixed-length embed...

Bibliographic Details
Main Authors:	Jeong-Uk Bang, Sang-Hun Kim, Oh-Wook Kwon
Format:	Article
Language:	English
Published:	MDPI AG 2020-03-01
Series:	Applied Sciences
Subjects:	acoustic subword unit; phoneme set; spontaneous speech recognition
Online Access:	https://www.mdpi.com/2076-3417/10/6/2079

Similar Items