Comparison of Khasi speech representations with different spectral features and hidden Markov states

In this paper, we present a comparison of the Khasi speech representations with four different spectral features and novel extension towards the development of Khasi speech corpora. These four features include linear predictive coding (LPC), linear prediction cepstrum coefficient (LPCC), perceptual...

全面介紹

書目詳細資料
發表在:Journal of Electronic Science and Technology
Main Authors: Bronson Syiem, Sushanta Kabir Dutta, Juwesh Binong, Lairenlakpam Joyprakash Singh
格式: Article
語言:英语
出版: KeAi Communications Co., Ltd. 2021-06-01
主題:
在線閱讀:http://www.sciencedirect.com/science/article/pii/S1674862X20300987
實物特徵
總結:In this paper, we present a comparison of the Khasi speech representations with four different spectral features and novel extension towards the development of Khasi speech corpora. These four features include linear predictive coding (LPC), linear prediction cepstrum coefficient (LPCC), perceptual linear prediction (PLP), and Mel frequency cepstral coefficient (MFCC). The 10-h speech data was used for training and 3-h data for testing. For each spectral feature, different hidden Markov model (HMM) based recognizers with variations in HMM states and different Gaussian mixture models (GMMs) were built. The performance was evaluated by using the word error rate (WER). The experimental results showed that MFCC provides a better representation for Khasi speech compared with the other three spectral features.
ISSN:2666-223X