Using K-Nearest Neighbor Method and the Optimal Mel-Frequency Cepstrum Coefficient Feature to Recognize Isolated Mandarin Word for Speaker-Dependent System

碩士 === 中興大學 === 統計學研究所 === 99 === This paper is mainly to discuss the speech recognition of 337 isolation mandarin words for speaker dependent. The feature is Mel-frequency cepstrum coefficient(Mfcc), and the method is k-nearest neighbor(knn), for the recognition, we try to find out the optimal par...

Full description

Bibliographic Details
Main Authors: Jhong-Da Wu, 吳忠達
Other Authors: Chung-Bow Lee
Format: Others
Language:zh-TW
Published: 2011
Online Access:http://ndltd.ncl.edu.tw/handle/47366663775116977709
Description
Summary:碩士 === 中興大學 === 統計學研究所 === 99 === This paper is mainly to discuss the speech recognition of 337 isolation mandarin words for speaker dependent. The feature is Mel-frequency cepstrum coefficient(Mfcc), and the method is k-nearest neighbor(knn), for the recognition, we try to find out the optimal parameters to obtain high performance recognition. Six experimental factors(the length of frame, the dimension of Mfcc, the number of frame, the weight of consonant and vowel, the swing of frame and the duration of consonant) we considered in the work. We find that the best average rate of recognition in database attains 91.5%.