A Design of Korean Speech Recognition System

碩士 === 國立中山大學 === 電機工程學系研究所 === 98 === This thesis investigates the design and implementation strategies for a Korean speech recognition system. It utilizes the speech features of the common Korean mono-syllables as the major training and recognition methodology. A training database of 10 utterances...

Full description

Bibliographic Details
Main Authors: Bing-Yang Wu, 吳秉洋
Other Authors: Chih-Chien Chen
Format: Others
Language:zh-TW
Published: 2010
Online Access:http://ndltd.ncl.edu.tw/handle/09330824297180913969
id ndltd-TW-098NSYS5442094
record_format oai_dc
spelling ndltd-TW-098NSYS54420942015-10-13T18:39:46Z http://ndltd.ncl.edu.tw/handle/09330824297180913969 A Design of Korean Speech Recognition System 韓文語音辨識系統之設計研究 Bing-Yang Wu 吳秉洋 碩士 國立中山大學 電機工程學系研究所 98 This thesis investigates the design and implementation strategies for a Korean speech recognition system. It utilizes the speech features of the common Korean mono-syllables as the major training and recognition methodology. A training database of 10 utterances per mono-syllable is established by applying Korean pronunciation rules. These 10 utterances are collected through reading 5 rounds of the same mono-syllables twice with different tones. The first pronounced pattern has high pitch of tone 1,while the second one has falling pitch of tone 4.Mel-frequency cepstral coefficients, linear predictive cepstrum coefficients, and hidden Markov model are used as the two feature models and the recognition model respectively. Under the Pentium 2.4 GHz personal computer and Ubuntu 9.04 operating system environment, a correct phrase recognition rate of 92.25% can be reached for a 4865 Korean phrase database. The average computation time for each phrase is about 1.5 seconds. Chih-Chien Chen 陳志堅 2010 學位論文 ; thesis 63 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立中山大學 === 電機工程學系研究所 === 98 === This thesis investigates the design and implementation strategies for a Korean speech recognition system. It utilizes the speech features of the common Korean mono-syllables as the major training and recognition methodology. A training database of 10 utterances per mono-syllable is established by applying Korean pronunciation rules. These 10 utterances are collected through reading 5 rounds of the same mono-syllables twice with different tones. The first pronounced pattern has high pitch of tone 1,while the second one has falling pitch of tone 4.Mel-frequency cepstral coefficients, linear predictive cepstrum coefficients, and hidden Markov model are used as the two feature models and the recognition model respectively. Under the Pentium 2.4 GHz personal computer and Ubuntu 9.04 operating system environment, a correct phrase recognition rate of 92.25% can be reached for a 4865 Korean phrase database. The average computation time for each phrase is about 1.5 seconds.
author2 Chih-Chien Chen
author_facet Chih-Chien Chen
Bing-Yang Wu
吳秉洋
author Bing-Yang Wu
吳秉洋
spellingShingle Bing-Yang Wu
吳秉洋
A Design of Korean Speech Recognition System
author_sort Bing-Yang Wu
title A Design of Korean Speech Recognition System
title_short A Design of Korean Speech Recognition System
title_full A Design of Korean Speech Recognition System
title_fullStr A Design of Korean Speech Recognition System
title_full_unstemmed A Design of Korean Speech Recognition System
title_sort design of korean speech recognition system
publishDate 2010
url http://ndltd.ncl.edu.tw/handle/09330824297180913969
work_keys_str_mv AT bingyangwu adesignofkoreanspeechrecognitionsystem
AT wúbǐngyáng adesignofkoreanspeechrecognitionsystem
AT bingyangwu hánwényǔyīnbiànshíxìtǒngzhīshèjìyánjiū
AT wúbǐngyáng hánwényǔyīnbiànshíxìtǒngzhīshèjìyánjiū
AT bingyangwu designofkoreanspeechrecognitionsystem
AT wúbǐngyáng designofkoreanspeechrecognitionsystem
_version_ 1718036497536385024