A Design of Korean Speech Recognition System
Master's thesis === National Sun Yat-sen University === Graduate Institute of Electrical Engineering === 98 === This thesis investigates design and implementation strategies for a Korean speech recognition system. The system uses the speech features of common Korean monosyllables as the basis for training and recognition. A training database of 10 utterances...
Main Authors: | Bing-Yang Wu, 吳秉洋 |
---|---|
Other Authors: | Chih-Chien Chen, 陳志堅 |
Format: | Others |
Language: | zh-TW |
Published: | 2010 |
Online Access: | http://ndltd.ncl.edu.tw/handle/09330824297180913969 |
id |
ndltd-TW-098NSYS5442094 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-098NSYS5442094 2015-10-13T18:39:46Z http://ndltd.ncl.edu.tw/handle/09330824297180913969 A Design of Korean Speech Recognition System 韓文語音辨識系統之設計研究 Bing-Yang Wu 吳秉洋 Master's thesis National Sun Yat-sen University Graduate Institute of Electrical Engineering 98 This thesis investigates design and implementation strategies for a Korean speech recognition system. The system uses the speech features of common Korean monosyllables as the basis for training and recognition. A training database of 10 utterances per monosyllable is built by applying Korean pronunciation rules. The 10 utterances are collected by reading the same monosyllables for 5 rounds, twice per round with different tones: the first pronunciation has the high pitch of tone 1, and the second has the falling pitch of tone 4. Mel-frequency cepstral coefficients and linear predictive cepstral coefficients serve as the two feature models, and a hidden Markov model serves as the recognition model. On a 2.4 GHz Pentium personal computer running Ubuntu 9.04, the system reaches a correct phrase recognition rate of 92.25% on a database of 4865 Korean phrases. The average computation time per phrase is about 1.5 seconds. Chih-Chien Chen 陳志堅 2010 thesis 63 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others |
sources |
NDLTD |
description |
Master's thesis === National Sun Yat-sen University === Graduate Institute of Electrical Engineering === 98 === This thesis investigates design and implementation strategies for a Korean speech recognition system. The system uses the speech features of common Korean monosyllables as the basis for training and recognition. A training database of 10 utterances per monosyllable is built by applying Korean pronunciation rules. The 10 utterances are collected by reading the same monosyllables for 5 rounds, twice per round with different tones: the first pronunciation has the high pitch of tone 1, and the second has the falling pitch of tone 4. Mel-frequency cepstral coefficients and linear predictive cepstral coefficients serve as the two feature models, and a hidden Markov model serves as the recognition model. On a 2.4 GHz Pentium personal computer running Ubuntu 9.04, the system reaches a correct phrase recognition rate of 92.25% on a database of 4865 Korean phrases. The average computation time per phrase is about 1.5 seconds.
|
author2 |
Chih-Chien Chen |
author_facet |
Chih-Chien Chen Bing-Yang Wu 吳秉洋 |
author |
Bing-Yang Wu 吳秉洋 |
spellingShingle |
Bing-Yang Wu 吳秉洋 A Design of Korean Speech Recognition System |
author_sort |
Bing-Yang Wu |
title |
A Design of Korean Speech Recognition System |
title_sort |
design of korean speech recognition system |
publishDate |
2010 |
url |
http://ndltd.ncl.edu.tw/handle/09330824297180913969 |
_version_ |
1718036497536385024 |