Study of Modulation Spectrum Normalization Techniques for Robust Speech Recognition

碩士 === 國立暨南國際大學 === 電機工程學系 === 96 === The performance of an automatic speech recognition system is often degraded due to the embedded noise in the processed speech signal. A variety of techniques have been proposed to deal with this problem, and one category of these techniques aims to normalize the...

Full description

Bibliographic Details
Main Authors: Chieh-cheng Wang, 王致程
Other Authors: Jeih-weih Hung
Format: Others
Language:zh-TW
Published: 2008
Online Access:http://ndltd.ncl.edu.tw/handle/19625906694881317221
id ndltd-TW-096NCNU0442048
record_format oai_dc
spelling ndltd-TW-096NCNU04420482016-05-16T04:10:38Z http://ndltd.ncl.edu.tw/handle/19625906694881317221 Study of Modulation Spectrum Normalization Techniques for Robust Speech Recognition 調變頻譜正規化法使用於強健語音辨識之研究 Chieh-cheng Wang 王致程 碩士 國立暨南國際大學 電機工程學系 96 The performance of an automatic speech recognition system is often degraded due to the embedded noise in the processed speech signal. A variety of techniques have been proposed to deal with this problem, and one category of these techniques aims to normalize the temporal statistics of the speech features, which is the main direction of our proposed new approaches here. In this thesis, we propose a series of noise robustness approaches, all of which attempt to normalize the modulation spectrum of speech features. They include equi-ripple temporal filtering (ERTF), least-squares spectrum fitting (LSSF) and magnitude spectrum interpolation (MSI). With these approaches, the mismatch between the modulation spectra for clean and noise-corrupted speech features is reduced, and thus the resulting new features are expected to be more noise-robust. Recognition experiments implemented on Aurora-2 digit database show that the three new approaches effectively improve the recognition accuracy under a wide range of noise-corrupted environment. Moreover, it is also shown that they can be successfully combined with some other noise robustness approaches, like CMVN and MVA, to achieve a more excellent recognition performance. Jeih-weih Hung 洪志偉 2008 學位論文 ; thesis 79 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立暨南國際大學 === 電機工程學系 === 96 === The performance of an automatic speech recognition system is often degraded due to the embedded noise in the processed speech signal. A variety of techniques have been proposed to deal with this problem, and one category of these techniques aims to normalize the temporal statistics of the speech features, which is the main direction of our proposed new approaches here. In this thesis, we propose a series of noise robustness approaches, all of which attempt to normalize the modulation spectrum of speech features. They include equi-ripple temporal filtering (ERTF), least-squares spectrum fitting (LSSF) and magnitude spectrum interpolation (MSI). With these approaches, the mismatch between the modulation spectra for clean and noise-corrupted speech features is reduced, and thus the resulting new features are expected to be more noise-robust. Recognition experiments implemented on Aurora-2 digit database show that the three new approaches effectively improve the recognition accuracy under a wide range of noise-corrupted environment. Moreover, it is also shown that they can be successfully combined with some other noise robustness approaches, like CMVN and MVA, to achieve a more excellent recognition performance.
author2 Jeih-weih Hung
author_facet Jeih-weih Hung
Chieh-cheng Wang
王致程
author Chieh-cheng Wang
王致程
spellingShingle Chieh-cheng Wang
王致程
Study of Modulation Spectrum Normalization Techniques for Robust Speech Recognition
author_sort Chieh-cheng Wang
title Study of Modulation Spectrum Normalization Techniques for Robust Speech Recognition
title_short Study of Modulation Spectrum Normalization Techniques for Robust Speech Recognition
title_full Study of Modulation Spectrum Normalization Techniques for Robust Speech Recognition
title_fullStr Study of Modulation Spectrum Normalization Techniques for Robust Speech Recognition
title_full_unstemmed Study of Modulation Spectrum Normalization Techniques for Robust Speech Recognition
title_sort study of modulation spectrum normalization techniques for robust speech recognition
publishDate 2008
url http://ndltd.ncl.edu.tw/handle/19625906694881317221
work_keys_str_mv AT chiehchengwang studyofmodulationspectrumnormalizationtechniquesforrobustspeechrecognition
AT wángzhìchéng studyofmodulationspectrumnormalizationtechniquesforrobustspeechrecognition
AT chiehchengwang diàobiànpínpǔzhèngguīhuàfǎshǐyòngyúqiángjiànyǔyīnbiànshízhīyánjiū
AT wángzhìchéng diàobiànpínpǔzhèngguīhuàfǎshǐyòngyúqiángjiànyǔyīnbiànshízhīyánjiū
_version_ 1718269609220505600