The Techniques of Energy Contour Enhancement, Spectral Exponent Adjustment and Signal Autocorrelation for Robust Speech Recognition
碩士 === 國立暨南國際大學 === 電機工程學系 === 93 === In recent decades, more and more researchers are dedicated to developing the techniques of automatic speech recognition. In order to apply the speech recognizer in a real environment, the mismatch between training and application conditions must be handled caref...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2005
|
Online Access: | http://ndltd.ncl.edu.tw/handle/12092403064882874846 |
id |
ndltd-TW-093NCNU0442087 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-093NCNU04420872016-06-10T04:16:32Z http://ndltd.ncl.edu.tw/handle/12092403064882874846 The Techniques of Energy Contour Enhancement, Spectral Exponent Adjustment and Signal Autocorrelation for Robust Speech Recognition 運用頻譜次方調整和自相關法整合能量強化技術之強健性語音辨識 Tsung-Cheng Lin 林宗成 碩士 國立暨南國際大學 電機工程學系 93 In recent decades, more and more researchers are dedicated to developing the techniques of automatic speech recognition. In order to apply the speech recognizer in a real environment, the mismatch between training and application conditions must be handled carefully. For example, various kinds of additive noise in the application environment often degrades the performance of the speech recognizer seriously. In this thesis, we focus on the problem of additive noise and use several approaches to alleviate its influence on speech recognition. These used approaches are categorized into two classes, speech enhancement and the robust speech representation. The objective of speech enhancement approaches is to modify the testing speech features obtained in the noisy environment and make them better match the clean conditions for the pre-trained models. In chapter 3, four approaches of this kind are studied. They are Energy Contour Enhancement (ECE), Spectral Exponent Adjustment (SEA), Spectral Subtraction (SS) and Wiener Filtering (WF). One the other hand, the robust speech representation can reduce the sensitivity of the speech features to a mismatched acoustic condition, and thus decrease the mismatch between the clean training and the noisy testing environments. We study four approaches in deriving robust speech representations in Chapter 4. they are Autocorrelation Mel-Frequency Cepstral Coefficients(AMFCC), Phase Autocorrelation(PAC), Cepstral Mean Subtraction(CMS) and RelAtive SpecTrAl(RASTA). In Chapter 6 and 7, the recognition experiments and discussions are presented. The experimental results that these approaches can effectively improve the performance of the speech recognition under the noisy environment. Furthermore, in order to test whether these approaches are additive, we combine ECE or SEA with all the others to process the speech signals. Experimental results show that such a combination of two approaches can provide better recognition accuracy than the individual one in most cases. As a result, we can conclude that these robustness approaches reduce the influence of additive noise in different directions and are thus additive. Jeih-Weih Hung 洪志偉 2005 學位論文 ; thesis 74 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立暨南國際大學 === 電機工程學系 === 93 === In recent decades, more and more researchers are dedicated to developing the techniques of automatic speech recognition. In order to apply the speech recognizer in a real environment, the mismatch between training and application conditions must be handled carefully. For example, various kinds of additive noise in the application environment often degrades the performance of the speech recognizer seriously. In this thesis, we focus on the problem of additive noise and use several approaches to alleviate its influence on speech recognition. These used approaches are categorized into two classes, speech enhancement and the robust speech representation.
The objective of speech enhancement approaches is to modify the testing speech features obtained in the noisy environment and make them better match the clean conditions for the pre-trained models. In chapter 3, four approaches of this kind are studied. They are Energy Contour Enhancement (ECE), Spectral Exponent Adjustment (SEA), Spectral Subtraction (SS) and Wiener Filtering (WF).
One the other hand, the robust speech representation can reduce the sensitivity of the speech features to a mismatched acoustic condition, and thus decrease the mismatch between the clean training and the noisy testing environments. We study four approaches in deriving robust speech representations in Chapter 4. they are Autocorrelation Mel-Frequency Cepstral Coefficients(AMFCC), Phase Autocorrelation(PAC), Cepstral Mean Subtraction(CMS) and RelAtive SpecTrAl(RASTA).
In Chapter 6 and 7, the recognition experiments and discussions are presented. The experimental results that these approaches can effectively improve the performance of the speech recognition under the noisy environment. Furthermore, in order to test whether these approaches are additive, we combine ECE or SEA with all the others to process the speech signals. Experimental results show that such a combination of two approaches can provide better recognition accuracy than the individual one in most cases. As a result, we can conclude that these robustness approaches reduce the influence of additive noise in different directions and are thus additive.
|
author2 |
Jeih-Weih Hung |
author_facet |
Jeih-Weih Hung Tsung-Cheng Lin 林宗成 |
author |
Tsung-Cheng Lin 林宗成 |
spellingShingle |
Tsung-Cheng Lin 林宗成 The Techniques of Energy Contour Enhancement, Spectral Exponent Adjustment and Signal Autocorrelation for Robust Speech Recognition |
author_sort |
Tsung-Cheng Lin |
title |
The Techniques of Energy Contour Enhancement, Spectral Exponent Adjustment and Signal Autocorrelation for Robust Speech Recognition |
title_short |
The Techniques of Energy Contour Enhancement, Spectral Exponent Adjustment and Signal Autocorrelation for Robust Speech Recognition |
title_full |
The Techniques of Energy Contour Enhancement, Spectral Exponent Adjustment and Signal Autocorrelation for Robust Speech Recognition |
title_fullStr |
The Techniques of Energy Contour Enhancement, Spectral Exponent Adjustment and Signal Autocorrelation for Robust Speech Recognition |
title_full_unstemmed |
The Techniques of Energy Contour Enhancement, Spectral Exponent Adjustment and Signal Autocorrelation for Robust Speech Recognition |
title_sort |
techniques of energy contour enhancement, spectral exponent adjustment and signal autocorrelation for robust speech recognition |
publishDate |
2005 |
url |
http://ndltd.ncl.edu.tw/handle/12092403064882874846 |
work_keys_str_mv |
AT tsungchenglin thetechniquesofenergycontourenhancementspectralexponentadjustmentandsignalautocorrelationforrobustspeechrecognition AT línzōngchéng thetechniquesofenergycontourenhancementspectralexponentadjustmentandsignalautocorrelationforrobustspeechrecognition AT tsungchenglin yùnyòngpínpǔcìfāngdiàozhěnghézìxiāngguānfǎzhěnghénéngliàngqiánghuàjìshùzhīqiángjiànxìngyǔyīnbiànshí AT línzōngchéng yùnyòngpínpǔcìfāngdiàozhěnghézìxiāngguānfǎzhěnghénéngliàngqiánghuàjìshùzhīqiángjiànxìngyǔyīnbiànshí AT tsungchenglin techniquesofenergycontourenhancementspectralexponentadjustmentandsignalautocorrelationforrobustspeechrecognition AT línzōngchéng techniquesofenergycontourenhancementspectralexponentadjustmentandsignalautocorrelationforrobustspeechrecognition |
_version_ |
1718301291122262016 |