The Techniques of Energy Contour Enhancement, Spectral Exponent Adjustment and Signal Autocorrelation for Robust Speech Recognition

碩士 === 國立暨南國際大學 === 電機工程學系 === 93 === In recent decades, more and more researchers are dedicated to developing the techniques of automatic speech recognition. In order to apply the speech recognizer in a real environment, the mismatch between training and application conditions must be handled caref...

Full description

Bibliographic Details
Main Authors: Tsung-Cheng Lin, 林宗成
Other Authors: Jeih-Weih Hung
Format: Others
Language:zh-TW
Published: 2005
Online Access:http://ndltd.ncl.edu.tw/handle/12092403064882874846
id ndltd-TW-093NCNU0442087
record_format oai_dc
spelling ndltd-TW-093NCNU04420872016-06-10T04:16:32Z http://ndltd.ncl.edu.tw/handle/12092403064882874846 The Techniques of Energy Contour Enhancement, Spectral Exponent Adjustment and Signal Autocorrelation for Robust Speech Recognition 運用頻譜次方調整和自相關法整合能量強化技術之強健性語音辨識 Tsung-Cheng Lin 林宗成 碩士 國立暨南國際大學 電機工程學系 93 In recent decades, more and more researchers are dedicated to developing the techniques of automatic speech recognition. In order to apply the speech recognizer in a real environment, the mismatch between training and application conditions must be handled carefully. For example, various kinds of additive noise in the application environment often degrades the performance of the speech recognizer seriously. In this thesis, we focus on the problem of additive noise and use several approaches to alleviate its influence on speech recognition. These used approaches are categorized into two classes, speech enhancement and the robust speech representation. The objective of speech enhancement approaches is to modify the testing speech features obtained in the noisy environment and make them better match the clean conditions for the pre-trained models. In chapter 3, four approaches of this kind are studied. They are Energy Contour Enhancement (ECE), Spectral Exponent Adjustment (SEA), Spectral Subtraction (SS) and Wiener Filtering (WF). One the other hand, the robust speech representation can reduce the sensitivity of the speech features to a mismatched acoustic condition, and thus decrease the mismatch between the clean training and the noisy testing environments. We study four approaches in deriving robust speech representations in Chapter 4. they are Autocorrelation Mel-Frequency Cepstral Coefficients(AMFCC), Phase Autocorrelation(PAC), Cepstral Mean Subtraction(CMS) and RelAtive SpecTrAl(RASTA). In Chapter 6 and 7, the recognition experiments and discussions are presented. The experimental results that these approaches can effectively improve the performance of the speech recognition under the noisy environment. Furthermore, in order to test whether these approaches are additive, we combine ECE or SEA with all the others to process the speech signals. Experimental results show that such a combination of two approaches can provide better recognition accuracy than the individual one in most cases. As a result, we can conclude that these robustness approaches reduce the influence of additive noise in different directions and are thus additive. Jeih-Weih Hung 洪志偉 2005 學位論文 ; thesis 74 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立暨南國際大學 === 電機工程學系 === 93 === In recent decades, more and more researchers are dedicated to developing the techniques of automatic speech recognition. In order to apply the speech recognizer in a real environment, the mismatch between training and application conditions must be handled carefully. For example, various kinds of additive noise in the application environment often degrades the performance of the speech recognizer seriously. In this thesis, we focus on the problem of additive noise and use several approaches to alleviate its influence on speech recognition. These used approaches are categorized into two classes, speech enhancement and the robust speech representation. The objective of speech enhancement approaches is to modify the testing speech features obtained in the noisy environment and make them better match the clean conditions for the pre-trained models. In chapter 3, four approaches of this kind are studied. They are Energy Contour Enhancement (ECE), Spectral Exponent Adjustment (SEA), Spectral Subtraction (SS) and Wiener Filtering (WF). One the other hand, the robust speech representation can reduce the sensitivity of the speech features to a mismatched acoustic condition, and thus decrease the mismatch between the clean training and the noisy testing environments. We study four approaches in deriving robust speech representations in Chapter 4. they are Autocorrelation Mel-Frequency Cepstral Coefficients(AMFCC), Phase Autocorrelation(PAC), Cepstral Mean Subtraction(CMS) and RelAtive SpecTrAl(RASTA). In Chapter 6 and 7, the recognition experiments and discussions are presented. The experimental results that these approaches can effectively improve the performance of the speech recognition under the noisy environment. Furthermore, in order to test whether these approaches are additive, we combine ECE or SEA with all the others to process the speech signals. Experimental results show that such a combination of two approaches can provide better recognition accuracy than the individual one in most cases. As a result, we can conclude that these robustness approaches reduce the influence of additive noise in different directions and are thus additive.
author2 Jeih-Weih Hung
author_facet Jeih-Weih Hung
Tsung-Cheng Lin
林宗成
author Tsung-Cheng Lin
林宗成
spellingShingle Tsung-Cheng Lin
林宗成
The Techniques of Energy Contour Enhancement, Spectral Exponent Adjustment and Signal Autocorrelation for Robust Speech Recognition
author_sort Tsung-Cheng Lin
title The Techniques of Energy Contour Enhancement, Spectral Exponent Adjustment and Signal Autocorrelation for Robust Speech Recognition
title_short The Techniques of Energy Contour Enhancement, Spectral Exponent Adjustment and Signal Autocorrelation for Robust Speech Recognition
title_full The Techniques of Energy Contour Enhancement, Spectral Exponent Adjustment and Signal Autocorrelation for Robust Speech Recognition
title_fullStr The Techniques of Energy Contour Enhancement, Spectral Exponent Adjustment and Signal Autocorrelation for Robust Speech Recognition
title_full_unstemmed The Techniques of Energy Contour Enhancement, Spectral Exponent Adjustment and Signal Autocorrelation for Robust Speech Recognition
title_sort techniques of energy contour enhancement, spectral exponent adjustment and signal autocorrelation for robust speech recognition
publishDate 2005
url http://ndltd.ncl.edu.tw/handle/12092403064882874846
work_keys_str_mv AT tsungchenglin thetechniquesofenergycontourenhancementspectralexponentadjustmentandsignalautocorrelationforrobustspeechrecognition
AT línzōngchéng thetechniquesofenergycontourenhancementspectralexponentadjustmentandsignalautocorrelationforrobustspeechrecognition
AT tsungchenglin yùnyòngpínpǔcìfāngdiàozhěnghézìxiāngguānfǎzhěnghénéngliàngqiánghuàjìshùzhīqiángjiànxìngyǔyīnbiànshí
AT línzōngchéng yùnyòngpínpǔcìfāngdiàozhěnghézìxiāngguānfǎzhěnghénéngliàngqiánghuàjìshùzhīqiángjiànxìngyǔyīnbiànshí
AT tsungchenglin techniquesofenergycontourenhancementspectralexponentadjustmentandsignalautocorrelationforrobustspeechrecognition
AT línzōngchéng techniquesofenergycontourenhancementspectralexponentadjustmentandsignalautocorrelationforrobustspeechrecognition
_version_ 1718301291122262016