Reference Eigen-Environment and Speaker Weighting for Robust Speech Recognition

碩士 === 國立臺北科技大學 === 電腦與通訊研究所 === 96 === In this study a reference eigen-environment and speaker weighting (RESW) method is proposed for online HMM adaptation. RESW establishes multiple eigen-MLLR subspaces as the set of a priori knowledge according to certain affecting factors, such as noise type, S...

Full description

Bibliographic Details
Main Authors: Hung-Hsiang Fang, 方泓翔
Other Authors: 廖元甫
Format: Others
Language:zh-TW
Published: 2008
Online Access:http://ndltd.ncl.edu.tw/handle/qgqywg
id ndltd-TW-096TIT05652064
record_format oai_dc
spelling ndltd-TW-096TIT056520642019-07-26T03:38:47Z http://ndltd.ncl.edu.tw/handle/qgqywg Reference Eigen-Environment and Speaker Weighting for Robust Speech Recognition 基於雜訊環境與語者特徵參考模型內插之強健性語音辨認 Hung-Hsiang Fang 方泓翔 碩士 國立臺北科技大學 電腦與通訊研究所 96 In this study a reference eigen-environment and speaker weighting (RESW) method is proposed for online HMM adaptation. RESW establishes multiple eigen-MLLR subspaces as the set of a priori knowledge according to certain affecting factors, such as noise type, SNR, male and female. It then projects an input test utterance simultaneously into the set of eigen-subspaces and optimally synthesizes out a set of suitable HMMs. The proposed RESW was evaluated on Aurora 2 multi-condition training task. Experimental results showed that average word error rate (WER) of 6.12% was achieved. Moreover, RESW not only outperformed the multi-condition training baseline (Multi-Con., 13.72%) but also the blind ETSI advanced DSR front-end (ETSI-Adv., 8.65%) and the histogram equalization (HEQ, 8.66%) and the non-blind reference model weighting (RMW, 7.29%) and Eigen-MLLR (6.14%) approaches. 廖元甫 2008 學位論文 ; thesis 66 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺北科技大學 === 電腦與通訊研究所 === 96 === In this study a reference eigen-environment and speaker weighting (RESW) method is proposed for online HMM adaptation. RESW establishes multiple eigen-MLLR subspaces as the set of a priori knowledge according to certain affecting factors, such as noise type, SNR, male and female. It then projects an input test utterance simultaneously into the set of eigen-subspaces and optimally synthesizes out a set of suitable HMMs. The proposed RESW was evaluated on Aurora 2 multi-condition training task. Experimental results showed that average word error rate (WER) of 6.12% was achieved. Moreover, RESW not only outperformed the multi-condition training baseline (Multi-Con., 13.72%) but also the blind ETSI advanced DSR front-end (ETSI-Adv., 8.65%) and the histogram equalization (HEQ, 8.66%) and the non-blind reference model weighting (RMW, 7.29%) and Eigen-MLLR (6.14%) approaches.
author2 廖元甫
author_facet 廖元甫
Hung-Hsiang Fang
方泓翔
author Hung-Hsiang Fang
方泓翔
spellingShingle Hung-Hsiang Fang
方泓翔
Reference Eigen-Environment and Speaker Weighting for Robust Speech Recognition
author_sort Hung-Hsiang Fang
title Reference Eigen-Environment and Speaker Weighting for Robust Speech Recognition
title_short Reference Eigen-Environment and Speaker Weighting for Robust Speech Recognition
title_full Reference Eigen-Environment and Speaker Weighting for Robust Speech Recognition
title_fullStr Reference Eigen-Environment and Speaker Weighting for Robust Speech Recognition
title_full_unstemmed Reference Eigen-Environment and Speaker Weighting for Robust Speech Recognition
title_sort reference eigen-environment and speaker weighting for robust speech recognition
publishDate 2008
url http://ndltd.ncl.edu.tw/handle/qgqywg
work_keys_str_mv AT hunghsiangfang referenceeigenenvironmentandspeakerweightingforrobustspeechrecognition
AT fānghóngxiáng referenceeigenenvironmentandspeakerweightingforrobustspeechrecognition
AT hunghsiangfang jīyúzáxùnhuánjìngyǔyǔzhětèzhēngcānkǎomóxíngnèichāzhīqiángjiànxìngyǔyīnbiànrèn
AT fānghóngxiáng jīyúzáxùnhuánjìngyǔyǔzhětèzhēngcānkǎomóxíngnèichāzhīqiángjiànxìngyǔyīnbiànrèn
_version_ 1719230142679613440