On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS

Master's === National Tsing Hua University === Department of Computer Science === 100 === This study implements an online Mandarin speech synthesis system with speaker adaptation and proposes a speech feature substitution approach to improve the quality of the synthesized speech. The system takes texts provided by users as input and performs POS and t...

Bibliographic Details
Main Authors:	HSU, PEI-LIN, 徐培霖
Other Authors:	張智星
Format:	Others
Language:	zh-TW
Published:	2012
Online Access:	http://ndltd.ncl.edu.tw/handle/35836223923640099881

id	ndltd-TW-100NTHU5392092
record_format	oai_dc
spelling	ndltd-TW-100NTHU5392092 2015-10-13T21:27:24Z http://ndltd.ncl.edu.tw/handle/35836223923640099881 On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS 基於特徵替換法對語者調適語音合成之改進 HSU, PEI-LIN 徐培霖 Master's, National Tsing Hua University, Department of Computer Science, 100 張智星 2012 thesis 52 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	Master's === National Tsing Hua University === Department of Computer Science === 100 === This study implements an online Mandarin speech synthesis system with speaker adaptation and proposes a speech feature substitution approach to improve the quality of the synthesized speech. The system takes text provided by users as input and performs POS and tone tagging. Synthesis can be performed with the acoustic models of the user's choice. The system also provides a speaker adaptation function. First, the user is asked to record a few sentences through a web interface. A speech scoring technique is used to validate the quality of the recorded utterances. The system then uses these utterances to perform speaker adaptation, adjusting the acoustic models used for speech synthesis. Moreover, this study proposes a speech feature substitution method to improve the quality of speaker adaptation. This method adopts the spectral features extracted from real speech utterances instead of estimating them from acoustic models. The similarity between the synthesized speech and the target speech is therefore increased. The experimental results show that the proposed method improves upon the original method with a 0.4 increase in MOS.
author2	張智星
author_facet	張智星 HSU, PEI-LIN 徐培霖
author	HSU, PEI-LIN 徐培霖
author_sort	HSU, PEI-LIN
title	On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS
title_sort	on the use of speech feature substitution for speaker adaption within hmm-based tts
publishDate	2012
url	http://ndltd.ncl.edu.tw/handle/35836223923640099881
