On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS

碩士 === 國立清華大學 === 資訊工程學系 === 100 === This study implements an online Mandarin speech synthesis system with speaker adaptation and proposes a speech feature substitution approach to improve the quality of the synthesized speech. The system takes texts provided by users as input and performs POS and t...

Full description

Bibliographic Details
Main Authors: HSU, PEI-LIN, 徐培霖
Other Authors: 張智星
Format: Others
Language:zh-TW
Published: 2012
Online Access:http://ndltd.ncl.edu.tw/handle/35836223923640099881
id ndltd-TW-100NTHU5392092
record_format oai_dc
spelling ndltd-TW-100NTHU53920922015-10-13T21:27:24Z http://ndltd.ncl.edu.tw/handle/35836223923640099881 On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS 基於特徵替換法對語者調適語音合成之改進 HSU, PEI-LIN 徐培霖 碩士 國立清華大學 資訊工程學系 100 This study implements an online Mandarin speech synthesis system with speaker adaptation and proposes a speech feature substitution approach to improve the quality of the synthesized speech. The system takes texts provided by users as input and performs POS and tone tagging. The synthesis can be done with the acoustic models of users’ choices. This system also provides a speaker adaptation function. First, the user is asked to record a few sentences through a web interface. A speech scoring technique is used to validate the quality of the recorded utterances. The system then uses these utterances to perform speaker adaptation to adjust the acoustic models for speech synthesis. Moreover, this study proposes a speech feature substitution method to improve the quality of speaker adaptation. This method adopts the spectral features extracted from real speech utterances instead of estimating them from acoustic models. The similarity between the synthesized speech and target speech is therefore increased. The experimental result shows that the proposed method is able to improve upon the original method with an 0.4 increase in MOS score. 張智星 2012 學位論文 ; thesis 52 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立清華大學 === 資訊工程學系 === 100 === This study implements an online Mandarin speech synthesis system with speaker adaptation and proposes a speech feature substitution approach to improve the quality of the synthesized speech. The system takes texts provided by users as input and performs POS and tone tagging. The synthesis can be done with the acoustic models of users’ choices. This system also provides a speaker adaptation function. First, the user is asked to record a few sentences through a web interface. A speech scoring technique is used to validate the quality of the recorded utterances. The system then uses these utterances to perform speaker adaptation to adjust the acoustic models for speech synthesis. Moreover, this study proposes a speech feature substitution method to improve the quality of speaker adaptation. This method adopts the spectral features extracted from real speech utterances instead of estimating them from acoustic models. The similarity between the synthesized speech and target speech is therefore increased. The experimental result shows that the proposed method is able to improve upon the original method with an 0.4 increase in MOS score.
author2 張智星
author_facet 張智星
HSU, PEI-LIN
徐培霖
author HSU, PEI-LIN
徐培霖
spellingShingle HSU, PEI-LIN
徐培霖
On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS
author_sort HSU, PEI-LIN
title On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS
title_short On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS
title_full On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS
title_fullStr On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS
title_full_unstemmed On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS
title_sort on the use of speech feature substitution for speaker adaption within hmm-based tts
publishDate 2012
url http://ndltd.ncl.edu.tw/handle/35836223923640099881
work_keys_str_mv AT hsupeilin ontheuseofspeechfeaturesubstitutionforspeakeradaptionwithinhmmbasedtts
AT xúpéilín ontheuseofspeechfeaturesubstitutionforspeakeradaptionwithinhmmbasedtts
AT hsupeilin jīyútèzhēngtìhuànfǎduìyǔzhědiàoshìyǔyīnhéchéngzhīgǎijìn
AT xúpéilín jīyútèzhēngtìhuànfǎduìyǔzhědiàoshìyǔyīnhéchéngzhīgǎijìn
_version_ 1718062693566382080