On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS
碩士 === 國立清華大學 === 資訊工程學系 === 100 === This study implements an online Mandarin speech synthesis system with speaker adaptation and proposes a speech feature substitution approach to improve the quality of the synthesized speech. The system takes texts provided by users as input and performs POS and t...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2012
|
Online Access: | http://ndltd.ncl.edu.tw/handle/35836223923640099881 |
id |
ndltd-TW-100NTHU5392092 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-100NTHU53920922015-10-13T21:27:24Z http://ndltd.ncl.edu.tw/handle/35836223923640099881 On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS 基於特徵替換法對語者調適語音合成之改進 HSU, PEI-LIN 徐培霖 碩士 國立清華大學 資訊工程學系 100 This study implements an online Mandarin speech synthesis system with speaker adaptation and proposes a speech feature substitution approach to improve the quality of the synthesized speech. The system takes texts provided by users as input and performs POS and tone tagging. The synthesis can be done with the acoustic models of users’ choices. This system also provides a speaker adaptation function. First, the user is asked to record a few sentences through a web interface. A speech scoring technique is used to validate the quality of the recorded utterances. The system then uses these utterances to perform speaker adaptation to adjust the acoustic models for speech synthesis. Moreover, this study proposes a speech feature substitution method to improve the quality of speaker adaptation. This method adopts the spectral features extracted from real speech utterances instead of estimating them from acoustic models. The similarity between the synthesized speech and target speech is therefore increased. The experimental result shows that the proposed method is able to improve upon the original method with an 0.4 increase in MOS score. 張智星 2012 學位論文 ; thesis 52 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立清華大學 === 資訊工程學系 === 100 === This study implements an online Mandarin speech synthesis system with speaker adaptation and proposes a speech feature substitution approach to improve the quality of the synthesized speech. The system takes texts provided by users as input and performs POS and tone tagging. The synthesis can be done with the acoustic models of users’ choices.
This system also provides a speaker adaptation function. First, the user is asked to record a few sentences through a web interface. A speech scoring technique is used to validate the quality of the recorded utterances. The system then uses these utterances to perform speaker adaptation to adjust the acoustic models for speech synthesis.
Moreover, this study proposes a speech feature substitution method to improve the quality of speaker adaptation. This method adopts the spectral features extracted from real speech utterances instead of estimating them from acoustic models. The similarity between the synthesized speech and target speech is therefore increased. The experimental result shows that the proposed method is able to improve upon the original method with an 0.4 increase in MOS score.
|
author2 |
張智星 |
author_facet |
張智星 HSU, PEI-LIN 徐培霖 |
author |
HSU, PEI-LIN 徐培霖 |
spellingShingle |
HSU, PEI-LIN 徐培霖 On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS |
author_sort |
HSU, PEI-LIN |
title |
On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS |
title_short |
On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS |
title_full |
On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS |
title_fullStr |
On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS |
title_full_unstemmed |
On the Use of Speech Feature Substitution for Speaker Adaption within HMM-based TTS |
title_sort |
on the use of speech feature substitution for speaker adaption within hmm-based tts |
publishDate |
2012 |
url |
http://ndltd.ncl.edu.tw/handle/35836223923640099881 |
work_keys_str_mv |
AT hsupeilin ontheuseofspeechfeaturesubstitutionforspeakeradaptionwithinhmmbasedtts AT xúpéilín ontheuseofspeechfeaturesubstitutionforspeakeradaptionwithinhmmbasedtts AT hsupeilin jīyútèzhēngtìhuànfǎduìyǔzhědiàoshìyǔyīnhéchéngzhīgǎijìn AT xúpéilín jīyútèzhēngtìhuànfǎduìyǔzhědiàoshìyǔyīnhéchéngzhīgǎijìn |
_version_ |
1718062693566382080 |