An Improvement on Break Tag Prediction for Mandarin Speech
碩士 === 國立交通大學 === 電信工程研究所 === 100 === This thesis proposed an improvement method on break tag prediction for Mandarin speech synthesis. The linguistic features given from parser were utilized for the prediction of break tags due to the lack of prosodic-acoustic features in TTS. Generally, the lingui...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2012
|
Online Access: | http://ndltd.ncl.edu.tw/handle/08866294816508930949 |
id |
ndltd-TW-100NCTU5435104 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-100NCTU54351042016-03-28T04:20:39Z http://ndltd.ncl.edu.tw/handle/08866294816508930949 An Improvement on Break Tag Prediction for Mandarin Speech 中文語音停頓韻律標記預估之改進 Chen, Jui-Chuan 陳睿詮 碩士 國立交通大學 電信工程研究所 100 This thesis proposed an improvement method on break tag prediction for Mandarin speech synthesis. The linguistic features given from parser were utilized for the prediction of break tags due to the lack of prosodic-acoustic features in TTS. Generally, the linguistic features generated by parser belong to the word-level and sentence-level. However, the syntactic and semantic information still remain insufficient even the word-level and sentence-level features are given. In order to improve the break prediction for Mandarin speech, more linguistic features for describing the syntactic and semantic information are needed. This research classifies and labels the common and special word chunk as well as the phrase artificially, analyze the inter-syllable break appeared at special position in word chunk or phrase by using statistic distribution and decision tree, and investigate at last the mutual strength between these special position and the boundary of word chunk and phrase. The analyzed result showed that the inter-syllable break at special position in word chunk are mostly non-break, while the break of special position in word chunk or phrase are affected by the structure of word chunk or phrase. Furthermore, the smaller structure of word chunk and phrase posseses higher probability to follow the rule of mutual strength related between special position and the boundary of word chunk or phrase. The experiment results also showed that the adding of linguistic features of word chunk and phrase can in fact improve the prediction of break tags. Either using the linguistic features to predict inter-syllable break tags statically, or assisting the dynamic search for boundary of prosodic unit could the TTS achieve a more effective capability of break tags prediction. It virtually showed that the addition of word chunk and phrase information is capable of describing the syntactic structure more correctly, and then improve a more precise prediction of break tags. Chen, Sin-Horng 陳信宏 2012 學位論文 ; thesis 78 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立交通大學 === 電信工程研究所 === 100 === This thesis proposed an improvement method on break tag prediction for Mandarin speech synthesis. The linguistic features given from parser were utilized for the prediction of break tags due to the lack of prosodic-acoustic features in TTS. Generally, the linguistic features generated by parser belong to the word-level and sentence-level. However, the syntactic and semantic information still remain insufficient even the word-level and sentence-level features are given. In order to improve the break prediction for Mandarin speech, more linguistic features for describing the syntactic and semantic information are needed. This research classifies and labels the common and special word chunk as well as the phrase artificially, analyze the inter-syllable break appeared at special position in word chunk or phrase by using statistic distribution and decision tree, and investigate at last the mutual strength between these special position and the boundary of word chunk and phrase.
The analyzed result showed that the inter-syllable break at special position in word chunk are mostly non-break, while the break of special position in word chunk or phrase are affected by the structure of word chunk or phrase. Furthermore, the smaller structure of word chunk and phrase posseses higher probability to follow the rule of mutual strength related between special position and the boundary of word chunk or phrase.
The experiment results also showed that the adding of linguistic features of word chunk and phrase can in fact improve the prediction of break tags. Either using the linguistic features to predict inter-syllable break tags statically, or assisting the dynamic search for boundary of prosodic unit could the TTS achieve a more effective capability of break tags prediction. It virtually showed that the addition of word chunk and phrase information is capable of describing the syntactic structure more correctly, and then improve a more precise prediction of break tags.
|
author2 |
Chen, Sin-Horng |
author_facet |
Chen, Sin-Horng Chen, Jui-Chuan 陳睿詮 |
author |
Chen, Jui-Chuan 陳睿詮 |
spellingShingle |
Chen, Jui-Chuan 陳睿詮 An Improvement on Break Tag Prediction for Mandarin Speech |
author_sort |
Chen, Jui-Chuan |
title |
An Improvement on Break Tag Prediction for Mandarin Speech |
title_short |
An Improvement on Break Tag Prediction for Mandarin Speech |
title_full |
An Improvement on Break Tag Prediction for Mandarin Speech |
title_fullStr |
An Improvement on Break Tag Prediction for Mandarin Speech |
title_full_unstemmed |
An Improvement on Break Tag Prediction for Mandarin Speech |
title_sort |
improvement on break tag prediction for mandarin speech |
publishDate |
2012 |
url |
http://ndltd.ncl.edu.tw/handle/08866294816508930949 |
work_keys_str_mv |
AT chenjuichuan animprovementonbreaktagpredictionformandarinspeech AT chénruìquán animprovementonbreaktagpredictionformandarinspeech AT chenjuichuan zhōngwényǔyīntíngdùnyùnlǜbiāojìyùgūzhīgǎijìn AT chénruìquán zhōngwényǔyīntíngdùnyùnlǜbiāojìyùgūzhīgǎijìn AT chenjuichuan improvementonbreaktagpredictionformandarinspeech AT chénruìquán improvementonbreaktagpredictionformandarinspeech |
_version_ |
1718213519196815360 |