An Implementation of Hakka Text-to-Speech System
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: | 2006 |
Online Access: | http://ndltd.ncl.edu.tw/handle/39336830137118858666 |
Summary: | Master's thesis === National Chiao Tung University === Department of Communication Engineering === 95 === In this thesis, a Hakka Text-to-Speech (TTS) system is implemented. It consists of four main parts: a text analyzer, an RNN prosody generator, a waveform inventory of synthesis units, and a PSOLA synthesizer. The input text is first tagged by the text analyzer into a word sequence. The RNN prosody generator then generates the prosodic information from linguistic features extracted from the word sequence. The waveform corresponding to the word sequence is then extracted from the waveform inventory and prosodically adjusted to generate the output speech. The basic implementation of the system follows the Mandarin TTS system developed previously at NCTU. Finally, a demo system running on the Windows platform was realized, using an SDI (Single Document Interface) text editor with the synthesis kernel. Informal listening tests show that most of the synthesized speech sounds fair.
|
---|