Design of a Multi-Condition Emotional Speech Synthesizer

Recently, researchers have developed text-to-speech models based on deep learning, which have produced results superior to those of previous approaches. However, because those systems only mimic the generic speaking style of reference audio, it is difficult to assign user-defined emotional types to...

Full description

Bibliographic Details
Main Authors: Sung-Woo Byun, Seok-Pil Lee
Format: Article
Language:English
Published: MDPI AG 2021-01-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/11/3/1144