Lhasa-Tibetan Speech Synthesis Using End-to-End Model
With the development of deep learning technology, speech synthesis based on deep neural networks has gradually become the mainstream method in the field of speech synthesis. In this paper, we explored the Tacotron2 model for Lhasa-Tibetan dialect speech synthesis by constructing a feature prediction...
Main Authors: | Yue Zhao, Panhua Hu, Xiaona Xu, Licheng Wu, Xiali Li |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8827457/ |
Similar Items
-
End-to-End-Based Tibetan Multitask Speech Recognition
by: Yue Zhao, et al.
Published: (2019-01-01) -
WaveNet With Cross-Attention for Audiovisual Speech Recognition
by: Hui Wang, et al.
Published: (2020-01-01) -
Statistical analysis of acoustic characteristics of Tibetan Lhasa dialect speech emotion
by: Guo Dandan, et al.
Published: (2016-01-01) -
The Tonal and Intonational Phonology of Lhasa Tibetan
by: Lim, Keh Sheng
Published: (2018) -
End-to-End Amdo-Tibetan Speech Recognition Based on Knowledge Transfer
by: Xiaojun Zhu, et al.
Published: (2020-01-01)