CCLCap-AE-AVSS: Cycle consistency loss based capsule autoencoders for audio–visual speech synthesis

Audio–visual speech synthesis (AVSS) is a rapidly growing field in the paradigm of audio–visual learning, involving the conversion of one person’s speech into the audio–visual stream of another while preserving the speech content. AVSS comprises two primary components: voice conversion (VC), which a...

Full description

Bibliographic Details
Published in:Journal of Intelligent Systems
Main Authors: Ghosh Subhayu, Jana Nanda Dulal, Si Tapas, Mallik Saurav, Shah Mohd Asif
Format: Article
Language:English
Published: De Gruyter 2024-06-01
Subjects:
Online Access:https://doi.org/10.1515/jisys-2023-0171