CCLCap-AE-AVSS: Cycle consistency loss based capsule autoencoders for audio–visual speech synthesis

Audio–visual speech synthesis (AVSS) is a rapidly growing field in the paradigm of audio–visual learning, involving the conversion of one person’s speech into the audio–visual stream of another while preserving the speech content. AVSS comprises two primary components: voice conversion (VC), which a...

Full description

Bibliographic Details
Published in:	Journal of Intelligent Systems
Main Authors:	Ghosh Subhayu, Jana Nanda Dulal, Si Tapas, Mallik Saurav, Shah Mohd Asif
Format:	Article
Language:	English
Published:	De Gruyter 2024-06-01
Subjects:	voice conversion audio–visual synthesis autoencoder capsule network cycle consistency loss
Online Access:	https://doi.org/10.1515/jisys-2023-0171

Internet

https://doi.org/10.1515/jisys-2023-0171

CCLCap-AE-AVSS: Cycle consistency loss based capsule autoencoders for audio–visual speech synthesis

Internet

Similar Items