Generating Synthetic Facial Expression Images Using EmoStyle

Synthetic data has emerged as a significant alternative to more costly and time-consuming data collection methods. This assertion is particularly salient in the context of training facial expression recognition (FER) and generation models. The EmoStyle model represents a state-of-the-art method for...

詳細記述

書誌詳細
出版年:	Applied Sciences
主要な著者:	Clément Gérard Daniel Darne, Changqin Quan, Zhiwei Luo
フォーマット:	論文
言語:	英語
出版事項:	MDPI AG 2025-10-01
主題:	facial expression generation synthetic data valence–arousal StyleGAN2 EmoStyle facial expression accuracy
オンライン･アクセス:	https://www.mdpi.com/2076-3417/15/19/10636

その他の書誌記述
要約:	Synthetic data has emerged as a significant alternative to more costly and time-consuming data collection methods. This assertion is particularly salient in the context of training facial expression recognition (FER) and generation models. The EmoStyle model represents a state-of-the-art method for editing images of facial expressions in the latent space of StyleGAN2, using a continuous valence–arousal (VA) representation of emotions. While the model has demonstrated promising results in terms of high-quality image generation and strong identity preservation, its accuracy in reproducing facial expressions across the VA space remains to be systematically examined. To address this gap, the present study proposes a systematic evaluation of EmoStyle’s ability to generate facial expressions across the full VA space, including four levels of emotional intensity. While prior work on expression manipulation has mainly focused its evaluations on perceptual quality, diversity, identity preservation, or classification accuracy, to the best of our knowledge, no study to date has systematically evaluated the accuracy of generated expressions across the VA space. The evaluation’s findings include a consistent weakness in the VA direction range of 242–329°, where EmoStyle demonstrates the inability to produce distinct expressions. Building on these findings, we outline recommendations for enhancing the generation pipeline and release an open-source EmoStyle-based toolkit that integrates fixes to the original EmoStyle repository, an API wrapper, and our experiment scripts. Collectively, these contributions furnish both novel insights into the model’s capacities and practical resources for further research.
ISSN:	2076-3417

Generating Synthetic Facial Expression Images Using EmoStyle

類似資料