Diff-KT: Text-Driven Image Editing by Knowledge Enhancement and Mask Transformer

Recent advancements in text-to-image generation have demonstrated significant progress, especially with diffusion-based models conditioned on textual prompts, which excel in image quality and diversity. However, these methods often encounter a semantic gap between image and text modalities and suffe...


Bibliographic Details
Container / Database: IEEE Access
Main Authors: Hong Zhao, Wengai Li, Zhaobin Chang, Ce Yang
Format: Article
Language: English
Published: IEEE 2024-01-01
Subjects:
Online Access: https://ieeexplore.ieee.org/document/10634108/