Diff-KT: Text-Driven Image Editing by Knowledge Enhancement and Mask Transformer
Recent advancements in text-to-image generation have demonstrated significant progress, especially with diffusion-based models conditioned on textual prompts, which excel in image quality and diversity. However, these methods often encounter a semantic gap between image and text modalities and suffe...
| الحاوية / القاعدة: | IEEE Access |
|---|---|
| المؤلفون الرئيسيون: | , , , |
| التنسيق: | مقال |
| اللغة: | الإنجليزية |
| منشور في: |
IEEE
2024-01-01
|
| الموضوعات: | |
| الوصول للمادة أونلاين: | https://ieeexplore.ieee.org/document/10634108/ |
