Revamping Image-Recipe Cross-Modal Retrieval with Dual Cross Attention Encoders

The image-recipe cross-modal retrieval task, which retrieves the relevant recipes according to food images and vice versa, is now attracting widespread attention. There are two main challenges for image-recipe cross-modal retrieval task. Firstly, a recipe’s different components (words in a sentence,...

وصف كامل

التفاصيل البيبلوغرافية
الحاوية / القاعدة:	Mathematics
المؤلفون الرئيسيون:	Wenhao Liu, Simiao Yuan, Zhen Wang, Xinyi Chang, Limeng Gao, Zhenrui Zhang
التنسيق:	مقال
اللغة:	الإنجليزية
منشور في:	MDPI AG 2024-10-01
الموضوعات:	image-recipe cross-modal retrieval cross attention recipe encoder image encoder
الوصول للمادة أونلاين:	https://www.mdpi.com/2227-7390/12/20/3181

الانترنت

https://www.mdpi.com/2227-7390/12/20/3181

Revamping Image-Recipe Cross-Modal Retrieval with Dual Cross Attention Encoders

الانترنت

مواد مشابهة