Review of Visual Question Answering Technology

Visual question answering (VQA) is a popular cross-modal task that combines natural language processing and computer vision techniques. The main objective of this task is to enable computers to intelligently recognize and retrieve visual content and provide accurate answers. VQA involves the integr...

Bibliographic Details
Published in: Jisuanji kexue yu tansuo
Main Authors: WANG Yu, SUN Haichun
Format: Article
Language: Chinese
Published: Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press 2023-07-01
Subjects:
Online Access: http://fcst.ceaj.org/fileup/1673-9418/PDF/2303025.pdf