Dual-Modal Transformer with Enhanced Inter- and Intra-Modality Interactions for Image Captioning

Image captioning is oriented towards describing an image with the best possible use of words that can provide a semantic, relatable meaning of the scenario inscribed. Different models can be used to accomplish this arduous task depending on the context and requirement of what needs to be achieved. A...

Full description

Bibliographic Details
Published in:Applied Sciences
Main Authors: Deepika Kumar, Varun Srivastava, Daniela Elena Popescu, Jude D. Hemanth
Format: Article
Language:English
Published: MDPI AG 2022-07-01
Subjects:
Online Access:https://www.mdpi.com/2076-3417/12/13/6733