Dual-Modal Transformer with Enhanced Inter- and Intra-Modality Interactions for Image Captioning
Image captioning is oriented towards describing an image with the best possible use of words that can provide a semantic, relatable meaning of the scenario inscribed. Different models can be used to accomplish this arduous task depending on the context and requirement of what needs to be achieved. A...
| Published in: | Applied Sciences |
|---|---|
| Main Authors: | , , , |
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2022-07-01
|
| Subjects: | |
| Online Access: | https://www.mdpi.com/2076-3417/12/13/6733 |
