Video Captioning Based on Channel Soft Attention and Semantic Reconstructor
Video captioning is a popular task which automatically generates a natural-language sentence to describe video content. Previous video captioning works mainly use the encoder–decoder framework and exploit special techniques such as attention mechanisms to improve the quality of generated sentences....
Main Authors: | Zhou Lei, Yiyong Huang |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-02-01
|
Series: | Future Internet |
Subjects: | |
Online Access: | https://www.mdpi.com/1999-5903/13/2/55 |
Similar Items
-
Video captioning with stacked attention and semantic hard pull
by: Md. Mushfiqur Rahman, et al.
Published: (2021-08-01) -
Fully Convolutional CaptionNet: Siamese Difference Captioning Attention Model
by: Ariyo Oluwasanmi, et al.
Published: (2019-01-01) -
Variational Autoencoder-Based Multiple Image Captioning Using a Caption Attention Map
by: Boeun Kim, et al.
Published: (2019-07-01) -
An ANN-Based Smart Tomographic Reconstructor in a Dynamic Environment
by: Francisco J. de Cos Juez, et al.
Published: (2012-06-01) -
Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation
by: Ling Cheng, et al.
Published: (2020-01-01)