Video Captioning Based on Channel Soft Attention and Semantic Reconstructor

Video Captioning Based on Channel Soft Attention and Semantic Reconstructor

Video captioning is a popular task which automatically generates a natural-language sentence to describe video content. Previous video captioning works mainly use the encoder–decoder framework and exploit special techniques such as attention mechanisms to improve the quality of generated sentences....

Full description

Bibliographic Details
Main Authors:	Zhou Lei, Yiyong Huang
Format:	Article
Language:	English
Published:	MDPI AG 2021-02-01
Series:	Future Internet
Subjects:	video captioning channel soft attention semantic reconstructor recurrent convolution networks
Online Access:	https://www.mdpi.com/1999-5903/13/2/55

Similar Items

Video captioning with stacked attention and semantic hard pull
by: Md. Mushfiqur Rahman, et al.
Published: (2021-08-01)

Fully Convolutional CaptionNet: Siamese Difference Captioning Attention Model
by: Ariyo Oluwasanmi, et al.
Published: (2019-01-01)

Variational Autoencoder-Based Multiple Image Captioning Using a Caption Attention Map
by: Boeun Kim, et al.
Published: (2019-07-01)

An ANN-Based Smart Tomographic Reconstructor in a Dynamic Environment
by: Francisco J. de Cos Juez, et al.
Published: (2012-06-01)

Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation
by: Ling Cheng, et al.
Published: (2020-01-01)

A Semantics-Assisted Video Captioning Model Trained With Scheduled Sampling
by: Haoran Chen, et al.
Published: (2020-09-01)

Video Captioning With Adaptive Attention and Mixed Loss Optimization
by: Huanhou Xiao, et al.
Published: (2019-01-01)

VSAM-Based Visual Keyword Generation for Image Caption
by: Suya Zhang, et al.
Published: (2021-01-01)

Exploring Multi-Level Attention and Semantic Relationship for Remote Sensing Image Captioning
by: Zhenghang Yuan, et al.
Published: (2020-01-01)

Multilayer Dense Attention Model for Image Caption
by: Ke Wang, et al.
Published: (2019-01-01)

Cascade Semantic Fusion for Image Captioning
by: Shiwei Wang, et al.
Published: (2019-01-01)

CaptionNet: Automatic End-to-End Siamese Difference Captioning Model With Attention
by: Ariyo Oluwasanmi, et al.
Published: (2019-01-01)

Panoptic Segmentation-Based Attention for Image Captioning
by: Wenjie Cai, et al.
Published: (2020-01-01)

VAA: Visual Aligning Attention Model for Remote Sensing Image Captioning
by: Zhengyuan Zhang, et al.
Published: (2019-01-01)

Multi-Gate Attention Network for Image Captioning
by: Weitao Jiang, et al.
Published: (2021-01-01)

A Fine-Grained Spatial-Temporal Attention Model for Video Captioning
by: An-An Liu, et al.
Published: (2018-01-01)

Attention-Guided Network for Semantic Video Segmentation
by: Jiangyun Li, et al.
Published: (2019-01-01)

Video Caption Based Searching Using End-to-End Dense Captioning and Sentence Embeddings
by: Akshay Aggarwal, et al.
Published: (2020-06-01)

Extracting Structured Supervision From Captions for Weakly Supervised Semantic Segmentation
by: Daniel R. Vilar, et al.
Published: (2021-01-01)

Context-Driven Image Caption With Global Semantic Relations of the Named Entities
by: Yun Jing, et al.
Published: (2020-01-01)

Hierarchical Attention-Based Fusion for Image Caption With Multi-Grained Rewards
by: Chunlei Wu, et al.
Published: (2020-01-01)

Social Image Captioning: Exploring Visual Attention and User Attention
by: Leiquan Wang, et al.
Published: (2018-02-01)

Hybrid Attention Distribution and Factorized Embedding Matrix in Image Captioning
by: Jian Wang, et al.
Published: (2020-01-01)

Cross-Lingual Image Caption Generation Based on Visual Attention Model
by: Bin Wang, et al.
Published: (2020-01-01)

Structure Preserving Convolutional Attention for Image Captioning
by: Shichen Lu, et al.
Published: (2019-07-01)

A Sparse Transformer-Based Approach for Image Captioning
by: Zhou Lei, et al.
Published: (2020-01-01)

A Video Captioning Method Based on Multi-Representation Switching for Sustainable Computing
by: Heechan Kim, et al.
Published: (2021-02-01)

Landslide Image Captioning Method Based on Semantic Gate and Bi-Temporal LSTM
by: Wenqi Cui, et al.
Published: (2020-03-01)

A Multi-Level Attention Model for Remote Sensing Image Captions
by: Yangyang Li, et al.
Published: (2020-03-01)

ATT-BM-SOM: A Framework of Effectively Choosing Image Information and Optimizing Syntax for Image Captioning
by: Zhenyu Yang, et al.
Published: (2020-01-01)

Automatic Image and Video Caption Generation With Deep Learning: A Concise Review and Algorithmic Overlap
by: Soheyla Amirian, et al.
Published: (2020-01-01)

Boosted Transformer for Image Captioning
by: Jiangyun Li, et al.
Published: (2019-08-01)

Understanding Objects in Video: Object-Oriented Video Captioning via Structured Trajectory and Adversarial Learning
by: Fangyi Zhu, et al.
Published: (2020-01-01)

LAM: Remote Sensing Image Captioning with Label-Attention Mechanism
by: Zhengyuan Zhang, et al.
Published: (2019-10-01)

An Attentive Fourier-Augmented Image-Captioning Transformer
by: Raymond Ian Osolo, et al.
Published: (2021-09-01)

VIDEO SCENE DETECTION USING CLOSED CAPTION TEXT
by: Smith, Gregory
Published: (2009)

The effects of captioning texts and caption ordering on L2 listening comprehension and vocabulary learning
by: Fatemeh Alikhani, et al.
Published: (2013-07-01)

Activity retrieval in closed captioned videos
by: Gupta, Sonal
Published: (2010)

Deep Learning pentru descrierea automată a imaginilor în limbaj natural - Image Captioning
by: Anca Mihaela HOTĂRAN, et al.
Published: (2020-03-01)

Retrieval Topic Recurrent Memory Network for Remote Sensing Image Captioning
by: Binqiang Wang, et al.
Published: (2020-01-01)