Voice Keyword Retrieval Method Using Attention Mechanism and Multimodal Information Fusion

A cross-modal speech-text retrieval method based on an interactive-learning convolutional autoencoder (CAE) is proposed. First, an interactive-learning autoencoder structure is introduced; it takes two inputs, speech and text, and includes processing stages such as encoding, hidden-layer interaction, an...

Bibliographic Details
Main Author: Hongli Zhang
Format: Article
Language: English
Published: Hindawi Limited 2021-01-01
Series: Scientific Programming
Online Access: http://dx.doi.org/10.1155/2021/6662841