Multimodal Representation Learning for Visual Reasoning and Text-to-Image Translation
abstract: Multimodal Representation Learning is a multi-disciplinary research field that aims to integrate information from multiple communicative modalities in a meaningful manner to help solve a downstream task. These modalities can be visual, acoustic, linguistic, haptic, etc. The interpretati...
Other Authors: | |
---|---|
Format: | Dissertation |
Language: | English |
Published: | 2018 |
Subjects: | |
Online Access: | http://hdl.handle.net/2286/R.I.51644 |