Learning Aligned Cross-Modal Representations from Weakly Aligned Data

People can recognize scenes across many different modalities beyond natural images. In this paper, we investigate how to learn cross-modal scene representations that transfer across modalities. To study this problem, we introduce a new cross-modal scene dataset. While convolutional neural networks c...

Full description

Bibliographic Details
Main Authors: Castrejon, Lluis (Author), Pirsiavash, Hamed (Author), Aytar, Yusuf (Contributor), Vondrick, Carl Martin (Contributor), Torralba, Antonio (Contributor)
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor)
Format: Article
Language:English
Published: Institute of Electrical and Electronics Engineers (IEEE), 2017-12-29T19:43:54Z.
Subjects:
Online Access:Get fulltext