Learning Ordinal Relationships for Mid-Level Vision

We propose a framework that infers mid-level visual properties of an image by learning about ordinal relationships. Instead of estimating metric quantities directly, the system proposes pairwise relationship estimates for points in the input image. These sparse probabilistic ordinal measurements are...

Full description

Bibliographic Details
Main Authors:	Krishnan, Dilip (Author), Freeman, William T. (Author), Zoran, Daniel (Contributor), Isola, Phillip John (Contributor)
Other Authors:	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory (Contributor)
Format:	Article
Language:	English
Published:	Institute of Electrical and Electronics Engineers (IEEE), 2018-06-06T15:24:09Z.
Subjects:	Article
Online Access:	Get fulltext


LEADER	02079 am a22002173u 4500
001	116143
042			\|a dc
100	1	0	\|a Krishnan, Dilip \|e author
100	1	0	\|a Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory \|e contributor
100	1	0	\|a Zoran, Daniel \|e contributor
100	1	0	\|a Isola, Phillip John \|e contributor
700	1	0	\|a Freeman, William T. \|e author
700	1	0	\|a Zoran, Daniel \|e author
700	1	0	\|a Isola, Phillip John \|e author
245	0	0	\|a Learning Ordinal Relationships for Mid-Level Vision
260			\|b Institute of Electrical and Electronics Engineers (IEEE), \|c 2018-06-06T15:24:09Z.
856			\|z Get fulltext \|u http://hdl.handle.net/1721.1/116143
520			\|a We propose a framework that infers mid-level visual properties of an image by learning about ordinal relationships. Instead of estimating metric quantities directly, the system proposes pairwise relationship estimates for points in the input image. These sparse probabilistic ordinal measurements are globalized to create a dense output map of continuous metric measurements. Estimating order relationships between pairs of points has several advantages over metric estimation: it solves a simpler problem than metric regression, humans are better at relative judgements, so data collection is easier, ordinal relationships are invariant to monotonic transformations of the data, thereby increasing the robustness of the system and providing qualitatively different information. We demonstrate that this frame-work works well on two important mid-level vision tasks: intrinsic image decomposition and depth from an RGB image. We train two systems with the same architecture on data from these two modalities. We provide an analysis of the resulting models, showing that they learn a number of simple rules to make ordinal decisions. We apply our algorithm to depth estimation, with good results, and intrinsic image decomposition, with state-of-the-art results.
546			\|a en_US
655	7		\|a Article
773			\|t 2015 IEEE International Conference on Computer Vision (ICCV)

Learning Ordinal Relationships for Mid-Level Vision

Similar Items