Toward Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models

被引:30
作者
Li, Congcong [1 ]
Kowdle, Adarsh [1 ]
Saxena, Ashutosh [2 ]
Chen, Tsuhan [1 ]
机构
[1] Cornell Univ, Dept Elect & Comp Engn, Ithaca, NY 14853 USA
[2] Cornell Univ, Dept Comp Sci, Ithaca, NY 14853 USA
关键词
Scene understanding; classification; machine learning; robotics; ALGORITHM;
D O I
10.1109/TPAMI.2011.232
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene understanding includes many related subtasks, such as scene categorization, depth estimation, object detection, etc. Each of these subtasks is often notoriously hard, and state-of-the-art classifiers already exist for many of them. These classifiers operate on the same raw image and provide correlated outputs. It is desirable to have an algorithm that can capture such correlation without requiring any changes to the inner workings of any classifier. We propose Feedback Enabled Cascaded Classification Models (FE-CCM), that jointly optimizes all the subtasks while requiring only a "black box" interface to the original classifier for each subtask. We use a two-layer cascade of classifiers, which are repeated instantiations of the original ones, with the output of the first layer fed into the second layer as input. Our training method involves a feedback step that allows later classifiers to provide earlier classifiers information about which error modes to focus on. We show that our method significantly improves performance in all the subtasks in the domain of scene understanding, where we consider depth estimation, scene categorization, event categorization, object detection, geometric labeling, and saliency detection. Our method also improves performance in two robotic applications: an object-grasping robot and an object-finding robot.
引用
收藏
页码:1394 / 1408
页数:15
相关论文
共 68 条
[1]  
AGARWAL A, 2005, P IEEE C COMP VIS PA
[2]  
Ando RK, 2005, J MACH LEARN RES, V6, P1817
[3]  
[Anonymous], 2008, P EUR C COMP VIS
[4]  
[Anonymous], P IEEE C COMP VIS PA
[5]  
[Anonymous], P NEUR INF PROC SYST
[6]  
[Anonymous], DISCRIMINATIVELY TRA
[7]  
[Anonymous], 2008, P IEEE C COMP VIS PA
[8]  
[Anonymous], 2007, P 11 IEEE INT C COMP
[9]  
[Anonymous], P IEEE C COMP VIS PA
[10]  
[Anonymous], 2009, P 12 IEEE INT C COMP