Describing visual scenes using transformed objects and parts

被引:85
作者
Sudderth, Erik B. [1 ]
Torralba, Antonio [2 ]
Freeman, William T. [2 ]
Willsky, Alan S. [2 ]
机构
[1] Univ Calif Berkeley, Div Comp Sci, Berkeley, CA 94720 USA
[2] MIT, Cambridge, MA 02139 USA
关键词
object recognition; Dirichlet process; hierarchical Dirichlet process; transformation; context; graphical models; scene analysis;
D O I
10.1007/s11263-007-0069-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We develop hierarchical, probabilistic models for objects, the parts composing them, and the visual scenes surrounding them. Our approach couples topic models originally developed for text analysis with spatial transformations, and thus consistently accounts for geometric constraints. By building integrated scene models, we may discover contextual relationships, and better exploit partially labeled training images. We first consider images of isolated objects, and show that sharing parts among object categories improves detection accuracy when learning from few examples. Turning to multiple object scenes, we propose nonparametric models which use Dirichlet processes to automatically learn the number of parts underlying each object category, and objects composing each scene. The resulting transformed Dirichlet process (TDP) leads to Monte Carlo algorithms which simultaneously segment and recognize objects in street and office scenes.
引用
收藏
页码:291 / 330
页数:40
相关论文
共 69 条
[1]   Dynamic trees for image modelling [J].
Adams, NJ ;
Williams, CKI .
IMAGE AND VISION COMPUTING, 2003, 21 (10) :865-877
[2]  
AMIT Y, 2007, CATEGORY LEVEL OBJEC
[3]  
[Anonymous], IEEE CVPR
[4]  
[Anonymous], 2021, Bayesian Data Analysis
[5]  
[Anonymous], 2004, ECCV WORKSHOPS
[6]   Matching words and pictures [J].
Barnard, K ;
Duygulu, P ;
Forsyth, D ;
de Freitas, N ;
Blei, DM ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) :1107-1135
[7]   Shape matching and object recognition using shape contexts [J].
Belongie, S ;
Malik, J ;
Puzicha, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) :509-522
[8]  
Bienenstock E, 1997, ADV NEUR IN, V9, P838
[9]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[10]  
Borenstein E, 2002, LECT NOTES COMPUT SC, V2351, P109