Latent Dirichlet Allocation Models for Image Classification

被引:76
作者
Rasiwasia, Nikhil [1 ]
Vasconcelos, Nuno [2 ]
机构
[1] Yahoo Labs Bangalore, Bengaluru 560071, Karnataka, India
[2] Univ Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA
基金
美国国家科学基金会;
关键词
Image classification; graphical models; latent Dirichlet allocation; semantic classification; attributes; HIERARCHICAL MODEL;
D O I
10.1109/TPAMI.2013.69
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Two new extensions of latent Dirichlet allocation (LDA), denoted topic-supervised LDA (ts-LDA) and class-specific-simplex LDA (css-LDA), are proposed for image classification. An analysis of the supervised LDA models currently used for this task shows that the impact of class information on the topics discovered by these models is very weak in general. This implies that the discovered topics are driven by general image regularities, rather than the semantic regularities of interest for classification. To address this, ts-LDA models are introduced which replace the automated topic discovery of LDA with specified topics, identical to the classes of interest for classification. While this results in improvements in classification accuracy over existing LDA models, it compromises the ability of LDA to discover unanticipated structure of interest. This limitation is addressed by the introduction of css-LDA, an LDA model with class supervision at the level of image features. In css-LDA topics are discovered per class, i.e., a single set of topics shared across classes is replaced by multiple class-specific topic sets. The css-LDA model is shown to combine the labeling strength of topic-supervision with the flexibility of topic-discovery. Its effectiveness is demonstrated through an extensive experimental evaluation, involving multiple benchmark datasets, where it is shown to outperform existing LDA-based image classification approaches.
引用
收藏
页码:2665 / 2679
页数:15
相关论文
共 28 条
[1]  
[Anonymous], 2003, P 26 ANN INT ACM SIG
[2]  
[Anonymous], 2008, Advances in Neural Information Processing Systems
[3]  
[Anonymous], 2006, PATTERN RECOGN
[4]  
[Anonymous], 2007, Handbook of latent semantic analysis
[5]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[6]   Scene classification using a hybrid generative/discriminative approach [J].
Bosch, Anna ;
Zisserman, Andrew ;
Munoz, Xavier .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (04) :712-727
[7]   Learning Mid-Level Features For Recognition [J].
Boureau, Y-Lan ;
Bach, Francis ;
LeCun, Yann ;
Ponce, Jean .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :2559-2566
[8]   Operations for Learning with Graphical Models [J].
Buntine, Wray L. .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1994, 2 :159-225
[9]   Supervised learning of semantic classes for image annotation and retrieval [J].
Carneiro, Gustavo ;
Chan, Antoni B. ;
Moreno, Pedro J. ;
Vasconcelos, Nuno .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (03) :394-410
[10]  
Wang C, 2009, PROC CVPR IEEE, P1903, DOI [10.1109/CVPR.2009.5206800, 10.1109/CVPRW.2009.5206800]