CENTRIST: A Visual Descriptor for Scene Categorization

被引:499
作者
Wu, Jianxin [1 ]
Rehg, James M. [2 ,3 ,4 ,5 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
[2] Georgia Inst Technol, Sch Interact Comp, Atlanta, GA 30332 USA
[3] Georgia Inst Technol, Ctr Robot & Intelligent Machines, Atlanta, GA 30332 USA
[4] Georgia Inst Technol, Ctr Behav Imaging, Atlanta, GA 30332 USA
[5] Georgia Inst Technol, Computat Percept Lab, Atlanta, GA 30332 USA
基金
美国国家科学基金会;
关键词
Place recognition; scene recognition; visual descriptor; Census Transform; SIFT; Gist; SIMULTANEOUS LOCALIZATION; CLASSIFICATION; SHAPE;
D O I
10.1109/TPAMI.2010.224
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
CENsus TRansform hISTogram (CENTRIST), a new visual descriptor for recognizing topological places or scene categories, is introduced in this paper. We show that place and scene recognition, especially for indoor environments, require its visual descriptor to possess properties that are different from other vision domains (e.g., object recognition). CENTRIST satisfies these properties and suits the place and scene recognition task. It is a holistic representation and has strong generalizability for category recognition. CENTRIST mainly encodes the structural properties within an image and suppresses detailed textural information. Our experiments demonstrate that CENTRIST outperforms the current state of the art in several place and scene recognition data sets, compared with other descriptors such as SIFT and Gist. Besides, it is easy to implement and evaluates extremely fast.
引用
收藏
页码:1489 / 1501
页数:13
相关论文
共 39 条
[1]  
[Anonymous], P EUR C COMP VIS
[2]  
[Anonymous], IEEE INT C ROB AUT
[3]   Scene classification using a hybrid generative/discriminative approach [J].
Bosch, Anna ;
Zisserman, Andrew ;
Munoz, Xavier .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (04) :712-727
[4]   Topological simultaneous localization and mapping (SLAM): Toward exact localization without explicit localization [J].
Choset, H ;
Nagatani, K .
IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 2001, 17 (02) :125-137
[5]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[6]   Simultaneous localization and mapping: Part I [J].
Durrant-Whyte, Hugh ;
Bailey, Tim .
IEEE ROBOTICS & AUTOMATION MAGAZINE, 2006, 13 (02) :99-108
[7]  
Fei-Fei L, 2005, PROC CVPR IEEE, P524
[8]  
FEIFEI L, 2004, P IEEE CS C COMP VIS
[9]   Pictorial structures for object recognition [J].
Felzenszwalb, PF ;
Huttenlocher, DP .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2005, 61 (01) :55-79
[10]  
FELZENSZWALB PF, 2007, P IEEE CS C COMP VIS