Ordinal pyramid coding for rotation invariant feature extraction

被引:3
作者
Wang, Guoli [1 ,2 ]
Fan, Bin [1 ]
Zhou, Zhili [3 ,4 ]
Pan, Chunhong [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, 95 Zhong Guan Cun East Rd, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp & Control Engn, Beijing 101408, Peoples R China
[3] Nanjing Univ Informat Sci & Technol, Jiangsu Engn Ctr Network Monitoring, Nanjing 210044, Jiangsu, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Rotation invariant; Ordinal pyramid pooling; Fisher vector; Feature extraction; SCENE CLASSIFICATION; IMAGE CLASSIFICATION; OBJECT DETECTION; REPRESENTATION; EFFICIENT; DEEP; HOG;
D O I
10.1016/j.neucom.2017.02.071
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel rotation invariant feature for object recognition. Firstly, the local Fourier transform features of pixels in the described region are encoded by Fisher Vectors. Then, the encoded vectors are aggregated into a final representation by ordinal pyramid pooling, which hierarchically partitions the described region into sub-regions based on the orders of its pixels' rotation invariants. Since both the encoded Fisher Vectors and the ordinal pyramid pooling strategy are rotation invariant, the extracted feature is rotation invariant by nature. Two kinds of rotation invariants are investigated in this framework, one is the Radial Gradient Orientation and the other is the Radial Gradient Angle. Experiments on handwritten digit recognition and airplane/car detection in aerial images demonstrate the effectiveness of the proposed method, which outperforms the state of the art. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:150 / 160
页数:11
相关论文
共 44 条
  • [1] [Anonymous], 2010, P 18 ACM INT C MULT, DOI [10.1145/1873951.1874249, 10.1145/1873951.1874249.2]
  • [2] [Anonymous], 2005, PROC CVPR IEEE
  • [3] [Anonymous], 2007, P IEEE COMP SOC C CO
  • [4] Speeded-Up Robust Features (SURF)
    Bay, Herbert
    Ess, Andreas
    Tuytelaars, Tinne
    Van Gool, Luc
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2008, 110 (03) : 346 - 359
  • [5] Aircraft Detection by Deep Belief Nets
    Chen, Xueyun
    Xiang, Shiming
    Liu, Cheng-Lin
    Pan, Chun-Hong
    [J]. 2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013), 2013, : 54 - 58
  • [6] Vehicle Detection in Satellite Images by Hybrid Deep Convolutional Neural Networks
    Chen, Xueyun
    Xiang, Shiming
    Liu, Cheng-Lin
    Pan, Chun-Hong
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2014, 11 (10) : 1797 - 1801
  • [7] BING: Binarized Normed Gradients for Objectness Estimation at 300fps
    Cheng, Ming-Ming
    Zhang, Ziming
    Lin, Wen-Yan
    Torr, Philip
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3286 - 3293
  • [8] Csurka G., 2004, WORKSH STAT LEARN CO, V1, P1, DOI DOI 10.1234/12345678
  • [9] The Pascal Visual Object Classes (VOC) Challenge
    Everingham, Mark
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) : 303 - 338
  • [10] Rotationally Invariant Descriptors Using Intensity Order Pooling
    Fan, Bin
    Wu, Fuchao
    Hu, Zhanyi
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (10) : 2031 - 2045