Ordinal pyramid coding for rotation invariant feature extraction

Cited by: 3

Authors:

Wang, Guoli [1,2]

Fan, Bin [1]

Zhou, Zhili [3,4]

Pan, Chunhong [1]

Affiliations:

[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, 95 Zhong Guan Cun East Rd, Beijing, Peoples R China

[2] Univ Chinese Acad Sci, Sch Comp & Control Engn, Beijing 101408, Peoples R China

[3] Nanjing Univ Informat Sci & Technol, Jiangsu Engn Ctr Network Monitoring, Nanjing 210044, Jiangsu, Peoples R China

[4] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Jiangsu, Peoples R China

Source:

NEUROCOMPUTING | 2017 / Vol. 242

Funding:

National Natural Science Foundation of China;

Keywords:

Rotation invariant; Ordinal pyramid pooling; Fisher vector; Feature extraction; SCENE CLASSIFICATION; IMAGE CLASSIFICATION; OBJECT DETECTION; REPRESENTATION; EFFICIENT; DEEP; HOG;

DOI:

10.1016/j.neucom.2017.02.071

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper proposes a novel rotation invariant feature for object recognition. First, the local Fourier transform features of pixels in the described region are encoded by Fisher Vectors. Then, the encoded vectors are aggregated into a final representation by ordinal pyramid pooling, which hierarchically partitions the described region into sub-regions based on the orders of its pixels' rotation invariants. Since both the encoded Fisher Vectors and the ordinal pyramid pooling strategy are rotation invariant, the extracted feature is rotation invariant by nature. Two kinds of rotation invariants are investigated in this framework: the Radial Gradient Orientation and the Radial Gradient Angle. Experiments on handwritten digit recognition and airplane/car detection in aerial images demonstrate the effectiveness of the proposed method, which outperforms the state of the art. (C) 2017 Elsevier B.V. All rights reserved.
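The pooling step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact partitioning scheme, the pooling operator, or the number of pyramid levels, so everything here is an assumption for illustration only: the function name `ordinal_pyramid_pool`, the choice of 2**l rank-based groups at level l, and mean pooling are hypothetical, and the per-pixel encodings stand in for the Fisher Vector encodings of local Fourier transform features.

```python
import numpy as np

def ordinal_pyramid_pool(encodings, invariants, levels=3):
    """Pool per-pixel encodings over rank-based groups of a scalar invariant.

    encodings:  (N, D) array, one encoded vector per pixel (hypothetical input).
    invariants: (N,) array, one rotation-invariant scalar per pixel.
    levels:     number of pyramid levels; level l uses 2**l groups (assumed).
    """
    encodings = np.asarray(encodings, dtype=float)
    invariants = np.asarray(invariants, dtype=float)
    # Rank pixels by their invariant value; pixel positions are never used,
    # so moving pixels around (as a rotation does) leaves the groups unchanged.
    order = np.argsort(invariants, kind="stable")
    sorted_enc = encodings[order]
    pooled = []
    for level in range(levels):
        # Split the ranked pixels into 2**level nearly equal-sized groups.
        for group in np.array_split(sorted_enc, 2 ** level):
            # Mean pooling (assumed); an empty group contributes zeros.
            pooled.append(group.sum(axis=0) / max(len(group), 1))
    return np.concatenate(pooled)

# Usage: 50 pixels with 4-dimensional encodings and distinct invariant values.
rng = np.random.default_rng(0)
enc = rng.normal(size=(50, 4))
inv = rng.permutation(50).astype(float)
feature = ordinal_pyramid_pool(enc, inv)   # length 4 * (1 + 2 + 4) = 28
```

Because the groups are defined by the rank of each pixel's invariant value rather than by its position, reordering the pixels does not change the pooled feature in this sketch, which is the property the abstract attributes to ordinal pyramid pooling.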


Pages: 150-160

Page count: 11

References: 44 in total

