Distributed Object Detection With Linear SVMs

被引:60
作者
Pang, Yanwei [1 ]
Zhang, Kun [1 ]
Yuan, Yuan [2 ]
Wang, Kongqiao [3 ]
机构
[1] Tianjin Univ, Sch Elect Informat Engn, Tianjin 300072, Peoples R China
[2] Chinese Acad Sci, Xian Inst Opt & Precis Mech, Ctr Opt Imagery Anal & Learning, State Key Lab Transient Opt & Photon, Xian 710119, Peoples R China
[3] Nokia Res Ctr, Beijing 100176, Peoples R China
基金
中国国家自然科学基金;
关键词
Cell-based histograms of oriented gradients (CHOG); computer vision; feature extraction; linear classifier; machine learning; object detection; FACE; DIAGNOSIS; SYSTEM;
D O I
10.1109/TCYB.2014.2301453
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In vision and learning, low computational complexity and high generalization are two important goals for video object detection. Low computational complexity here means not only fast speed but also less energy consumption. The sliding window object detection method with linear support vector machines (SVMs) is a general object detection framework. The computational cost is herein mainly paid in complex feature extraction and innerproduct-based classification. This paper first develops a distributed object detection framework (DOD) by making the best use of spatial-temporal correlation, where the process of feature extraction and classification is distributed in the current frame and several previous frames. In each framework, only subfeature vectors are extracted and the response of partial linear classifier (i.e., subdecision value) is computed. To reduce the dimension of traditional block-based histograms of oriented gradients (BHOG) feature vector, this paper proposes a cell-based HOG (CHOG) algorithm, where the features in one cell are not shared with overlapping blocks. Using CHOG as feature descriptor, we develop CHOG-DOD as an instance of DOD framework. Experimental results on detection of hand, face, and pedestrian in video show the superiority of the proposed method.
引用
收藏
页码:2122 / 2133
页数:12
相关论文
共 58 条
[21]   Semi-Supervised Dimension Reduction Using Trace Ratio Criterion [J].
Huang, Yi ;
Xu, Dong ;
Nie, Feiping .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (03) :519-526
[22]   Patch Distribution Compatible Semisupervised Dimension Reduction for Face and Human Gait Recognition [J].
Huang, Yi ;
Xu, Dong ;
Nie, Feiping .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2012, 22 (03) :479-488
[23]   Tangent Hyperplane Kernel Principal Component Analysis for Denoising [J].
Im, Joon-Ku ;
Apley, Daniel W. ;
Runger, George C. .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (04) :644-656
[24]   CONDENSATION - Conditional density propagation for visual tracking [J].
Isard, M ;
Blake, A .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1998, 29 (01) :5-28
[25]   Efficient Subwindow Search: A Branch and Bound Framework for Object Localization [J].
Lampert, Christoph H. ;
Blaschko, Matthew B. ;
Hofmann, Thomas .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (12) :2129-2142
[26]   Fast PRISM: Branch and Bound Hough Transform for Object Class Detection [J].
Lehmann, Alain ;
Leibe, Bastian ;
Van Gool, Luc .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2011, 94 (02) :175-197
[27]   Robust object detection with interleaved categorization and segmentation [J].
Leibe, Bastian ;
Leonardis, Ales ;
Schiele, Bernt .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2008, 77 (1-3) :259-289
[28]   Textual Query of Personal Photos Facilitated by Large-Scale Web Data [J].
Liu, Yiming ;
Xu, Dong ;
Tsang, Ivor Wai-Hung ;
Luo, Jiebo .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) :1022-1036
[29]   Distinctive image features from scale-invariant keypoints [J].
Lowe, DG .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) :91-110
[30]   ASIFT: A New Framework for Fully Affine Invariant Image Comparison [J].
Morel, Jean-Michel ;
Yu, Guoshen .
SIAM JOURNAL ON IMAGING SCIENCES, 2009, 2 (02) :438-469