Distributed Object Detection With Linear SVMs

被引：60

作者：

Pang, Yanwei ^{[1
]}

Zhang, Kun ^{[1
]}

Yuan, Yuan ^{[2
]}

Wang, Kongqiao ^{[3
]}

机构：

[1] Tianjin Univ, Sch Elect Informat Engn, Tianjin 300072, Peoples R China

[2] Chinese Acad Sci, Xian Inst Opt & Precis Mech, Ctr Opt Imagery Anal & Learning, State Key Lab Transient Opt & Photon, Xian 710119, Peoples R China

[3] Nokia Res Ctr, Beijing 100176, Peoples R China

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2014年 / 44卷 / 11期

基金：

中国国家自然科学基金;

关键词：

Cell-based histograms of oriented gradients (CHOG); computer vision; feature extraction; linear classifier; machine learning; object detection; FACE; DIAGNOSIS; SYSTEM;

D O I：

10.1109/TCYB.2014.2301453

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In vision and learning, low computational complexity and high generalization are two important goals for video object detection. Low computational complexity here means not only fast speed but also less energy consumption. The sliding window object detection method with linear support vector machines (SVMs) is a general object detection framework. The computational cost is herein mainly paid in complex feature extraction and innerproduct-based classification. This paper first develops a distributed object detection framework (DOD) by making the best use of spatial-temporal correlation, where the process of feature extraction and classification is distributed in the current frame and several previous frames. In each framework, only subfeature vectors are extracted and the response of partial linear classifier (i.e., subdecision value) is computed. To reduce the dimension of traditional block-based histograms of oriented gradients (BHOG) feature vector, this paper proposes a cell-based HOG (CHOG) algorithm, where the features in one cell are not shared with overlapping blocks. Using CHOG as feature descriptor, we develop CHOG-DOD as an instance of DOD framework. Experimental results on detection of hand, face, and pedestrian in video show the superiority of the proposed method.

引用

页码：2122 / 2133

页数：12

共 58 条

[1] Face description with local binary patterns:: Application to face recognition [J].

Ahonen, Timo ;

Hadid, Abdenour ;

Pietikainen, Matti .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (12) :2037-2041

[2]

[Anonymous], 2005, PROC CVPR IEEE

[3] SURF: Speeded up robust features [J].

Bay, Herbert ;

Tuytelaars, Tinne ;

Van Gool, Luc .

COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 :404-417

[4] On the design of Cascades of boosted ensembles for face detection [J].

Brubaker, S. Charles ;

Wu, Jianxin ;

Sun, Jie ;

Mullin, Matthew D. ;

Rehg, James M. .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2008, 77 (1-3) :65-86

[5] A tutorial on Support Vector Machines for pattern recognition [J].

Burges, CJC .

DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167

[6] BRIEF: Binary Robust Independent Elementary Features [J].

Calonder, Michael ;

Lepetit, Vincent ;

Strecha, Christoph ;

Fua, Pascal .

COMPUTER VISION-ECCV 2010, PT IV, 2010, 6314 :778-792

[7] Visual Attention Accelerated Vehicle Detection in Low-Altitude Airborne Video of Urban Environment [J].

Cao, Xianbin ;

Lin, Renjun ;

Yan, Pingkun ;

Li, Xuelong .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2012, 22 (03) :366-378

[8] Compressed Histogram of Gradients: A Low-Bitrate Descriptor [J].

Chandrasekhar, Vijay ;

Takacs, Gabriel ;

Chen, David M. ;

Tsai, Sam S. ;

Reznik, Yuriy ;

Grzeszczuk, Radek ;

Girod, Bernd .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 96 (03) :384-399

[9]

Chang E., 2007, C NEUR INF PROC SYST, P213

[10] Mean shift: A robust approach toward feature space analysis [J].

Comaniciu, D ;

Meer, P .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) :603-619

← 1 2 3 4 5 6 →