Moving object classification using local shape and HOG features in wavelet-transformed space with hierarchical SVM classifiers

被引:31
作者
Liang, Chung-Wei [1 ]
Juang, Chia-Feng [1 ]
机构
[1] Natl Chung Hsing Univ, Dept Elect Engn, Taichung 402, Taiwan
关键词
Pedestrian detection; Car detection; Multi class object classification; Haar wavelet transform; Histogram of oriented gradients (HOG); Support vector machines; SYSTEM;
D O I
10.1016/j.asoc.2014.09.051
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an integrated system for the segmentation and classification of four moving objects, including pedestrians, cars, motorcycles, and bicycles, from their side-views in a video sequence. Based on the use of an adaptive background in the red-green-blue (RGB) color model, each moving object is segmented with its minimum enclosing rectangle (MER) window by using a histogram-based projection approach or a tracking-based approach. Additionally, a shadow removal technique is applied to the segmented objects to improve the classification performance. For the MER windows with different sizes, a window scaling operation followed by an adaptive block-shifting operation is applied to obtain a fixed feature dimension. A weight mask, which is constructed according to the frequency of occurrence of an object in each position within a square window, is proposed to enhance the distinguishing pixels in the rescaled MER window. To extract classification features, a two-level Haar wavelet transform is applied to the rescaled MER window. The local shape features and the modified histogram of oriented gradients (HOG) are extracted from the level-two and level-one sub-bands, respectively, of the wavelet-transformed space. A hierarchical linear support vector machine classification configuration is proposed to classify the four classes of objects. Six video sequences are used to test the classification performance of the proposed method. The computer processing times of the object segmentation, object tracking, and feature extraction and classification approaches are 79 ms, 211 ms, and 0.01 ms, respectively. Comparisons with different well-known classification approaches verify the superiority of the proposed classification method. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:483 / 497
页数:15
相关论文
共 42 条
[1]   Face description with local binary patterns:: Application to face recognition [J].
Ahonen, Timo ;
Hadid, Abdenour ;
Pietikainen, Matti .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (12) :2037-2041
[2]  
[Anonymous], PROC IEEE INTELL VEH
[3]  
[Anonymous], IFEEITSJ ILL CONF IN
[4]  
[Anonymous], PROC IEEE INTELL TRA
[5]  
[Anonymous], 1999, P IEEE C COMPUTER VI
[6]  
[Anonymous], PROC IEEE INTELL VEH
[7]  
[Anonymous], P IEEE RSJ INT C INT
[8]  
[Anonymous], IEEE T INTEL TRANSOR
[9]   Practical mixtures of Gaussians with brightness monitoring [J].
Atev, S ;
Masoud, O ;
Papanikolopoulos, N .
ITSC 2004: 7TH INTERNATIONAL IEEE CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, PROCEEDINGS, 2004, :423-428
[10]   A Real-Time Vision System for Nighttime Vehicle Detection and Traffic Surveillance [J].
Chen, Yen-Lin ;
Wu, Bing-Fei ;
Huang, Hao-Yu ;
Fan, Chung-Jui .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2011, 58 (05) :2030-2044