Accelerating AdaBoost Algorithm Using GPU for Multi-Object Recognition

被引:0
作者
Tsai, Pin Yi [1 ]
Hsu, Yarsun [2 ]
Chiu, Ching-Te [3 ]
Chu, Tsai-Te [3 ]
机构
[1] Natl Tsing Hua Univ, Inst Informat Syst & Applicat, Hsinchu, Taiwan
[2] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu, Taiwan
[3] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan
来源
2015 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS) | 2015年
关键词
advanced driver assistance system (ADAS); Compute Unified Device Architecture (CUDA); adaptive boosting (AdaBoost); graphics processing unit (GPU); object recognition;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Traditionally, an adaptive boosting (AdaBoost) algorithm is used for object recognition because of its prevalent usage and well-trained results. However, because the computation of AdaBoost is extremely time-consuming, it is difficult to guarantee that the computations reflect the latest information in real time. To speed-up the operation, the original AdaBoost algorithm was accelerated with a graphics processing unit (GPU). In this study, Compute Unified Device Architecture (CUDA) was used to accelerate two parts of the AdaBoost algorithm, including feature extraction and training, by applying various strategies to system components such as how the data is put in the memory, amount of CUDA streams, trunk size, and block size. In Feature Extraction of the car datasets, the most time-consuming step feature-value computation is 47.18 times faster than the CPU version. For AdaBoost Training, the total execution is accelerated by 34.23 times.
引用
收藏
页码:738 / 741
页数:4
相关论文
共 15 条
[1]  
[Anonymous], ADV DRIV ASS SYST AD
[2]  
[Anonymous], IET COMPUTERS DIGITA
[3]  
[Anonymous], THRUST NVIDIA DEV
[4]   Efficient Integral Image Computation on the GPU [J].
Bilgic, Berkin ;
Horn, Berthold K. P. ;
Masaki, Ichiro .
2010 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2010, :528-533
[5]   A 0.64 mm2 Real-Time Cascade Face Detection Design Based on Reduced Two-Field Extraction [J].
Chen, Chih-Rung ;
Wong, Wei-Su ;
Chiu, Ching-Te .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2011, 19 (11) :1937-1948
[6]   Cascade Object Detection with Deformable Part Models [J].
Felzenszwalb, Pedro F. ;
Girshick, Ross B. ;
McAllester, David .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :2241-2248
[7]  
Hwu Wen-meiW., 2010, Programming Massively Parallel Processors ~ A Hands-on Approach
[8]  
Martínez-Zarzuela M, 2011, LECT NOTES COMPUT SC, V6687, P333, DOI 10.1007/978-3-642-21326-7_36
[9]  
Masek J, 2013, 2013 36TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), P808, DOI 10.1109/TSP.2013.6614050
[10]   Design and Performance Evaluation of Image Processing Algorithms on GPUs [J].
Park, In Kyu ;
Singhal, Nitin ;
Lee, Man Hee ;
Cho, Sungdae ;
Kim, Chris W. .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2011, 22 (01) :91-104