Performance Optimisation of Parallelized ADAS Applications in FPGA-GPU Heterogeneous Systems: A Case Study With Lane Detection

被引:9
作者
Wang, Xiebing [1 ]
Huang, Kai [2 ,3 ]
Knoll, Alois [1 ]
机构
[1] Tech Univ Munich, Dept Informat, D-85748 Garching, Germany
[2] Sun Yat Sen Univ, Key Lab Machine Intelligence & Adv Comp, Minist Educ, Guangzhou 510006, Peoples R China
[3] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Peoples R China
来源
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2019年 / 4卷 / 04期
基金
中国国家自然科学基金;
关键词
ADAS; FPGA; GPU; OpenCL; lane detection; VISION;
D O I
10.1109/TIV.2019.2938092
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The explosive growth of massive data captured by various sensors on modern vehicles has impelled the deployment of Commercial Off-The-Shelf (COTS) accelerators for the research and development of Advanced Driver Assistance Systems (ADAS). Although the advent of cross-platform programming framework such as Open Computing Language (OpenCL) facilitates the programmability of ADAS applications on heterogeneous devices, the performance portability is still vulnerable and subject to different hardware implementations by the heterogeneous manufacturers. With this issue in mind, in this article we propose a detailed procedure that helps guide the performance optimisation of parallelized ADAS applications in an FPGA-GPU combined heterogeneous system. Taking two different lane detection applications as case studies, we provide one intra-accelerator and two interaccelerator optimisation methods, as well as both FPGA-specific and application-oriented optimisation strategies, to boost the program runtime performance. Experiment results on a heterogeneous platform with COTS FPGA and GPU components reveal that the optimal designs generated from the procedure can improve the runtime performance of the two applications by an average of 109.21% and 83.48% over the native parallel implementations, respectively.
引用
收藏
页码:519 / 531
页数:13
相关论文
共 42 条
  • [1] Aly M., CALTECH LANES
  • [2] Real-time lane departure warning system based on a single FPGA
    An, Xiangjing
    Shang, Erke
    Song, Jinze
    Li, Jian
    He, Hangen
    [J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2013,
  • [3] Characterizing a Heterogeneous System for Person Detection in Video Using Histograms of Oriented Gradients: Power Versus Speed Versus Accuracy
    Blair, Calum
    Robertson, Neil M.
    Hume, Danny
    [J]. IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2013, 3 (02) : 236 - 247
  • [4] Dietrich R, 2015, INT WORKSH INT DATA, P334, DOI 10.1109/IDAACS.2015.7340754
  • [5] A Precise Lane Detection Algorithm Based on Top View Image Transformation and Least-Square Approaches
    Dorj, Byambaa
    Lee, Deok Jin
    [J]. JOURNAL OF SENSORS, 2016, 2016
  • [6] Fan L., 2016, IACR, V2016, P1
  • [7] Fickenscher J, 2017, IEEE INT VEH SYM, P959, DOI 10.1109/IVS.2017.7995839
  • [8] RANDOM SAMPLE CONSENSUS - A PARADIGM FOR MODEL-FITTING WITH APPLICATIONS TO IMAGE-ANALYSIS AND AUTOMATED CARTOGRAPHY
    FISCHLER, MA
    BOLLES, RC
    [J]. COMMUNICATIONS OF THE ACM, 1981, 24 (06) : 381 - 395
  • [9] OpenCL Application Auto-Tuning and Run-Time Resource Management for Multi-Core Platforms
    Gadioli, Davide
    Libutti, Simone
    Massari, Giuseppe
    Paone, Edoardo
    Scandale, Michele
    Bellasi, Patrick
    Palermo, Gianluca
    Zaccaria, Vittorio
    Agosta, Giovanni
    Fornaciari, William
    Silvano, Cristina
    [J]. 2014 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA), 2014, : 127 - 133
  • [10] Accelerating Mobile Audio Sensing Algorithms through On-Chip GPU Offloading
    Georgiev, Petko
    Lane, Nicholas D.
    Mascolo, Cecilia
    Chu, David
    [J]. MOBISYS'17: PROCEEDINGS OF THE 15TH ANNUAL INTERNATIONAL CONFERENCE ON MOBILE SYSTEMS, APPLICATIONS, AND SERVICES, 2017, : 306 - 318