F-E3D: FPGA-based Acceleration of an Efficient 3D Convolutional Neural Network for Human Action Recognition

被引:37
|
作者
Fan, Hongxiang [1 ]
Luo, Cheng [2 ]
Zeng, Chenglong [3 ]
Ferianc, Martin [1 ]
Que, Zhiqiang [1 ]
Liu, Shuanglong [1 ]
Niu, Xinyu [4 ]
Luk, Wayne [1 ]
机构
[1] Imperial Coll London, Sch Engn, Dept Comp, London, England
[2] Fudan Univ, State Key Lab ASIC & Syst, Shanghai, Peoples R China
[3] Tianjin Univ, Sch Microelect, Tianjin, Peoples R China
[4] Corerain Technol Ltd, Shenzhen, Peoples R China
来源
2019 IEEE 30TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2019) | 2019年
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1109/ASAP.2019.00-44
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Three-dimensional convolutional neural networks (3D CNNs) have demonstrated their outstanding classification accuracy for human action recognition (HAR). However, the large number of computations and parameters in 3D CNNs limits their deployability in real-life applications. To address this challenge, this paper adopts an algorithm-hardware co-design method by proposing an efficient 3D CNN building unit called 3D-1 bottleneck residual block (3D-1 BRB) at the algorithm level, and a corresponding FPGA-based hardware architecture called F-E3D at hardware level. Based on 3D-1 BRB, a novel 3D CNN model called E3DNet is developed, which achieves nearly 37 times reduction in model size and 5% improvement in accuracy compared to standard 3D CNNs on the UCF101 dataset. Together with several hardware optimizations, including 3D fused BRB, online blocking and kernel reuse, the proposed F-E3D is nearly 13 times faster than a previous FPGA design for 3D CNNs, with performance and accuracy comparable to other state-of-the-art 3D CNN models on GPU platforms while requiring only 7% of their energy consumption.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 50 条
  • [41] Modulation recognition using an FPGA-based convolutional neural network
    Liu, Xueyuan
    Shang, Jing
    Leong, Philip H. W.
    Liu, Cheng
    2019 22ND INTERNATIONAL CONFERENCE ON ELECTRICAL MACHINES AND SYSTEMS (ICEMS 2019), 2019, : 3165 - 3170
  • [42] Action recognition method based on a novel keyframe extraction method and enhanced 3D convolutional neural network
    Tian, Qiuhong
    Li, Saiwei
    Zhang, Yuankui
    Lu, Hongyi
    Pan, Hao
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025, 16 (01) : 475 - 491
  • [43] 2D and 3D Face Recognition Using Convolutional Neural Network
    Hu, Huiying
    Shah, Syed Afaq Ali
    Bennamoun, Mohammed
    Molton, Michael
    TENCON 2017 - 2017 IEEE REGION 10 CONFERENCE, 2017, : 133 - 138
  • [44] Facial Expression Recognition Using 3D Convolutional Neural Network
    Byeon, Young-Hyen
    Kwak, Keun-Chang
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (12) : 107 - 112
  • [45] Working activity recognition approach based on 3D deep convolutional neural network
    Liu T.
    Lu Z.
    Sun Y.
    Liu F.
    He B.
    Zhong J.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2020, 26 (08): : 2143 - 2156
  • [46] Micro-expression recognition based on 3D flow convolutional neural network
    Li, Jing
    Wang, Yandan
    See, John
    Liu, Wenbin
    PATTERN ANALYSIS AND APPLICATIONS, 2019, 22 (04) : 1331 - 1339
  • [47] Micro-expression recognition based on 3D flow convolutional neural network
    Jing Li
    Yandan Wang
    John See
    Wenbin Liu
    Pattern Analysis and Applications, 2019, 22 : 1331 - 1339
  • [48] Dynamic Hand Gesture Recognition Based on 3D Convolutional Neural Network Models
    Zhang, Wenjin
    Wang, Jiacun
    PROCEEDINGS OF THE 2019 IEEE 16TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC 2019), 2019, : 224 - 229
  • [49] Point cloud based deep convolutional neural network for 3D face recognition
    Bhople, Anagha R.
    Shrivastava, Akhilesh M.
    Prakash, Surya
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (20) : 30237 - 30259
  • [50] Point cloud based deep convolutional neural network for 3D face recognition
    Anagha R. Bhople
    Akhilesh M. Shrivastava
    Surya Prakash
    Multimedia Tools and Applications, 2021, 80 : 30237 - 30259