SpikeMotion: A Transformer Framework for High-Throughput Video Segmentation on FPGA

被引:0
|
作者
Udeji, Uchechukwu Leo [1 ]
Margala, Martin [2 ]
机构
[1] Univ Massachusetts Lowell, Elect & Comp Engn Dept, Lowell, MA 01854 USA
[2] Univ Louisiana Lafayette, Comp & Informat Dept, Lafayette, LA 70504 USA
关键词
Transformer; Self-Attention; Hyperattention; Spiking neural networks; FPGA; Neuromorphic computing; ACCELERATOR;
D O I
10.1109/MWSCAS60917.2024.10658767
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Transformers have emerged at the forefront in the training and inference of diverse machine learning tasks, encompassing video processing, image generation and classification, and natural language processing (NLP). Despite their increasing prevalence, a comprehensive framework to efficiently implement them has been lacking. This study introduces a transformer-based framework which accelerates image processing of UNETR (U-shaped neural network transformer) model for video segmentation task using the cityscapes dataset. Given the large size of images in the dataset we incorporate hyperattention and mixed precision in our design. Our model is trained on Google A1OO GPU accelerator and profiled. Finally, our design is implemented on FPGA to take advantage of the reconfigurable and high-throughput characteristics of system-on-chips (SoC) for image processing. Our results indicate improvements compared to existing research in this domain.
引用
收藏
页码:818 / 822
页数:5
相关论文
共 50 条
  • [1] HitGraph: High-throughput Graph Processing Framework on FPGA
    Zhou, Shijie
    Kannan, Rajgopal
    Prasanna, Viktor K.
    Seetharaman, Guna
    Wu, Qing
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (10) : 2249 - 2264
  • [2] High-throughput Online Hash Table on FPGA
    Tong, Da
    Zhou, Shijie
    Prasanna, Viktor K.
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, : 105 - 112
  • [3] High-Throughput FPGA Implementation of QR Decomposition
    Munoz, Sergio D.
    Hormigo, Javier
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2015, 62 (09) : 861 - 865
  • [4] A high-throughput framework for lattice dynamics
    Zhu, Zhuoying
    Park, Junsoo
    Sahasrabuddhe, Hrushikesh
    Ganose, Alex M.
    Chang, Rees
    Lawson, John W.
    Jain, Anubhav
    NPJ COMPUTATIONAL MATERIALS, 2024, 10 (01)
  • [5] High-Throughput Polynomial Multiplier for Accelerating Saber on FPGA
    Cui, Yijun
    Zhang, Yuantuo
    Ni, Ziying
    Yu, Shichao
    Wang, Chenghua
    Liu, Weiqiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (09) : 3584 - 3588
  • [6] GeneTrail: A Framework for the Analysis of High-Throughput Profiles
    Gerstner, Nico
    Kehl, Tim
    Lenhof, Kerstin
    Eckhart, Lea
    Schneider, Lara
    Stoeckel, Daniel
    Backes, Christina
    Meese, Eckart
    Keller, Andreas
    Lenhof, Hans-Peter
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2021, 8
  • [7] A framework for high-throughput and comparative cell biology
    Avasthi, P.
    Matus, D.
    Mets, D.
    York, R.
    MOLECULAR BIOLOGY OF THE CELL, 2023, 34 (02) : 855 - 856
  • [8] A Novel High-Throughput Multispectral Cell Segmentation Algorithm
    Golbstein, Jenia
    Tocker, Yaniv
    Sharivkin, Revital
    Tarcic, Gabi
    Vidne, Michael
    MEDICAL IMAGE UNDERSTANDING AND ANALYSIS (MIUA 2017), 2017, 723 : 754 - 766
  • [9] High-throughput segmentation of unmyelinated axons by deep learning
    Emanuele Plebani
    Natalia P. Biscola
    Leif A. Havton
    Bartek Rajwa
    Abida Sanjana Shemonti
    Deborah Jaffey
    Terry Powley
    Janet R. Keast
    Kun-Han Lu
    M. Murat Dundar
    Scientific Reports, 12
  • [10] High-throughput segmentation of unmyelinated axons by deep learning
    Plebani, Emanuele
    Biscola, Natalia P.
    Havton, Leif A.
    Rajwa, Bartek
    Shemonti, Abida Sanjana
    Jaffey, Deborah
    Powley, Terry
    Keast, Janet R.
    Lu, Kun-Han
    Dundar, M. Murat
    SCIENTIFIC REPORTS, 2022, 12 (01)