Hardware Acceleration and Implementation of YOLOX-s for On-Orbit FPGA

被引:2
|
作者
Wang, Ling [1 ,2 ]
Zhou, Hai [1 ]
Bian, Chunjiang [1 ]
Jiang, Kangning [1 ,2 ]
Cheng, Xiaolei [1 ,2 ]
机构
[1] Chinese Acad Sci, Natl Space Sci Ctr, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
convolutional neural network; remote sensing image processing; on-orbit high-performance computing; YOLOX-s; FPGA hardware acceleration; CNN;
D O I
10.3390/electronics11213473
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid development of remote sensing technology has brought about a sharp increase in the amount of remote sensing image data. However, due to the satellite's limited hardware resources, space, and power consumption constraints, it is difficult to process massive remote sensing images efficiently and robustly using the traditional remote sensing image processing methods. Additionally, the task of satellite-to-ground target detection has higher requirements for speed and accuracy under the conditions of more and more remote sensing data. To solve these problems, this paper proposes an extremely efficient and reliable acceleration architecture for forward inference of the YOLOX-s detection network an on-orbit FPGA. Considering the limited onboard resources, the design strategy of the parallel loop unrolling of the input channels and output channels is adopted to build the largest DSP computing array to ensure a reliable and full utilization of the limited computing resources, thus reducing the inference delay of the entire network. Meanwhile, a three-path cache queue and a small-scale cascaded pooling array are designed, which maximize the reuse of on-chip cache data, effectively reduce the bandwidth bottleneck of the external memory, and ensure an efficient computing of the entire computing array. The experimental results show that at the 200 MHz operating frequency of the VC709, the overall inference performance of the FPGA acceleration can reach 399.62 GOPS, the peak performance can reach 408.4 GOPS, and the overall computing efficiency of the DSP array can reach 97.56%. Compared with the previous work, our architecture design further improves the computing efficiency under limited hardware resources.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] AES hardware implementation in FPGA for algorithm acceleration purpose
    Gielata, Artur
    Russek, Pawel
    Wiatr, Kazimierz
    ICSES 2008 INTERNATIONAL CONFERENCE ON SIGNALS AND ELECTRONIC SYSTEMS, CONFERENCE PROCEEDINGS, 2008, : 137 - 140
  • [2] Efficient FPGA Implementation of Feedback Perceptron for Hardware Acceleration
    Mohaidat, Tamador
    Syed, Azeemuddin
    Alqodah, Mohammed
    Khalil, Kasem
    2024 IEEE 3RD INTERNATIONAL CONFERENCE ON COMPUTING AND MACHINE INTELLIGENCE, ICMI 2024, 2024,
  • [3] The method and implementation of a Taiwan building recognition model based on YOLOX-S and illustration enhancement
    Zhuang, Yung-Yu
    Chen, Wei-Hsiang
    Wu, Shao-Kai
    Chang, Wen-Yao
    TERRESTRIAL ATMOSPHERIC AND OCEANIC SCIENCES, 2024, 35 (01):
  • [4] The method and implementation of a Taiwan building recognition model based on YOLOX-S and illustration enhancement
    Yung-Yu Zhuang
    Wei-Hsiang Chen
    Shao-Kai Wu
    Wen-Yao Chang
    Terrestrial, Atmospheric and Oceanic Sciences, 2024, 35
  • [5] Design and implementation of on-board high reliability FPGA on-orbit reconfiguration
    Yu, Longkun
    Yang, Shuai
    Wen, Xiangyang
    Liu, Jiacong
    Yan, Jiarong
    Li, Xinqiao
    Xiong, Shaolin
    Lu, Zhigang
    Gong, Ke
    Liu, Yaqing
    An, Zhenghua
    Zhang, Dali
    Liu, Xiaojing
    Zhao, Xiaoyun
    Zhang, Fan
    Wang, Jinzhou
    Gao, Min
    Xu, Yanbing
    Sun, Xilei
    Wang, Hui
    Zheng, Chao
    Feng, Peiyi
    Wang, Chenwei
    Zhang, Yanqiu
    Zhang, Peng
    Ding, Zhiqiang
    Li, Yongye
    Zhang, Chenxing
    Wang, Jing
    Chen, Li
    MEASUREMENT & CONTROL, 2025,
  • [6] Improving YOLOX-s Dense Garbage Detection Method
    Xie, Ruobing
    Li, Maojun
    Li, Yiwei
    Hu, Jianwen
    Computer Engineering and Applications, 2024, 60 (05) : 250 - 258
  • [7] Visual SLAM Based on YOLOX-S in Dynamic Scenes
    Tian, YingLiang
    Xu, GaoChao
    Li, JiaXing
    Sun, YingJie
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 262 - 266
  • [8] 改进YOLOX-s的密集垃圾检测方法
    谢若冰
    李茂军
    李宜伟
    胡建文
    计算机工程与应用, 2024, 60 (05) : 250 - 258
  • [9] 基于改进YOLOX-S的火灾检测方法
    路佩东
    范菁
    曲金帅
    孙书魁
    云南民族大学学报(自然科学版), 2023, (06) : 771 - 778
  • [10] 基于YOLOX-s的农业害虫检测研究
    张剑飞
    柯赛
    计算机技术与发展, 2023, 33 (05) : 208 - 213