Design and evaluation of multi-GPU enabled Multiple Symbol Detection algorithm

被引:0
|
作者
Ying Liu
Haixin Zheng
Renliang Zhao
Liheng Jian
机构
[1] University of Chinese Academy of Sciences,School of Computer and Control
[2] Chinese Academy of Sciences,Key Lab of Big Data Mining and Knowledge Management
[3] Academy of Equipment,School of Electronic, Electrical and Communication Engineering
[4] University of Chinese Academy of Sciences,undefined
来源
The Journal of Supercomputing | 2016年 / 72卷
关键词
Parallel computing; CUDA; Multiple Symbol Detection; Multi-GPU; Demodulation; Telemetry;
D O I
暂无
中图分类号
学科分类号
摘要
Multiple Symbol Detection (MSD) is an important technique in digital signal processing. It estimates the sequence of the received signal by maximum-likelihood principle. Due to its high computational complexity, currently, MSD algorithms were implemented in specialized signal processing devices, such as Field Programmable Gate Arrays (FPGAs). As the rapid development of CUDA, GPU has successfully accelerated applications in a variety of domains. In this paper, we explore to utilize CUDA-enabled GPUs to accelerate MSD algorithm. The computation core of MSD, sliding correlation problem, is formulated and an efficient CUDA parallelization scheme is proposed. CUDA-enabled MSD (CU-MSD) algorithm is implemented by adapting CUDA-enabled sliding correlation. To further improve the scalability of CU-MSD, the implementation on multiple GPUs is proposed as well. Various optimization techniques are used to maximize the performance. The performance of CU-MSD is evaluated by an MSD-based demodulation for PCM/FM telemetry system. Four data sets from a real aerospace PCM/FM integrated baseband system were used in our experiments. The experimental results demonstrate up to 133.3×\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times $$\end{document} speedup using a single GPU and 514.64×\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times $$\end{document} speedup using 4 GPUs in a single server.
引用
收藏
页码:2111 / 2131
页数:20
相关论文
共 31 条
  • [21] Multi-GPU Accelerated Admittance Method for High-Resolution Human Exposure Evaluation
    Xiong, Zubiao
    Feng, Shi
    Kautz, Richard
    Chandra, Sandeep
    Altunyurt, Nevin
    Chen, Ji
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2015, 62 (12) : 2920 - 2930
  • [22] Multi-GPU Island-Based Genetic Algorithm for Solving the Knapsack Problem
    Jaros, Jiri
    2012 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2012,
  • [23] A Multi-GPU Parallel Genetic Algorithm For Large-Scale Vehicle Routing Problems
    Abdelatti, Marwan
    Sodhi, Manbir
    Sendag, Resit
    2022 IEEE HIGH PERFORMANCE EXTREME COMPUTING VIRTUAL CONFERENCE (HPEC), 2022,
  • [24] An OpenMP-CUDA Implementation of Multilevel Fast Multipole Algorithm for Electromagnetic Simulation on Multi-GPU Computing Systems
    Guan, Jian
    Yan, Su
    Jin, Jian-Ming
    IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2013, 61 (07) : 3607 - 3616
  • [25] Locality-aware Thread Block Design in Single and Multi-GPU Graph Processing
    Fan, Quan
    Chen, Zizhong
    2021 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, ARCHITECTURE AND STORAGE (NAS), 2021, : 148 - 151
  • [26] A New Parallel Frequency-Domain Finite-Difference Algorithm Using Multi-GPU
    Wang, Yijing
    He, Xinbo
    Wei, Bin
    IEEE MICROWAVE AND WIRELESS TECHNOLOGY LETTERS, 2024, 34 (08): : 971 - 974
  • [27] A Multi-GPU Design for Large Size Cryo-EM 3D Reconstruction
    Wang, Zihao
    Wan, Xiaohua
    Liu, Zhiyong
    Fan, Qianshuo
    Zhang, Fa
    Tan, Guangming
    2021 IEEE 35TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2021, : 847 - 858
  • [28] Fast multi-processor multi-GPU based algorithm of tomographic inversion for 3D image reconstruction
    Bajpai, Manish
    Gupta, Phalguni
    Munshi, Prabhat
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2015, 29 (01): : 64 - 72
  • [29] Empirical Performance Evaluation of Communication Libraries for Multi-GPU based Distributed Deep Learning in a Container Environment
    Choi, HyeonSeong
    Kim, Youngrang
    Lee, Jaehwan
    Kim, Yoonhee
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (03): : 911 - 931
  • [30] GPU-Aware MPI on RDMA-Enabled Clusters: Design, Implementation and Evaluation
    Wang, Hao
    Potluri, Sreeram
    Bureddy, Devendar
    Rosales, Carlos
    Panda, Dhabaleswar K.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (10) : 2595 - 2605