High performance hardware architecture for singular spectrum analysis of Hankel tensors

被引:3
|
作者
Huang, Wei-pei [1 ]
Kwan, Bowen P. Y. [1 ]
Ding, Weiyang [2 ]
Min, Biao [1 ]
Cheung, Ray C. C. [1 ]
Qi, Liqun [2 ]
Yan, Hong [1 ]
机构
[1] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
[2] Hong Kong Polytech Univ, Dept Appl Math, Hong Kong, Peoples R China
关键词
Hardware architecture; Hankel tensor; Tucker decomposition(TKD); Higher-order singular value decomposition (HOSVD); MULTILINEAR-ALGEBRA; DECOMPOSITION;
D O I
10.1016/j.micpro.2018.10.004
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a hardware architecture for singular spectrum analysis of Hankel tensors, including computation of tucker decomposition, tensor reconstruction and final Hankelization, In the proposed design, we explore two level of optimization. First, in algorithm level, we optimize the calculation process by exploiting the Hankel property to reduce the computation complexity and on-chip BRAM resource usage. Secondly, in hardware level, parallelism is explored for acceleration. Resource sharing is applied to reduce look-up tables (LUTs) usage. To enable flexibility, the number of processing elements (PEs) can be changed through parameter setting. Our proposed design is implemented on Field-Programmable Gate Arrays (FPGAs) to process third order tensors. Experiment results show that our design achieve a speed-up from 172 to 1004 compared with CPU implementation via Intel MKL and 5 to 40 compared with GPU implementation. (C) 2018 Published by Elsevier B.V.
引用
收藏
页码:120 / 127
页数:8
相关论文
共 50 条
  • [21] A High Performance Reconfigurable Hardware Architecture for Lightweight Convolutional Neural Network
    An, Fubang
    Wang, Lingli
    Zhou, Xuegong
    ELECTRONICS, 2023, 12 (13)
  • [22] High Performance Hardware Architecture of Linear Filters for Intelligent Video Processing
    Chen, Tse-Wei
    Chien, Shao-Yi
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2008, 9TH PACIFIC RIM CONFERENCE ON MULTIMEDIA, 2008, 5353 : 834 - +
  • [23] High-Performance Reconfigurable Hardware Architecture for Restricted Boltzmann Machines
    Le Ly, Daniel
    Chow, Paul
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2010, 21 (11): : 1780 - 1792
  • [24] Performance of ultrasound echo decomposition using singular spectrum analysis
    Pereira, WCD
    Maciel, CD
    ULTRASOUND IN MEDICINE AND BIOLOGY, 2001, 27 (09): : 1231 - 1238
  • [25] Algorithm performance dependent on hardware architecture
    Kuester, U.
    Lammers, P.
    COMPUTATIONAL SCIENCE AND HIGH PERFORMANCE COMPUTING II, 2006, 91 : 303 - +
  • [26] An architecture for high performance network analysis
    Risso, F
    Degioanni, L
    PROCEEDINGS OF THE SIXTH IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, 2001, : 686 - 693
  • [27] EXPLOITING SYMMETRY IN TENSORS FOR HIGH PERFORMANCE: MULTIPLICATION WITH SYMMETRIC TENSORS
    Schatz, Martin D.
    Low, Tze Meng
    van de Geijn, Robert A.
    Kolda, Tamara G.
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2014, 36 (05): : C453 - C479
  • [28] A high performance architecture for computing the time-frequency spectrum
    Abdullah Al-Shoshan
    Mohammad AlOqeely
    Circuits, Systems and Signal Processing, 2000, 19 : 437 - 450
  • [29] A high performance architecture for computing the time-frequency spectrum
    Al-Shoshan, A
    AlOqeely, M
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2000, 19 (05) : 437 - 450
  • [30] A High Performance Hardware Architecture for Multi-Frame Hierarchical Motion Estimation
    Ho, Huong
    Klepko, Robert
    Ninh, Nam
    Wang, Demin
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2011, 57 (02) : 794 - 801