Implementing High-performance Complex Matrix Multiplication via the 3m and 4m Methods

被引:12
|
作者
Van Zee, Field G. [1 ,2 ]
Smith, Tyler M. [1 ]
机构
[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
[2] Univ Texas Austin, Inst Computat Engn & Sci, Austin, TX 78712 USA
来源
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE | 2017年 / 44卷 / 01期
基金
美国国家科学基金会;
关键词
Linear algebra; DLA; high-performance; complex; matrix; multiplication; micro-kernel; kernel; BLAS; BLIS; 3m; 4m; induced; STABILITY;
D O I
10.1145/3086466
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this article, we explore the implementation of complex matrix multiplication. We begin by briefly identifying various challenges associated with the conventional approach, which calls for a carefully written kernel that implements complex arithmetic at the lowest possible level (i.e., assembly language). We then set out to develop a method of complex matrix multiplication that avoids the need for complex kernels altogether. This constraint promotes code reuse and portability within libraries such as Basic Linear Algebra Subprograms and BLAS-Like Library Instantiation Software (BLIS) and allows kernel developers to focus their efforts on fewer and simpler kernels. We develop two alternative approaches-one based on the 3M method and one that reflects the classic 4M formulation-each with multiple variants, all of which rely only on real matrix multiplication kernels. We discuss the performance characteristics of these "induced" methods and observe that the assembly-level method actually resides along the 4M spectrum of algorithmic variants. Implementations are developed within the BLIS framework, and testing on modern hardware confirms that while the less numerically stable 3M method yields the fastest runtimes, the more stable (and thus widely applicable) 4M method's performance is somewhat limited due to implementation challenges that appear inherent in nature.
引用
收藏
页数:36
相关论文
共 9 条
  • [1] IMPLEMENTING HIGH-PERFORMANCE COMPLEX MATRIX MULTIPLICATION VIA THE 1M METHOD
    Van Zee, Field G.
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2020, 42 (05) : C221 - C244
  • [2] Passivated Perovskite Crystallization via g-C3N4 for High-Performance Solar Cells
    Jiang, Lu-Lu
    Wang, Zhao-Kui
    Li, Meng
    Zhang, Cong-Cong
    Ye, Qing-Qing
    Hu, Ke-Hao
    Lu, Ding-Ze
    Fang, Peng-Fei
    Liao, Liang-Sheng
    ADVANCED FUNCTIONAL MATERIALS, 2018, 28 (07)
  • [3] High-performance Co x M 3-x AlO y (M=Ni, Mn) catalysts derived from microwave -assisted synthesis of hydrotalcite precursors for methane catalytic combustion
    Fan, Xionghua
    Li, Luming
    Yang, Xiushan
    Guo, Zhanglong
    Jing, Fangli
    Chu, Wei
    CATALYSIS TODAY, 2020, 347 : 23 - 30
  • [4] Insights into LiMXO4F (M-X = Al-P and Mg-S) as Cathode Coatings for High-Performance Lithium-Ion Batteries
    Liu, Bo
    Shi, Xiaowen
    Gu, Lanhui
    Wen, Yufeng
    Sun, Xinyuan
    Ruan, Wen
    Zhou, Shenlin
    Jiang, Daguo
    Han, Xiang
    Zhang, Wenqing
    Shi, Siqi
    ACS APPLIED MATERIALS & INTERFACES, 2022, 14 (39) : 44859 - 44868
  • [5] Fabrication of a high-performance nano-structured Ln1-xSrxMO3-δ (Ln = La, Sm; M = Mn, Co, Fe) SOC electrode through infiltration
    Wu, Tong
    Yu, Bo
    Zhang, Wenqiang
    Chen, Jing
    Zhao, Suoqi
    RSC ADVANCES, 2016, 6 (72) : 68379 - 68387
  • [6] Ca2La(MS4)(BS3) (M = Ge/Si and Sn/Si): high-performance infrared nonlinear optical materials designed using an atomic site co-occupancy strategy
    Han, Ya-Xiang
    Hu, Chun-Li
    Mao, Jiang-Gao
    INORGANIC CHEMISTRY FRONTIERS, 2025, 12 (05): : 1950 - 1957
  • [7] High-performance CsPbBr3@Cs4PbBr6/SiO2 nanocrystals via double coating layers for white light emission and visible light communication
    Li, Xianwen
    Ma, Wen
    Liang, Dehai
    Cai, Wensi
    Zhao, Shuangyi
    Zang, Zhigang
    ESCIENCE, 2022, 2 (06): : 646 - 654
  • [8] Tailoring High-Performance Ternary Transition Metal-Based NASICON-Na(9-2x-3y-4z)Mn x V y Zr z (PO4)3 Cathodes via Combinatorial Chemical Substitutions
    Patra, Biplab
    Kumar, Keshav
    Ghosh, Subham
    Mazumder, Madhulika
    Pati, Swapan K.
    Senguttuvan, Premkumar
    CHEMISTRY OF MATERIALS, 2024, 36 (07) : 3107 - 3119
  • [9] Modulating the structure of O3-type NaNi 1/3 Fe 1/3 Mn 1/3 O 2 for high-performance sodium-ion batteries via Na2MoO4 reactive wetting coating combined with Mo doping and interface reconstruction
    Song, Miaoyan
    Xu, Lin
    Wang, Kemeng
    Chen, Guohu
    Tang, Zhaohong
    Zhou, Kaiwen
    Wu, Wenwei
    Wu, Xuehang
    JOURNAL OF COLLOID AND INTERFACE SCIENCE, 2025, 684 : 148 - 158