Distributed-Memory Parallel JointNMF

Cited by: 1
Authors
Eswar, Srinivas [1]
Cobb, Benjamin [2]
Hayashi, Koby [2]
Kannan, Ramakrishnan [3]
Ballard, Grey [4]
Vuduc, Richard [2]
Park, Haesun [2]
Affiliations
[1] Argonne Natl Lab, Lemont, IL 60439 USA
[2] Georgia Inst Technol, Sch Computat Sci & Engn, Atlanta, GA 30332 USA
[3] Oak Ridge Natl Lab, Oak Ridge, TN USA
[4] Wake Forest Univ, Dept Comp Sci, Winston Salem, NC 27101 USA
Funding
National Science Foundation (United States); United States Department of Energy
Keywords
High Performance Computing; Multimodal Inputs; Nonnegative Matrix Factorization; Nonnegative Matrix; Communication; MPI
DOI
10.1145/3577193.3593733
Chinese Library Classification number
TP301 [Theory and Methods]
Discipline classification code
081202
Abstract
Joint Nonnegative Matrix Factorization (JointNMF) is a hybrid method for mining information from datasets that contain both feature and connection information. We propose distributed-memory parallelizations of three algorithms for solving the JointNMF problem based on Alternating Nonnegative Least Squares, Projected Gradient Descent, and Projected Gauss-Newton. We extend well-known communication-avoiding algorithms using a single processor grid case to our coupled case on two processor grids. We demonstrate the scalability of the algorithms on up to 960 cores (40 nodes) with 60% parallel efficiency. The more sophisticated Alternating Nonnegative Least Squares (ANLS) and Gauss-Newton variants outperform the first-order gradient descent method in reducing the objective on large-scale problems. We perform a topic modelling task on a large corpus of academic papers that consists of over 37 million paper abstracts and nearly a billion citation relationships, demonstrating the utility and scalability of the methods.
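Illustrative sketch (not part of the record)
The abstract does not state the objective function, so the following single-process NumPy sketch assumes a commonly used JointNMF formulation: minimize ||X - WH||_F^2 + alpha * ||S - H^T H||_F^2 over nonnegative W and H, where X holds the features and S the connections. It shows one Projected Gradient Descent step only. The function names, the weight `alpha`, and the fixed step size are hypothetical, and nothing here reflects the paper's distributed-memory implementation or its processor grids.

```python
import numpy as np


def jointnmf_objective(X, S, W, H, alpha):
    """Assumed objective: ||X - W H||_F^2 + alpha * ||S - H^T H||_F^2."""
    feature_term = np.linalg.norm(X - W @ H, "fro") ** 2
    connection_term = np.linalg.norm(S - H.T @ H, "fro") ** 2
    return feature_term + alpha * connection_term


def pgd_step(X, S, W, H, alpha, step):
    """One projected gradient step: descend, then clip negatives to zero."""
    R = W @ H - X      # feature residual, shape (m, n)
    E = H.T @ H - S    # connection residual, shape (n, n)
    grad_W = 2.0 * R @ H.T
    grad_H = 2.0 * W.T @ R + 2.0 * alpha * H @ (E + E.T)
    W_new = np.maximum(W - step * grad_W, 0.0)
    H_new = np.maximum(H - step * grad_H, 0.0)
    return W_new, H_new


if __name__ == "__main__":
    # Tiny synthetic problem: m features, n items, rank k (all hypothetical).
    rng = np.random.default_rng(0)
    m, n, k = 6, 8, 3
    X = rng.random((m, n))
    S = rng.random((n, n))
    S = (S + S.T) / 2.0  # symmetric connection matrix
    W, H = rng.random((m, k)), rng.random((k, n))
    before = jointnmf_objective(X, S, W, H, alpha=0.5)
    W, H = pgd_step(X, S, W, H, alpha=0.5, step=1e-4)
    after = jointnmf_objective(X, S, W, H, alpha=0.5)
    print(before, after)
```

The ANLS and Gauss-Newton variants named in the abstract would replace the update rule in `pgd_step` while keeping the same objective under this assumption.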
Pages: 301-312
Page count: 12