Data Parallel Implementation of Belief Propagation in Factor Graphs on Multi-core Platforms

被引:1
作者
Ma, Nam [1 ]
Xia, Yinglong [2 ]
Prasanna, Viktor K. [3 ]
机构
[1] Univ So Calif, Dept Comp Sci, Los Angeles, CA 90089 USA
[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
[3] Univ So Calif, Ming Hsieh Dept Elect Engn, Los Angeles, CA 90089 USA
基金
美国国家科学基金会;
关键词
Data parallelism; Belief propagation; Factor graph; Multi-core; NUMA;
D O I
10.1007/s10766-013-0246-9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We investigate data parallel techniques for belief propagation in acyclic factor graphs on multi-core systems. Belief propagation is a key inference algorithm in factor graph, a probabilistic graphical model that has found applications in many domains. In this paper, we explore data parallelism for basic operations over the potential tables in belief propagation. Data parallel techniques for these table operations are developed for shared memory platforms. We then propose a complete belief propagation algorithm using these table operations to perform exact inference in factor graphs. The proposed algorithms are implemented on state-of-the-art multi-socket multi-core systems with additional NUMA-aware optimizations. Our proposed algorithms exhibit good scalability using a representative set of factor graphs. On a four-socket Intel Westmere-EX system with 40 cores, we achieve 39.5 speedup for the table operations and 39 speedup for the complete algorithm using factor graphs with large potential tables.
引用
收藏
页码:219 / 237
页数:19
相关论文
共 28 条
  • [1] Agarwal V., 2010, P 22 IEEE ACM SUP C
  • [2] [Anonymous], 1992, Introduction to Parallel Algorithms
  • [3] [Anonymous], INT WORKSH ECML PKDD
  • [4] [Anonymous], NEUR INF PROC SYST N
  • [5] SCALABLE DATA-PARALLEL ALGORITHMS FOR TEXTURE SYNTHESIS USING GIBBS RANDOM-FIELDS
    BADER, DA
    JALA, J
    CHELLAPPA, R
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 1995, 4 (10) : 1456 - 1460
  • [6] Buyya R., 1999, HIGH PERFORMANCE CLU, V1
  • [7] Coarfa Cristian., 2005, Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming, PPoPP '05, P36, DOI DOI 10.1145/1065944.1065950
  • [8] Massively LDPC Decoding on Multicore Architectures
    Falcao, Gabriel
    Sousa, Leonel
    Silva, Vitor
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2011, 22 (02) : 309 - 322
  • [9] Gonzalez J., 2009, C UNC ART INT UAI MO
  • [10] Karkooti M., 2004, P INT C INF TECHN CO