CUDA-based Parallel Implementation of IBM Word Alignment Algorithm for Statistical Machine Translation

被引:0
作者
Jing, Si-Yuan [1 ]
Yan, Gao-Rong [2 ]
Chen, Xing-Yuan [1 ]
Jin, Peng [1 ]
Guo, Zhao-Yi [1 ]
机构
[1] Leshan Normal Univ, Sch Comp Sci, Leshan, Peoples R China
[2] Leshan Normal Univ, Sch Foreign Language, Leshan, Peoples R China
来源
2016 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT) | 2016年
关键词
Word Alignment; GPU; Parallel Computation; Expectation-Maximization Algorithm; CUDA;
D O I
10.1109/PDCAT.2016.49
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Word alignment is a basic task in natural language processing and it usually serves as the starting point when building a modern statistical machine translation system. However, the state-of-art parallel algorithm for word alignment is still time-consuming. In this work, we explore a parallel implementation of word alignment algorithm on Graphics Processor Unit (GPU), which has been widely available in the field of high performance computing. We use the Compute Unified Device Architecture (CUDA) programming model to re-implement a state-of-the-art word alignment algorithm, called IBM Expectation-Maximization (EM) algorithm. A Tesla K40M card with 2880 cores is used for experiments and execution times obtained with the proposed algorithm are compared with a sequential algorithm and a multi-threads algorithm on an IBM X3850 server, which has two Intel Xeon E7 CPUs (2.0GHz * 10 cores). The best experimental results show a 16.8-fold speedup compared to the multi-threads algorithm and a 234.7-fold speedup compared to the sequential algorithm.
引用
收藏
页码:189 / 194
页数:6
相关论文
共 15 条
  • [1] [Anonymous], 2010, PROGRAMMING MASSIVEL
  • [2] [Anonymous], 2010, Statistical Machine Translation
  • [3] [Anonymous], 1996, P 16 C COMP LING, DOI [DOI 10.3115/993268.993313, 10.3115/993268.993313]
  • [4] [Anonymous], 2014, COMPUTER SCI
  • [5] Brown P.F., 1993, COMPUT LINGUIST, V19, P2
  • [6] Gao Q, 2008, ACL-08 HLT-Softw. Eng. Testing, Qual. Assur. Nat. Lang. Process, P49, DOI 10.3115/1622110.1622119
  • [7] NVIDIA Tesla: A unified graphics and computing architecture
    Lindholm, Erik
    Nickolls, John
    Oberman, Stuart
    Montrym, John
    [J]. IEEE MICRO, 2008, 28 (02) : 39 - 55
  • [8] Improving Statistical Machine Translation Using Bayesian Word Alignment and Gibbs Sampling
    Mermer, Coskun
    Saraclar, Murat
    Sarikaya, Ruhi
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (05): : 1090 - 1101
  • [9] Och FJ, 2000, 38TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, P440
  • [10] Optimization Principles and Application Performance Evaluation of a Multithreaded GPU Using CUDA
    Ryoo, Shane
    Rodrigues, Christopher I.
    Baghsorkhi, Sara S.
    Stone, Sam S.
    Kirk, David B.
    Hwu, Wen-mei W.
    [J]. PPOPP'08: PROCEEDINGS OF THE 2008 ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, 2008, : 73 - 82