CUDA-based Parallel Implementation of IBM Word Alignment Algorithm for Statistical Machine Translation

Cited by: 0

Authors:

Jing, Si-Yuan [1]

Yan, Gao-Rong [2]

Chen, Xing-Yuan [1]

Jin, Peng [1]

Guo, Zhao-Yi [1]

Affiliations:

[1] Leshan Normal University, School of Computer Science, Leshan, People's Republic of China

[2] Leshan Normal University, School of Foreign Language, Leshan, People's Republic of China

Source:

2016 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT) | 2016

Keywords:

Word Alignment; GPU; Parallel Computation; Expectation-Maximization Algorithm; CUDA;

DOI:

10.1109/PDCAT.2016.49

Chinese Library Classification:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

Word alignment is a basic task in natural language processing, and it usually serves as the starting point when building a modern statistical machine translation system. However, the state-of-the-art parallel algorithm for word alignment is still time-consuming. In this work, we explore a parallel implementation of a word alignment algorithm on the Graphics Processing Unit (GPU), which is now widely available in high-performance computing. We use the Compute Unified Device Architecture (CUDA) programming model to re-implement a state-of-the-art word alignment algorithm, the IBM Expectation-Maximization (EM) algorithm. Experiments use a Tesla K40M card with 2880 cores, and the execution times of the proposed algorithm are compared with those of a sequential algorithm and a multi-threaded algorithm on an IBM X3850 server, which has two Intel Xeon E7 CPUs (2.0 GHz × 10 cores). The best experimental results show a 16.8-fold speedup over the multi-threaded algorithm and a 234.7-fold speedup over the sequential algorithm.
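For orientation, the EM procedure named in the abstract can be sketched as follows. This is a minimal sequential illustration in Python, not the authors' CUDA implementation. The abstract does not say which IBM model is used, so IBM Model 1 is assumed here, and the function name `ibm_model1_em` and its parameters are hypothetical. In the E-step each sentence pair is processed independently of the others, which is the kind of data parallelism a GPU version could plausibly exploit; the paper's actual kernel design is not described in this record.

```python
from collections import defaultdict


def ibm_model1_em(pairs, iterations=10):
    """Estimate t[(f, e)] = P(f | e) with IBM Model 1 EM (assumed model).

    pairs: list of (source_tokens, target_tokens) sentence pairs.
    """
    src_vocab = {f for fs, _ in pairs for f in fs}
    init = 1.0 / len(src_vocab)
    # Uniform initialisation of the translation table.
    t = defaultdict(lambda: init)

    for _ in range(iterations):
        count = defaultdict(float)  # expected counts c(f, e)
        total = defaultdict(float)  # expected counts c(e)

        # E-step: every sentence pair contributes independently.
        for fs, es in pairs:
            for f in fs:
                z = sum(t[(f, e)] for e in es)  # normaliser for this f
                for e in es:
                    c = t[(f, e)] / z
                    count[(f, e)] += c
                    total[e] += c

        # M-step: renormalise expected counts into probabilities.
        t = defaultdict(
            float, {(f, e): c / total[e] for (f, e), c in count.items()}
        )
    return t


if __name__ == "__main__":
    corpus = [
        ("das Haus".split(), "the house".split()),
        ("das Buch".split(), "the book".split()),
        ("ein Buch".split(), "a book".split()),
    ]
    table = ibm_model1_em(corpus)
    for (f, e), p in sorted(table.items()):
        print(f"t({f} | {e}) = {p:.3f}")
```

On the toy corpus, the table should come to favour pairs such as `das`/`the` over `Haus`/`the`, since `das` co-occurs with `the` in two sentence pairs.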


Pages: 189-194

Page count: 6

