A Scalable Turbo Decoding Algorithm for High-Throughput Network-on-Chip Implementation

被引:4
作者
Al-Dujaily, Ra'ed
Li, An
Maunder, Robert G.
Mak, Terrence [2 ]
Al-Hashimi, Bashir M. [2 ]
Hanzo, Lajos [1 ]
机构
[1] Univ Southampton, Sch Elect & Comp Sci, Southampton Wireless, Southampton SO17 1BJ, Hants, England
[2] Univ Southampton, Elect & Software Syst Res Grp, Southampton SO17 1BJ, Hants, England
基金
英国工程与自然科学研究理事会;
关键词
Turbo codes; BCJR; network-on-chip; performance evaluation;
D O I
10.1109/ACCESS.2016.2628801
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Wireless communication at near-capacity transmission throughputs is facilitated by employing sophisticated Error Correction Codes (ECCs), such as turbo codes. However, real-time communication at high transmission throughputs is only possible if the challenge of implementing turbo decoders having equally high processing throughputs can be overcome. Furthermore, in many applications, turbo decoders are required to have the flexibility of supporting a wide variety of turbo code parametrizations. This motivates the implementation of turbo decoders using networks-on-chip (NoCs), which facilitate flexible and high-throughput parallel processing. However, turbo decoders conventionally operate on the basis of the Logarithmic Bahl-Cocke-Jelinek-Raviv (Log-BCJR) algorithm, which has an inherently serial nature, owing to its data dependencies. This limits the exploitation of the NoC's computing resources, particularly as the size of the NoC is scaled up. Motivated by this, we propose a novel turbo decoder algorithm, which eliminates the data dependencies of the Log-BCJR algorithm and, therefore, has an inherently parallel nature. We show that by jointly optimizing the proposed algorithm with the NoC architecture, a significantly improved utility of the available computing resources is achieved. Owing to this, our proposed turbo decoder achieves a factor of up to 2.13 higher processing throughput than a Log-BCJR bench marker.
引用
收藏
页码:9880 / 9894
页数:15
相关论文
共 35 条
[1]  
Abbasfar A, 2004, IEEE MILIT COMMUN C, P169
[2]  
Ahonen T., 2004, Proceedings of the 2004 International Workshop on System Level Interconnect Prediction, P53
[3]   Embedded Transitive Closure Network for Runtime Deadlock Detection in Networks-on-Chip [J].
Al-Dujaily, Ra'ed ;
Mak, Terrence ;
Xia, Fei ;
Yakovlev, Alexandre ;
Palesi, Maurizio .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2012, 23 (07) :1205-1215
[4]  
Ascia G, 2008, IEEE T COMPUT, V57, P809, DOI [10.1109/TC.2008.38, 10.1109/TC.2007.38]
[5]   Networks on chips: A new SoC paradigm [J].
Benini, L ;
De Micheli, G .
COMPUTER, 2002, 35 (01) :70-+
[6]   20 Years of Turbo Coding and Energy-Aware Design Guidelines for Energy-Constrained Wireless Applications [J].
Brejza, Matthew F. ;
Li, Liang ;
Maunder, Robert G. ;
Al-Hashimi, Bashir M. ;
Berrou, Claude ;
Hanzo, Lajos .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2016, 18 (01) :8-28
[7]   The odd-even turn model for adaptive routing [J].
Chiu, GM .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2000, 11 (07) :729-738
[8]   VLSI Implementation of a Multi-Mode Turbo/LDPC Decoder Architecture [J].
Condo, Carlo ;
Martina, Maurizio ;
Masera, Guido .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2013, 60 (06) :1441-1454
[9]  
Condo C, 2012, DES AUT TEST EUROPE, P1525
[10]  
Dally W.J., 2004, Morgan Kaufmann Series in Computer Architecture and Design