Low-Latency, Low-Area, and Scalable Systolic-Like Modular Multipliers for GF(2m) Based on Irreducible All-One Polynomials

被引:23
作者
Meher, Pramod Kumar [1 ]
Lou, Xin [2 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
关键词
Elliptic curve cryptography (ECC); error-control-coding; finite field multiplication; systolic array; very large scale integration (VLSI); BIT-PARALLEL MULTIPLIER; ARCHITECTURE;
D O I
10.1109/TCSI.2016.2614309
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, an efficient recursive formulation is suggested for systolic implementation of canonical basis finite field multiplication over GF(2(m)) based on irreducible AOP. We have derived a recursive algorithm for the multiplication, and used that to design a regular and localized bit-level dependence graph (DG) for systolic computation. The bit-level regular DG is converted into a fine-grained DG by node-splitting, and mapped that into a parallel systolic architecture. Unlike most of the existing structures, it does not involve any global communications for modular reduction. The proposed bit-parallel systolic structure has the same cycle time as that of the best existing bit-parallel systolic structure [1], but involves significantly less number of registers. The proposed bit-parallel design has a scalable latency of l + [ log(2)s] + 1 cycles which is considerably low compared with those of existing systolic designs. Moreover, the proposed time-multiplexed structure is designed specifically for scalability of throughput and hardware-complexity to meet the area-time trade-off in resource-constrained applications while maintaining or reducing the overall latency. The ASIC synthesis report shows that the proposed bit-parallel structures offers nearly 30% saving of area and nearly 38% saving of power consumption over the best of the existing AOP-based systolic finite field multiplier.
引用
收藏
页码:399 / 408
页数:10
相关论文
共 35 条
[1]  
[Anonymous], 1988, VLSI Array Processors
[2]   Low complexity bit-parallel multiplier for GF(2m) defined by all-one polynomials using redundant representation [J].
Chang, KY ;
Hong, DW ;
Cho, HS .
IEEE TRANSACTIONS ON COMPUTERS, 2005, 54 (12) :1628-1630
[3]  
Chiou-Yng Lee, 2001, ISCAS 2001. The 2001 IEEE International Symposium on Circuits and Systems (Cat. No.01CH37196), P578, DOI 10.1109/ISCAS.2001.922303
[4]   Bit-serial multiplication in GF(2m) using irreducible all-one polynomials [J].
Fenn, STJ ;
Parker, MG ;
Benaissa, M ;
Taylor, D .
IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 1997, 144 (06) :391-393
[5]   A low-complexity power-sum circuit for GF(2m) and its applications [J].
Guo, JH ;
Wang, CL .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-ANALOG AND DIGITAL SIGNAL PROCESSING, 2000, 47 (10) :1091-1097
[6]   MODULAR CONSTRUCTION OF LOW COMPLEXITY PARALLEL MULTIPLIERS FOR A CLASS OF FINITE-FIELDS GF(2(M)) [J].
HASAN, MA ;
WANG, MZ ;
BHARGAVA, VK .
IEEE TRANSACTIONS ON COMPUTERS, 1992, 41 (08) :962-971
[7]  
HASAN MA, 1991, IEEE PACIF, P211, DOI 10.1109/PACRIM.1991.160717
[8]   A COMPARISON OF VLSI ARCHITECTURE OF FINITE-FIELD MULTIPLIERS USING DUAL, NORMAL, OR STANDARD BASES [J].
HSU, IS ;
TRUONG, TK ;
DEUTSCH, LJ ;
REED, IS .
IEEE TRANSACTIONS ON COMPUTERS, 1988, 37 (06) :735-739
[9]   STRUCTURE OF PARALLEL MULTIPLIERS FOR A CLASS OF FIELDS GF(2M) [J].
ITOH, T ;
TSUJII, S .
INFORMATION AND COMPUTATION, 1989, 83 (01) :21-40
[10]   Efficient semisystolic architectures for finite-field arithmetic [J].
Jain, SK ;
Song, LL ;
Parhi, KK .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 1998, 6 (01) :101-113