Comb to Pipeline: Fast Software Encryption Revisited

被引:10
作者
Bogdanov, Andrey [1 ]
Lauridsen, Martin M. [1 ]
Tischhauser, Dinar [1 ]
机构
[1] Tech Univ Denmark, DTU Compute, DK-2800 Lyngby, Denmark
来源
FAST SOFTWARE ENCRYPTION, FSE 2015 | 2015年 / 9054卷
关键词
AES-NI; pclmulqdq; Haswell; Authenticated encryption; CAESAR; CBC; OFB; CFB; CMAC; CCM; GCM; OCB3; OTR; CLOC; COBRA; JAMBU; SILC; McOE-G; COPA; POET; Julius; PERFORMANCE;
D O I
10.1007/978-3-662-48116-5_8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
AES-NI, or Advanced Encryption Standard New Instructions, is an extension of the x86 architecture proposed by Intel in 2008. With a pipelined implementation utilizing AES-NI, parallelizable modes such as AES-CTR become extremely efficient. However, out of the four non-trivial NIST-recommended encryption modes, three are inherently sequential: CBC, CFB, and OFB. This inhibits the advantage of using AES-NI significantly. Similar observations apply to CMAC, CCM and a great deal of other modes. We address this issue by proposing the comb scheduler - a fast scheduling algorithm based on an efficient look-ahead strategy, featuring a low overhead - with which sequential modes profit from the AES-NI pipeline in real-world settings by filling it with multiple, independent messages. We apply the comb scheduler to implementations on Haswell, Intel's latest microarchitecture, for a wide range of modes. We observe a drastic speed-up of factor 5 for NIST's CBC, CFB, OFB and CMAC performing around 0.88 cpb. Surprisingly, contrary to the entire body of previous performance analysis, the throughput of the authenticated encryption (AE) mode CCM gets very close to that of GCM and OCB3, with about 1.64 cpb (vs. 1.63 cpb and 1.51 cpb, resp.), despite Haswell's heavily improved binary field multiplication. This suggests CCM as an AE mode of choice as it is NIST-recommended, does not have any weak-key issues like GCM, and is royalty-free as opposed to OCB3. Among the CAESAR contestants, the comb scheduler significantly speeds up CLOC/SILC, JAMBU, and POET, with the mostly sequential nonce-misuse resistant design of POET, performing at 2.14 cpb, becoming faster than the well-parallelizable COPA. Finally, this paper provides the first optimized AES-NI implementations for the novel AE modes OTR, CLOC/SILC, COBRA, POET, McOE-G, and Julius.
引用
收藏
页码:150 / 171
页数:22
相关论文
共 37 条
  • [1] Pipelineable On-line Encryption
    Abed, Farzaneh
    Fluhrer, Scott
    Forler, Christian
    List, Eik
    Lucks, Stefan
    McGrew, David
    Wenzel, Jakob
    [J]. FAST SOFTWARE ENCRYPTION, FSE 2014, 2015, 8540 : 205 - 223
  • [2] Akdemir K., 2010, White paper
  • [3] APE: Authenticated Permutation-Based Encryption for Lightweight Cryptography
    Andreeva, Elena
    Bilgin, Begul
    Bogdanov, Andrey
    Luykx, Atul
    Mennink, Bart
    Mouha, Nicky
    Yasuda, Kan
    [J]. FAST SOFTWARE ENCRYPTION, FSE 2014, 2015, 8540 : 168 - 186
  • [4] COBRA: A Parallelizable Authenticated Online Cipher Without Block Cipher Inverse
    Andreeva, Elena
    Luykx, Atul
    Mennink, Bart
    Yasuda, Kan
    [J]. FAST SOFTWARE ENCRYPTION, FSE 2014, 2015, 8540 : 187 - 204
  • [5] Andreeva E, 2013, LECT NOTES COMPUT SC, V8269, P424, DOI 10.1007/978-3-642-42033-7_22
  • [6] [Anonymous], SILC SIMPLE LIGHTWEI
  • [7] [Anonymous], 2007, TECHNICAL REPORT
  • [8] Aoki K., 2012, DIAC 2012 DIRECTIONS
  • [9] Bahack L., JULIUS SECURE MODE O
  • [10] Bernstein DJ, 2008, LECT NOTES COMPUT SC, V5365, P322, DOI 10.1007/978-3-540-89754-5_25