Efficient Low-Latency Hardware Architecture for Module-Lattice-Based Digital Signature Standard

被引:1
作者
Truong, Quang Dang [1 ]
Duong, Phap Ngoc [1 ,2 ]
Lee, Hanho [1 ]
机构
[1] Inha Univ, Dept Elect & Comp Engn, Incheon 22212, South Korea
[2] Univ Danang, Vietnam Korea Univ Informat & Commun Technol, Fac Comp Engn & Elect, Da Nang 50000, Vietnam
关键词
Computer architecture; Digital signatures; Standards; NIST; Arithmetic; Low latency communication; Quantum computing; Cryptography; Lattices; Public key cryptography; Field programmable gate arrays; Security management; Hardware security; Post-quantum cryptography (PQC); module-lattice-based digital signature standard (ML-DSA); crystals-Dilithium; lattice-based cryptography (LBC); number theoretic transform (NTT);
D O I
10.1109/ACCESS.2024.3370470
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid advancement of powerful quantum computers poses a significant security risk to current public-key cryptosystems, which heavily rely on the computational complexity of problems such as discrete logarithms and integer factorization. As a result, CRYSTALS-Dilithium, a lattice-based digital signature scheme with the potential to be an alternative algorithm that can withstand both quantum and classical attacks, has been standardized as ML-DSA after NIST Post-Quantum Cryptography competition. While prior studies have proposed hardware designs to accelerate this cryptosystem, there is room for further optimization in the tradeoff between performance and hardware consumption. This paper addresses these limitations by presenting an efficient low-latency hardware architecture for ML-DSA, leveraging optimized timing schedules for its three main algorithms. The hardware implementation enables runtime switching main operations in ML-DSA with various security levels. We design flexible arithmetic and hash modules tailored for ML-DSA, the most time-consuming submodules and key determinants of the scheme implementation. Combined with efficient operation scheduling to maximize the utilized time of submodules, our design achieves the best latency among FPGA-based implementations, outperforming stateof-the-art works by 1.27 similar to 2.58x in terms of the area-time tradeoff metric. Therefore, the proposed hardware architecture demonstrates its practical applicability for digital signature cryptosystems in post-quantum era.
引用
收藏
页码:32395 / 32407
页数:13
相关论文
共 28 条
  • [1] : A Crystal for Post-Quantum Security Using Kyber and Dilithium
    Aikata, Aikata
    Mert, Ahmet Can
    Imran, Malik
    Pagliarini, Samuel
    Roy, Sujoy Sinha
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2023, 70 (02) : 747 - 758
  • [2] Alagic G., 2022, Status report on the third round of the NIST post-quantum cryptography standardization process, DOI DOI 10.6028/NIST.IR.8413-UPD1
  • [3] [Anonymous], 2023, Federal Information Processing Standards Publication (FIPS) NIST FIPS186-5, DOI [10.6028/NIST.FIPS.186-5, DOI 10.6028/NIST.FIPS.186-5]
  • [4] [Anonymous], 2014, document UG573 (v1.1)
  • [5] Bai Shi., 2020, CRYSTALS-Dilithium Algorithm Specifications and Supporting Documentation
  • [6] Becker H., 2021, IACR Trans. Cryptogr. Hardw. Embedded Syst., p2021/986, DOI [10.46586/tches.v2022.i1.221-244, 10.46586/tches.v2022, DOI 10.46586/TCHES.V2022]
  • [7] Beckwith L., 2021, P INT C FIELD PROGR, P1
  • [8] Bertoni G., 2020, Keccak VHDL
  • [9] Bruinderink L.G., 2018, IACR T CRYPTOGRAPHIC, P21, DOI [DOI 10.13154/TCHES.V2018.I3.21-43, 10.13154/tches.v2018.i3.21-43]
  • [10] Area-Efficient Number Theoretic Transform Architecture for Homomorphic Encryption
    Duong-Ngoc, Phap
    Kwon, Sunmin
    Yoo, Donghoon
    Lee, Hanho
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2023, 70 (03) : 1270 - 1283