Towards Efficient Kyber on FPGAs: A Processor for Vector of Polynomials

2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)(2020)

引用 27|浏览59
暂无评分
摘要
Kyber is a promising candidate in post-quantum cryptography standardization process. In this paper, we propose a targeted optimization strategy and implement a processor for Kyber on FPGAs. By merging the operations, we cut off 29.4% clock cycles for Kyber512 and 33.3% for Kyber1024 compared with the textbook implementations. We utilize Gentlemen-Sande (GS) butterfly to optimize the Number-Theoretic Transform (NTT) implementation. The bottleneck of memory access is broken taking advantage of a dual-column sequential scheme. We further propose a pipeline architecture for better performance. The optimizations help the processor achieve 31684 NTT operations per second using only 477 LUTs, 237 FFs and 1 DSP. Our strategy is at least 3x more efficient than the state-of-the-art module for NTT with a similar security level.
更多
查看译文
关键词
DSP,flip flops,look-up table,LUT,pipeline architecture,dual-column sequential scheme,memory access,Kyber processor,polynomial vector,number-theoretic transform,Gentlemen-Sande butterfly,Kyber1024,Kyber512,clock cycles,optimization strategy,post-quantum cryptography standardization process,FPGA,31684 NTT operations
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要