PreNTT: A Parallel Acceleration Method for Number Theoretic Transform Computation in zk-SNARK

Abstract: Zero-knowledge succinct non-interactive arguments of knowledge (zk-SNARKs) are widely used in cryptocurrencies and many other fields because proof verification is simple and fast. Proof generation, however, remains computationally complex and time-consuming, which limits further adoption. To address the number theoretic transform (NTT), the main computational bottleneck in zk-SNARK proof generation, this paper proposes PreNTT, a GPU-based NTT acceleration method. First, it proposes a precomputation-based parallel NTT method that uses precomputation and an optimized algorithm for twiddle factor powers to reduce the overhead of parallel NTT computation, and combines this with dynamic precomputation to further improve NTT efficiency. Second, dynamic adaptive kernel scheduling allocates on-chip GPU resources according to the NTT input size, improving the computational efficiency of large-scale NTT tasks. Third, global data shuffling outside the kernel is combined with local data shuffling inside the kernel to avoid memory access conflicts. Finally, CUDA multi-stream technology is used to run data transfer and computation, effectively hiding the precomputation time. Experimental results show that, compared with Bellperson, the state-of-the-art industry system, a zk-SNARK system built on PreNTT achieves a speedup of at least 1.7x and up to 9x in NTT module running time across all input scales. PreNTT effectively increases the parallelism of the NTT algorithm and reduces the time overhead of zk-SNARK computation.
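The precomputation idea in the abstract can be illustrated with a small CPU reference. This is a minimal sketch, not the paper's GPU kernels: the modulus `P = 998244353`, the primitive root `G = 3`, and all function names are assumptions chosen for illustration. It builds a table of twiddle factor powers once, so each butterfly does a table lookup instead of a modular exponentiation, and it applies a bit-reversal shuffle before the butterfly stages.

```python
# Illustrative CPU sketch only; parameters and names are assumptions,
# not taken from the PreNTT paper.
P = 998244353   # NTT-friendly prime: P - 1 = 2^23 * 7 * 17
G = 3           # a primitive root modulo P


def precompute_twiddles(n):
    """Return [w^0, ..., w^(n/2 - 1)] for a primitive n-th root of unity w."""
    w = pow(G, (P - 1) // n, P)
    table = [1] * (n // 2)
    for i in range(1, n // 2):
        table[i] = table[i - 1] * w % P
    return table


def bit_reverse_shuffle(a):
    """Return a copy of a in bit-reversed index order (len(a) is a power of two)."""
    n = len(a)
    bits = n.bit_length() - 1
    out = [0] * n
    for i, x in enumerate(a):
        rev = int(format(i, f"0{bits}b")[::-1], 2) if bits else 0
        out[rev] = x
    return out


def ntt(a, twiddles):
    """Iterative radix-2 decimation-in-time NTT using the precomputed table."""
    n = len(a)
    a = bit_reverse_shuffle(a)
    length = 2
    while length <= n:
        half = length // 2
        stride = n // length              # table step for this stage
        for start in range(0, n, length):
            for j in range(half):
                w = twiddles[j * stride]  # lookup replaces a modular power
                u = a[start + j]
                v = a[start + j + half] * w % P
                a[start + j] = (u + v) % P
                a[start + j + half] = (u - v) % P
        length *= 2
    return a


def naive_ntt(a):
    """O(n^2) reference transform used only to check the fast version."""
    n = len(a)
    w = pow(G, (P - 1) // n, P)
    return [sum(a[j] * pow(w, i * j, P) for j in range(n)) % P for i in range(n)]
```

In this sketch the table depends only on the transform size, so it can be built once and reused across transforms of that size. On a GPU, the independent butterflies within each stage are the natural unit of parallel work.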

Authors: Ding Dong; Li Zhengquan; Chai Zhilei

Affiliations: School of Internet of Things Engineering, Jiangnan University, Wuxi, Jiangsu 214122, China; School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, Jiangsu 214122, China; State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China

Source: Application Research of Computers (《计算机应用研究》), CSCD, Peking University Core Journal, 2024, No. 10, pp. 3059-3067 (9 pages)

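Funding: National Natural Science Foundation of China (61972180); Open Project of the State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications (SKLNST-2023-1-13).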

Keywords: zero-knowledge succinct non-interactive argument of knowledge (zk-SNARK); number theoretic transform (NTT); GPU; parallel computing acceleration

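Classification: TP311 [Automation and Computer Technology: Computer Software and Theory]; TP309.7 [Automation and Computer Technology: Computer System Architecture]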
