Design and Implementation of a Fast Fully Homomorphic Encryption Algorithm Based on GPU Acceleration

Abstract: Fully homomorphic encryption (FHE) supports algebraic operations performed directly on encrypted data (ciphertext). However, the Number Theoretic Transform (NTT) at the core of ciphertext evaluation involves a large number of additions and multiplications over high-dimensional integer-coefficient polynomial rings, which limits the use of FHE in privacy computing. To address the low parallelism of CPU implementations of the NTT, a CPU+GPU heterogeneous implementation of the CKKS fully homomorphic encryption scheme is proposed. First, based on the memory access pattern of the NTT, a strategy for staging data in shared memory is designed: repeatedly read data, including NTT input data and twiddle (rotation) factors, are kept in shared memory to reduce frequent global memory accesses. Second, to address the partially idle threads that variable data sizes cause within a kernel, thread workloads are allocated dynamically according to data size and hardware resources, and butterfly structures of different radices are adopted, which makes data input more flexible and optimizes the parallel strategy. Third, a hybrid single-kernel/multi-kernel invocation mode is proposed, which switches the kernel invocation mode dynamically according to the butterfly group size in each NTT iteration, in order to fully exploit the parallel potential of multi-kernel invocation on the GPU. Finally, an NTT algorithm with higher parallelism and lower computational complexity is designed and implemented for the GPU, this algorithm is used to perform parallel homomorphic multiplication, and a CPU+GPU heterogeneous CKKS fully homomorphic encryption algorithm is implemented on top of the HElib library. Experimental results show that, compared with HElib using AVX-512 acceleration, the proposed scheme reduces NTT/INTT computation time by nearly 65%.
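To make the role of the NTT in the abstract concrete, the following is a minimal CPU-side sketch in Python of a radix-2 iterative NTT/INTT and of polynomial multiplication modulo X^n + 1, the kind of ring that CKKS-style schemes work in. It is not the paper's GPU algorithm and does not use HElib: the modulus `P = 7681`, the length `n = 8`, and all function names are assumptions chosen for illustration. Within each stage of the `ntt` loop the butterflies are independent of one another, which is the parallelism a GPU implementation would presumably distribute across threads.

```python
# Illustrative sketch only: tiny parameters, plain Python, no GPU, no HElib.
P = 7681  # small NTT-friendly prime (7681 = 15 * 2**9 + 1), assumed for the demo


def find_root_of_unity(order, p=P):
    """Return an element of multiplicative order `order` (a power of two) mod p."""
    assert (p - 1) % order == 0
    for g in range(2, p):
        w = pow(g, (p - 1) // order, p)
        # w**order == 1 always; w**(order/2) == -1 forces the order to be exact.
        if pow(w, order // 2, p) == p - 1:
            return w
    raise ValueError("no root of unity found")


def bit_reverse_permute(a):
    """Return a copy of `a` reordered by bit-reversed index."""
    n = len(a)
    out = list(a)
    j = 0
    for i in range(1, n):
        bit = n >> 1
        while j & bit:
            j ^= bit
            bit >>= 1
        j ^= bit
        if i < j:
            out[i], out[j] = out[j], out[i]
    return out


def ntt(a, root, p=P):
    """Iterative radix-2 NTT; `root` must be a primitive len(a)-th root of unity."""
    n = len(a)
    a = bit_reverse_permute(a)
    length = 2
    while length <= n:                      # one pass per butterfly stage
        w_len = pow(root, n // length, p)   # twiddle step for this stage
        for start in range(0, n, length):   # independent butterfly groups
            w = 1
            for k in range(length // 2):
                u = a[start + k]
                v = a[start + k + length // 2] * w % p
                a[start + k] = (u + v) % p
                a[start + k + length // 2] = (u - v) % p
                w = w * w_len % p
        length *= 2
    return a


def intt(a, root, p=P):
    """Inverse NTT: forward transform with the inverse root, then scale by 1/n."""
    n_inv = pow(len(a), p - 2, p)
    res = ntt(a, pow(root, p - 2, p), p)
    return [x * n_inv % p for x in res]


def negacyclic_mul(a, b, p=P):
    """Multiply two polynomials modulo (X^n + 1, p) using the NTT."""
    n = len(a)
    psi = find_root_of_unity(2 * n, p)      # primitive 2n-th root, psi**n == -1
    omega = psi * psi % p                   # primitive n-th root
    psi_inv = pow(psi, p - 2, p)
    a_t = [x * pow(psi, i, p) % p for i, x in enumerate(a)]
    b_t = [x * pow(psi, i, p) % p for i, x in enumerate(b)]
    c_hat = [x * y % p for x, y in zip(ntt(a_t, omega, p), ntt(b_t, omega, p))]
    c_t = intt(c_hat, omega, p)
    return [x * pow(psi_inv, i, p) % p for i, x in enumerate(c_t)]


def negacyclic_mul_naive(a, b, p=P):
    """Quadratic-time reference used only to check the NTT-based product."""
    n = len(a)
    c = [0] * n
    for i in range(n):
        for j in range(n):
            k = i + j
            if k < n:
                c[k] = (c[k] + a[i] * b[j]) % p
            else:                           # X^n wraps around to -1
                c[k - n] = (c[k - n] - a[i] * b[j]) % p
    return c


if __name__ == "__main__":
    a = [1, 2, 3, 4, 5, 6, 7, 8]
    b = [8, 7, 6, 5, 4, 3, 2, 1]
    print(negacyclic_mul(a, b) == negacyclic_mul_naive(a, b))
```

The NTT-based product costs O(n log n) modular operations against O(n^2) for the naive reference, which is why the transform dominates the cost of homomorphic multiplication at realistic ring dimensions.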

Authors: TAN Zejiu; ZHAO Xin; WAN Junping; LIU Hucheng; JIANG Lin; XU Jinming; JI Shouling; WANG Xuan (School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, Shenzhen 518055, China; College of Control Science and Engineering, Zhejiang University, Hangzhou 310058, China; College of Computer Science and Technology, Zhejiang University, Hangzhou 310058, China; Pengcheng National Laboratory, Shenzhen 518000, China; Guangdong Key Laboratory of New Security and Intelligence Technology, Shenzhen 518005, China)

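Affiliations: School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen); College of Control Science and Engineering, Zhejiang University; College of Computer Science and Technology, Zhejiang University; Pengcheng National Laboratory; Guangdong Key Laboratory of New Security and Intelligence Technology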

Source: Journal of Cybersecurity (《网络空间安全科学学报》), 2024, No. 3, pp. 41-52 (12 pages)

Funding: National Key Research and Development Program of China (2022YFB3102100).

Keywords: privacy computing; fully homomorphic encryption; GPU parallel acceleration; number theoretic transform; caching strategy; convolutional neural networks

Classification number: TN918.4 [Electronics and Telecommunications: Communication and Information Systems]

