Abstract
Parallel computing techniques are widely used to further optimize specific problems, dramatically reducing the running time of an algorithm. In recent years, with the rapid development of big data and artificial intelligence, time consumption has become an important consideration when training large-scale deep learning models. Because the training samples are mutually independent, the training process of a model can be optimized well with parallel techniques. In this paper, we use the most basic linear regression as the model's task to test the feasibility of parallelization methods for deep learning models, and compare the performance gains obtained with different numbers of nodes. The time complexity of the proposed parallel training method is O(m/k × P + k × ε); based on this complexity, an appropriate parallelization strategy can be chosen according to the scale of the problem to be solved.
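The abstract does not define the symbols in O(m/k × P + k × ε); a natural reading is that m is the number of training samples, k the number of parallel nodes, P the per-sample computation cost, and ε the per-node communication/aggregation overhead. Under that assumption, the Python sketch below (our own illustration with hypothetical names such as partial_gradient and parallel_gd, not code from the paper) shows data-parallel gradient descent for linear regression: each of k worker processes handles a shard of roughly m/k samples, and the master sums the k partial gradients.

# Minimal data-parallel sketch of the training scheme the abstract describes
# (an illustration under our own assumptions, not the paper's code):
# the m samples are split across k worker processes, each worker computes a
# partial gradient over its m/k samples (the m/k x P term), and the master
# sums the k partial results (the k x epsilon aggregation term).
import numpy as np
from multiprocessing import Pool


def partial_gradient(args):
    """Gradient of 0.5 * ||X w - y||^2 over one data shard."""
    X_shard, y_shard, w = args
    residual = X_shard @ w - y_shard
    return X_shard.T @ residual


def parallel_gd(X, y, k=4, lr=0.5, steps=50):
    """Full-batch gradient descent with gradients computed on k processes."""
    m, d = X.shape
    w = np.zeros(d)
    X_shards = np.array_split(X, k)   # k shards of roughly m/k samples each
    y_shards = np.array_split(y, k)
    with Pool(processes=k) as pool:
        for _ in range(steps):
            grads = pool.map(
                partial_gradient,
                [(Xs, ys, w) for Xs, ys in zip(X_shards, y_shards)],
            )
            w -= lr * sum(grads) / m  # aggregate the k partial gradients
    return w


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(10_000, 5))
    y = X @ np.arange(1.0, 6.0) + 0.01 * rng.normal(size=10_000)
    print(parallel_gd(X, y, k=4))  # recovers approximately [1, 2, 3, 4, 5]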
Author
Zhu Jiayi (College of Computer Science and Engineering, Southwest Minzu University, Chengdu 610225)
Source
Modern Computer, 2022, No. 14, pp. 42-48 (7 pages)
Keywords
parallel computing
machine learning
deep learning
optimization