期刊文献+

参数并行:一种基于群启发式算法的机器学习参数寻优方法 被引量:9

Parallel Parameters:An Optimization Method of Machine Learning Parameters Based on Swarm Heuristic Algorithm
下载PDF
导出
摘要 为了提高机器学习算法超参数寻优效率,提出了一种基于参数并行机制的机器学习参数寻优方法。该方法通过群启发式算法来进行机器学习算法的参数寻优,将种群转换为Spark平台特有的弹性分布式数据集,针对参数寻优耗时特点并行计算种群中个体适应度。选取随机森林和遗传算法作为实验算法,设计了多组实验对所提出的学习训练方法进行验证。实验结果表明:该方法的参数寻优能力和效率都优于主流的网格搜索算法;在20万条以下的小数据量下,与基于数据并行机制的机器学习参数寻优方法相比,该方法运行时间最多能够减少69.5%,并具有良好的可扩展性。 In order to improve the super parameter optimization efficiency of machine learning algorithm,a machine learning parameter optimization method based on parameter parallel mechanism was proposed.The group heuristic algorithm was used to optimize the parameters of the machine learning algorithm,the population was transformed into the unique elastic distributed data set of Spark platform,and the individual fitness in the population was calculated in parallel according to the time-consuming characteristics of parameter optimization.Random forest and genetic algorithm were selected as experimental algorithms,and several groups of experiments were designed to verify the proposed learning and training method.The experimental results show that the parameter optimization ability and efficiency of this method are better than the mainstream grid search algorithm.Compared with the machine learning parameter optimization method based on data parallel mechanism,the running time of this method can be reduced by 69.5%at most,and has good scalability.
作者 杨艳艳 李雷孝 林浩 王永生 王慧 高静 YANG Yan-yan;LI Lei-xiao;LIN Hao;WANG Yong-sheng;WANG Hui;GAO Jing(College of Data Science and Application, Inner Mongolia University of Technology, Hohhot 010080, China;Inner Mongolia Autonomous Region Engineering & Technology Research Center of Big Data Based Software Service, Hohhot 010080, China;College of Computer and Information Engineering, Inner Mongolia Agricultural University, Hohhot 010010, China)
出处 《科学技术与工程》 北大核心 2022年第5期1972-1980,共9页 Science Technology and Engineering
基金 内蒙古自治区科技成果转化资金(2020CG0073) 内蒙古自治区科技重大专项(2019ZD015,2019ZD016) 内蒙古自治区关键技术攻关计划(2019GG273,2020GG0094) 内蒙古高等学校科学研究项目(NJZY21317) 内蒙古工业大学科学研究重点项目(ZZ202017)。
关键词 参数寻优 群启发式算法 SPARK 参数并行 机器学习算法 parameter optimization swarm heuristic algorithm Spark parallel parameters machine learning algorithm
  • 相关文献

参考文献15

二级参考文献115

  • 1陈跃华,曹广益,朱新坚.PEMFC的Elman神经网络建模与模糊神经网络控制[J].能源技术,2005,26(4):146-149. 被引量:3
  • 2Chang C C, Lin C J. LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2011,2 (3) : 75--102.
  • 3Zaharia M, Chowdhury M, Das T, et al. Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing. In- Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation. Berkeley: USENIX Association, 2012,2 -- 16.
  • 4Iehihashi H, Honda K, Notsu A. Comparison of scaling behavior between fuzzy c-means based classifier with many parameters and LibSVM. Fuzzy Systems,2011,35(2) :386--393.
  • 5Joseph S M, Hameed A. Online handwritten malaya[am character recognition using LIBSVM in matlab. Australian Computer Society, 2014, 15(1) :21--25.
  • 6郑哗,李剑.Scala程序设计.北京:人民邮电出版社,2010,1—196.
  • 7黄海旭,高宇翔.Scala编程.北京:电子工业出版社,2010,30-278.
  • 8仲志丹,朱新坚,任远.PEMFC的PSO优化LS-SVM动态建模仿真[J].计算机仿真,2008,25(2):248-251. 被引量:1
  • 9陈帅,朱建宁,潘俊,侍洪波.最小二乘支持向量机的参数优化及其应用[J].华东理工大学学报(自然科学版),2008,34(2):278-282. 被引量:53
  • 10孙继昌.中国的水库大坝安全管理[J].中国水利,2008(20):10-14. 被引量:92

共引文献264

同被引文献75

引证文献9

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部