期刊文献+

大数据机器学习系统研究进展 被引量:51

Research Progress on Big Data Machine Learning System
下载PDF
导出
摘要 要实现高效的大数据机器学习,需要构建一个能同时支持机器学习算法设计和大规模数据处理的一体化大数据机器学习系统。研究设计高效、可扩展且易于使用的大数据机器学习系统面临诸多技术挑战。近年来,大数据浪潮的兴起,推动了大数据机器学习的迅猛发展,使大数据机器学习系统成为大数据领域的一个热点研究问题。介绍了国内外大数据机器学习系统的基本概念、基本研究问题、技术特征、系统分类以及典型系统;在此基础上,进一步介绍了本实验室研究设计的一个跨平台统一大数据机器学习系统——Octopus(大章鱼)。 To achieve efficient big data machine learning, we need to construct a unified big data machine learning system to support both machine learning algorithm design and big data processing. Designing an efficient, scalable and easy-to-use big data machine learning system still faces a number of challenges. Recently, the upsurge of big data technology has promoted rapid development of big data machine learning, making big data machine learning system to become a research hotspot. The basic concepts, research issues, technical characteristics, categories, and typical systems for big data machine learning system, were reviewed. Then a unified and cross-platform big data machine learning system, Octopus, was presented.
作者 黄宜华
出处 《大数据》 2015年第1期28-47,共20页 Big Data Research
基金 江苏省科技支撑计划基金资助项目(No.BE2014131)~~
关键词 大数据 机器学习 分布并行计算 大数据处理平台 big data, machine learning, distributed and parallel computing, big data processing platform
  • 相关文献

参考文献27

  • 1Banko M, Brill E. Scaling to very large corpora for natural language disambiguation. Proceedings of the 39th Annual Meeting on Association for Computational Linguistics (ACL), Toulouse, France, 2001:26-33.
  • 2Brants T, Popat C A, Xu P, et al. Large language models in machine translation. Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Language Learning. Prague. Czech Republic, 2007:858-867.
  • 3Wang Y, Zhao X M, Sun Z L, et al. Peacock: learning long-tail topic features for industrial applications. ACM Transactions on Intelligent Systems and Technology, 2014, 9(4).
  • 4CCF Task Force on Big Data. Forecast for the development trend of big data in 2015. Communications of the China Computer Federation (CCCF), 2015, 11(1): 48-52.
  • 5Gonzalez J E. Emerging systems for large-scale machine learning. Proceedings of Tutorial on International Conference for Machine Learning(ICML) 2014, Beijing, China, 2014.
  • 6CCF Task Force on Big Data. White paper of China's big data technology and industrial development in 2014. Proceedings of Big Data Conference China, Beijing, China, 2014.
  • 7中国计算机学会大数据专家委员会.2015年中国大数据发展趋势预测.中国计算机学会通讯,2015,11(1):48-52.
  • 8中国计算机学会大数据专家委员会.2014年中国大数据技术与产业发展白皮书.2014中国大数据技术大会,北京,中国,2014.
  • 9Boehm M, Tatikonda S, Reinwald B, et al. Hybrid parallelization strategies for large-scale machine learning in systemML. Proceedings of the VLDB Endowment, Hangzhou, China, 2014.
  • 10Markl V. Breaking the chains: on declarative data analysis and data independence in the big data era. Proceedings of the VLDB Endowment, Hangzhou, China, 2014.

同被引文献420

引证文献51

二级引证文献479

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部