期刊文献+

数据与计算平台是驱动当代科学研究发展的重要基础设施 被引量:24

The Data and Computing Platform is An Important Infrastructure Which Drives Modern Scientific Research Development
下载PDF
导出
摘要 【目的】为表明数据与计算平台在科学研究活动中的重要驱动作用,本文研究了数据、计算以及科学研究的发展与本质。【方法】本文简述了数据技术和计算技术的发展,通过拓扑材料计算、计算化学、引力波发现、黑洞成像和半监督学习图像识别等典型案例,表明了在各领域科研活动中,数据与计算平台极大地拓展了科学研究的深度和广度,为当代科学研究提供了新的手段与方法。【结果】本文认为摩尔定律的驱动、大数据爆炸式的增长以及人工智能的再次蓬勃发展,都和数据与计算技术的发展呈现密不可分的关系。【结论】以大数据、人工智能技术为代表的数据与计算平台将作为科学研究一种独立、不可或缺的投入要素,融入科学研究活动的全过程,数据与计算平台将成为世界各国驱动现代科学研究发展的重要基础设施。 [Objective]To demonstrate the important driving role of data and computing in scientific research,this paper studies the development and essence of data,computing,and scientific research.[Methods]This paper expounds the essence o f data technology and computing technology.Several typical cases are used to illustrate that data and computing,viewed as one of the keys to scientific activities,greatly expand the depth and breadth of scientific and technological innovation research.[Results]Data are quantitative or qualitative records of natural,social phenomena and scientific experiments,which is the important for scientific research.The technology of data refers to a series of scientific and technological activities such as collecting,classifying,transporting,storing,analyzing and visualizing data.Its goal is to turn data into information,knowledge and pattern for human beings to understand the natural world and human society.Since the invention of von Neumann computer,the rapid development of computing technology driven by Moore's law has made the research and application of data technology and artificial intelligence more active.The collaborative progress of data technology,artificial intelligence and computing technology has brought about a leap forward in human understanding of natural and social knowledge and pattern.[Conclusions]Therefore,data technology,computing technology and artificial intelligence provide the most basic technology platform for building the"digital twin"body of the"human-machineobject"ternary fusion.Advanced data and computing technology,represented by big data and artificial intelligence technology,will integrate theory,experiment and simulation,and form a new scientific research paradigm.In the development process of scientific research in the past thousand years,human,capital,tools(scientific instruments)and methods(theories)have become the necessary input elements of scientific research.In the past decades,computer has aided scientists to carry out a lot of calculation work.It becomes a type of tools of the input elements.However,with the rapid development of data and computing technology,data and computing technology not only plays an auxiliary and supporting role in scientific research,but also can rely on the logical method of themselves to carry out scientific research in the"digital twin"body of"human-machine-object"three-dimensional integration.Therefore,data and computing technology will be regard as a new and indispensable input elements of scientific research.
作者 廖方宇 洪学海 汪洋 褚大伟 Liao Fangyu;Hong Xuehai;Wang Yang;Chu Dawei(Computer Network Information Center,Chinese Academy of Sciences,Beijing 100190,China;Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China;Chinese Academy of Sciences,Beijing 100864,China)
出处 《数据与计算发展前沿》 2019年第1期2-10,共9页 Frontiers of Data & Computing
基金 中国科学院战略研究与决策支持系统建设专项项目“教据与计算平台——重大创新领域战略规划研究”(GHJZLZX-2018-10,GHJ-ZLZX-2019-10)。
关键词 数据与计算平台 数据技术 计算技术 人工智能 data technology computing technology artificial intelligence scientific research paradigm
  • 相关文献

参考文献2

二级参考文献30

  • 1李国杰.大数据研究的科学价值[J].中国计算机学会通讯,2012,8(9):8-15.
  • 2李国杰,程学旗.赵国栋,等.2014中国大数据技术与产业发展报告[M].北京:机械工业出版社,2013:6-11.
  • 3周慧.国家发改委:资金支持大数据重大建设项目[EB/OL].2016-01-20[2016-04-08].http://news.hexun.com/2016-01-20/181906965.html.
  • 4Apache H. What is apache hadoop?[EB/OL]. 2013-08-26[2016-04-13]. http://hadoop.apache.org.
  • 5Dean J, Ghemawat S. MapReduce: Simplified data processing on large cluster[J]. Communications of the ACM, 2008, 51(1): 107-113.
  • 6Zaharia M, Chowdhury M, Das T, et al. Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing[C]//Proceedings of the 9th USENIX Conference on Networked Systems Design and hnplementation. Berkeley, CA: USENIX Association, 2012: 141-146.
  • 7Lublinsky B, Smith K T, Yakubovich A. Professional hadoop solutions[M]. Birmingham: Wrox Press, 2013.
  • 8Gartner Research Report. Magic quadrant for data quality tools [EB/OL]. [2016-04-12]. http://useready.com/wp-contenffuploads/2013/07/Gartner-Data- Quality-2012.pdf.
  • 9Gonzalez J E, Low Y, Gu H, et al. Powergraph: Distributed graph-parallel computation on natural graphs[C]//Pmceedings of the 10th USENIX Sympo- sium on Operating Systems Design and Implementation. Berkeley, CA: USENIX Association, 2012: 17-30.
  • 10Engle C, Lupher A, Xin R, et al. Shark: Fast data analysis using coarse-grained distributed memory[C]//Proceedings of the 2012 ACM SIGMOD Interna- tional Conference on Management. New York: ACM, 2012.

共引文献55

同被引文献196

引证文献24

二级引证文献65

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部