期刊文献+

基于Spark Streaming的明安图射电频谱日像仪实时数据处理 被引量:4

Real-Time Data Processing in Mingantu Ultrawide Spectral Radio Heliograph Based on Spark Streaming
下载PDF
导出
摘要 目前天文观测中对数据的实时处理需求越来越多,性能要求也越来越高,我国明安图射电频谱日像仪(Mingant U Sp Ectral Radioheliograph,MUSER)是同时以高时间、高空间和高频率分辨率对太阳进行射电频谱成像的设备。在低频部分的日常观测中,包含了两方面的需求:(1)对历史数据的处理;(2)5秒钟抽样观测数据的处理。抽样观测数据需要实时处理,并在监控终端显示,数据处理过程包含了数据校验、修正、成图、洁化等多个步骤,传统的单机处理模式已无法满足大数据量下的实时性要求。因此,实时数据计算中,使用Spark Streaming流式计算这一新兴的分布式计算方法,设计了自定义的接收器,并将多个图形处理器节点加入到分布式集群中。通过实验对性能进行评估,结果证明基于内存的高速执行引擎的特点能显著提高性能。期待能通过实验进一步优化算法和配置,获得更好的结果,并最终运用到实际环境中。 There is a growing demand for real-time processing in astronomical observations in recent years, meanwhile, the requirement for performance is also increasing dramatically. Mingantu Ultrawide Spectral Radio Heliograph (MUSER) is a synthetic aperture radio interferometer with high temporal, spatial and spectral resolution. In daily observation of low frequency, MUSER contains two aspects of data processing, historical data processing and sampling observational data which is produced every 5 seconds and processed in real-time mode. The procedure of raw data processing contains validation, correction, clean and other processing steps, then the results need to be transmitted in real-time mode to monitoring end without user constantly refreshing or sending a request. The traditional stand-alone processing mode has been unable to meet the requirements of large amounts of data in real-time mode. In this paper, we explored the use of Spark Streaming in a new approach for MUSER real-time calculations across multiple machines and evaluated its effectiveness and efficiency. A customized receiver was created for real-time binary stream of MUSER. We also extended the Spark cluster by adding multiple GPU's nodes. The experiments have shown that Spark Streaming can significantly improve MUSER real-time processing performance for its memory-based execution engine. We might look forward to optimize the algorithm through experiments and configurations so as to obtain better results, and apply it to the actual environment of MUSER finally.
出处 《天文研究与技术》 CSCD 2017年第4期421-428,共8页 Astronomical Research & Technology
基金 国家自然科学基金(11403009 U1231205)资助
关键词 MUSER 射电天文 SPARK 流式计算 实时计算 MUSER Radio astronomy Spark Streaming computing Real-time computing
  • 相关文献

参考文献3

二级参考文献62

  • 1姬国枢,窦玉江,王威,刘飞,陈志军,张坚,颜毅华.CSRH模拟接收机设计[J].天文研究与技术,2006,3(2):135-142. 被引量:6
  • 2张坚,颜毅华,刘飞,王威.用于双天线干涉实验的数字相关接收机[J].天文研究与技术,2006,3(2):148-153. 被引量:2
  • 3陈志军,颜毅华,刘玉英,张坚,王威.关于中国厘米—分米波频谱日像仪(CSRH)选址与无线电环境监测[J].天文研究与技术,2006,3(2):168-175. 被引量:10
  • 4Wikipedia. Cloud computing [ EB/OL ]. (2007-03-03) [ 2008-12- 20]. http ://en. wikipedia, org/wiki/Cloud computing.
  • 5Wikipedia. John McCarthy ( computer scientist) [ EB/OL]. (2008- 10-07) [2008-12-10]. http://en. wikipcdia, org/wiki/John_McCarthy_(computer_scientist).
  • 6IBM, C, oogle and IBM announced university initiative to address intemetscale computing challenges [EB/OL]. (2007-10-08) [2008-10-15]. http ://www-03. ibm. com/press/us/en/pressrelease/22414. wss.
  • 7HEWITT C. ORGs for scalable, robust privacy-friendly client cloud computing [ J]. IEEE Intemet Computing, 2008,12 (5) :96- 99.
  • 8WANG Li-zhe, TAO Jie, KUNZE M. Scientific cloud computing: early definition and experience[ C ]//Proc of the 10th IEEE International Conference on High Performance Computing and Communications. 2008:825- 830.
  • 9BUYYA R, YEO C S, VENUGOPAL S. Market-oriented cloud computing: vision, hype, and reality for delivering IT services as computing utilities[ C]//Proc of the 10th IEEE International Conference on High Performance Computing and Communications. 2008:5- 13.
  • 10ARMBRUST M, FOX A, GRIFFITH R, etal. Above the clouds:a Berkeley view of cloud computing[ R/OL]. (2009-02-10) [2009-05- 15 ]. http ://www. grid. pku. edu. cn/cloud/Berkeley-abovetheclouds. pdf.

共引文献921

同被引文献15

引证文献4

二级引证文献13

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部