期刊文献+

用MapReduce实现天文星表交叉认证

MapReduce for astronomical cross-matching
下载PDF
导出
摘要 天文星表的交叉认证是天文研究中非常重要的基础工作。新巡天项目和更强大望远镜的投入使用,使天文数据爆炸增长,数据量的增加使得两个星表之间的交叉认证变得非常耗时。描述了如何利用MapReduce实现并行天文星表交叉认证,综合考虑了算法与体系结构的匹配问题,并给出了在大数据天文星表交叉认证工作的性能评估,通过与广泛使用的PostgreSQL数据库的比较,证明了基于MapReduce交叉认证方法的有效性。 As a basic and indispensable step,the astronomical cross-match is facing a data avalanche. With the completion of new sky survey projects and powerful telescopes,current cross-matching methods cannot be performed on demand for large scale astronomical data sets. This paper introduced MapReduce framework to solve this problem. It carefully considered the mapping of cross-matching algorithm on map and reduce phases. Performance evaluation shows that the MapReduce-based cross-matching can outperform the traditional one on PostgreSQL. As the knowledge,it is the first effort to adopt MapReduce for astronomical cross-matching problem.
出处 《计算机应用研究》 CSCD 北大核心 2010年第10期3740-3743,共4页 Application Research of Computers
基金 北京市自然科学基金资助项目(1052008)
关键词 映射化简 交叉认证 并行 大规模 MapReduce cross-matching parallel large-scale
  • 相关文献

参考文献11

  • 1DJORGOVSKI S G, BRUNNER R J. Astronomical archives of the future: a virtual observatory[J]. Future Generation Computer Systems, 1999, 16(1):63-72.
  • 2CUI Chen- zhou, ZHAO Yong- heng. Worldwide R&D of virtual observatory [ J ]. Proceedings of the International Astronomical Union, 2007, 3:563-564.
  • 3Viewing the heavens through the cloud [EB/OL]. [2009- 12- 14]. http ://ssg. astro. washington. edu/research. shtml? research/CluE1.
  • 4ZHAO Qing, SUN Ji-zhou, YU Ce, et al. A paralleled large-scale astronomical cross-matching function [ C ]//Proc of Lecture Notes in Computer Science, vol 5574. 2009:604-614.
  • 5高丹,张彦霞,赵永恒.中国虚拟天文台交叉证认工具的开发和应用[J].天文学报,2008,49(3):348-358. 被引量:5
  • 6CGP. Report on cross matching catalogues [ EB/OL]. (2003-09-29) [2009- 12- 14]. http://wiki. astrogrid. org/pub/Astrogrid/DataFe- derationandDataMining/cross. htm.
  • 7POWER R. Cross match simulation [ CP/OL ]. (2007-04-23) [ 2009- 12-14 ]. http ://www. ict. csiro, au/staff/robert.power/projects/CM/ ps/cm. htm.
  • 8O' MALLEY O. TeraByte sort on Apache Hadoop [ EB/OL]. (2008- 05 ) [ 2009-12-14 ]. http ://sortbenehmark. org/YahooHadoop. pdf.
  • 9DEAN J, GHEMAWAT S. MapReduce: simplified data processing on large clusters[J]. Communications of the ACM, 2008, 51 (1) : 107-113.
  • 10CUTRI R M, SKRUTSKIE M F, VAN DYK S, et al. 2MASS all sky catalog of point sources, the IRSA 2MASS all-sky point source catalog, NASA/IPAC infrared science archive [ EB/OL]. (2003) [2009-12-14]. http://irsa. ipac. caltech, edu/applications/Gator/.

二级参考文献17

  • 1[1][LAMOST的科学目标]http://www.lamost.org/xoops/modules/wfchannel/index.php?pagenum=3
  • 2[2]Szalay A,Gray J.Science,2001,293:203
  • 3[4]VizieR]http://vizier.u-strasbg.fr
  • 4[5][Simbad]http://simbad.u-strasbg.fr
  • 5[6][Aladin]http://aladin.u-strasbg.fr
  • 6[7][VIZIER Search]http://archive.stsci.edu/vizier.php
  • 7[8]Ortiz P F,Ochsenbein F,Wicenec A et al.ESO/CDS Data-mining Tool Development Project[C].In:ASP Conf.Ser.(Vol.172).1999.379-382
  • 8[9][NED Batch Jobs]http://nedwww.ipac.caltech.edu/help/batch.html
  • 9[10][OpenSkyQuery]http://openskyquery.net
  • 10[11][TOPCAT]http://www.star.bris.ac.uk/~mbt/topcat/

共引文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部