期刊文献+

信牌驱动式Web数据采集模型的应用 被引量:4

Applications of XINPAI-driven Web data scraping model
下载PDF
导出
摘要 针对数据源复杂、实时性强、准确性高和数据类型多样的Web空间环境数据采集任务,提出了一个基于Petri网的信牌驱动式Web数据采集模型。首先,通过引入Petri网的基本要素作为模型的理论基础,研究适合于Web数据采集的建模方法;在此基础上,针对模型的具体应用验证,研究了空间环境数据采集任务服务系统(SEDGSS)的架构设计,对数据源配置子系统、任务控制子系统和任务处理子系统进行具体的实现。实验结果表明,该模型实现了自动化机制和回溯校验机制,并具有良好的易配置性、可重用性和扩展灵活性;该系统7×24小时实时抓取254个复杂的数据源任务,目前正承担着自动化、业务化的空间环境数据采集任务以服务于我国空间环境预报。 In order to scrap the space environment data which is complex,real-time,accurate and diverse,an XINPAIdriven Web scraping model based on Petri net was proposed.Firstly,by intruducing basic elements of Petri net as the theoretical foundation,a modeling method for Web data scraping was investigated.Then,to verify this model,the architecture of Space Environment Data Gather Service System( SEDGSS) was designed.Simultaneously,data source configuring subsystem,task controlling subsystem and task processing subsystem were implemented.The experimental results show that,this model shows automated mechanism and backtracking mechanism,and possesses easy configurability,reusability and expansion flexibility.At the same time,254 complex data sources are scraped in real time and the system undertakes the automatic task of scraping space environment data for forecast.
出处 《计算机应用》 CSCD 北大核心 2016年第A01期252-256,共5页 journal of Computer Applications
基金 装备技术基础项目(ZKKZX20141ZL01) 中科院高技术局项目(YYYJ-1110-01)
关键词 空间环境数据 PETRI网 信牌驱动式 Web数据采集模型 空间环境预报 space environment data Petri net XINPAI-driven Web data scraping model space environment forecast
  • 相关文献

参考文献12

  • 1叶宗海,都亨,龚建村.中国的空间环境研究与空间环境预报[J].地球物理学进展,1999,14(S1):20-29. 被引量:3
  • 2齐鹏,李隐峰,宋玉伟.基于Python的Web数据采集技术[J].电子科技,2012,25(11):118-120. 被引量:32
  • 3CALIFF M E, MOONEY R J. Relational learning of pattern-match rules for information extraction [ C]/! AAAI'99/IAAI'99: Proceed-ings of the Sixteenth National Conference on Artificial Intelligence. Menlo Park: American Association for Artificial Intelligence, 1999: 328 - 334.
  • 4KUSHMERICK N. Wrapper induction: Efficiency and expressive- ness[J]. Artificial Intelligence, 2000, 118(1): 15-68.
  • 5BAUMGARTNER R, FLESCA S, GOTI'LOB G. Visual Web infor- mation extraction with lixto[ C]//VLDB'01: Proceedings of the 27th International Conference on Very Large Data Bases. San Francisco: Morgan Kaufmann Publishers, 2001:119 - 128.
  • 6BASAK O, ALBAYRAK Y E. Petri net based decision system mod- eling in real-time scheduling and control of flexible automotive man- ufacturing systems[ J]. Computers & Industrial Engineering, 2014, 86:116 - 126.
  • 7DENARO G, PEZZE M. Petri nets and software engineering[ C]// Lectures on Concurrency and Petri Nets, LNCS 3098. Heidelberg: Springer Berlin, 2004:439-466.
  • 8TOSIC M, MANIC M. A RESTful technique for collaborative learn- ing content transclusion by Wiki-style mashups[ C]//Proceedings of the 2011 5th IEEE International Conference on E-Learning in Indus- trial Electronics. Piscataway: 1EEE, 2011:38 -43.
  • 9GLEZ-PENA D, LOUREN(O A, LOPEZ-FEMANDEZ H, et al. Web scraping technologies in an API world[ J]. Briefings in bioin- formaties, 2014, 15(5): 788-797.
  • 10HE W. System and method for synchronized Web scraping: U.S. Patent 20140351091[ P]. 2014 - 11 -27.

二级参考文献15

  • 1叶宗海,都亨.中国的空间环境研究[J].地球物理学报,1997,40(S1):429-441. 被引量:9
  • 2王世金,林华安.一种新的太阳质子事件警报方法的探讨[J].空间科学学报,1993,13(3):215-223. 被引量:8
  • 3古士芬,师立勤,臧振群.空间等离子体引起的高电压太阳阵之弧光放电[J].空间科学学报,1995,15(2):131-136. 被引量:6
  • 4Yue Xiaoli,Proc of theInt’l Symp on Future Software Technology,1998年,339页
  • 5Yu Lei,Proc of the ER-97Workshop on Behavioral Modeling and DesignTransformations:Issue,1997年
  • 6赫特兰.Python基础教程[M].2版.北京:人民邮电出版社,2010.
  • 7丘恩.Python核心编程[M].2版.北京:人民邮电出版社,2008.
  • 8鲁特兹.Python学习手册[M].北京:机械工业出版社,2009.
  • 9都亨,叶宗海.低轨道航天器空间环境手册[M]国防工业出版社,1996.
  • 10《人造地球卫星环境手册》编写组.人造地球卫星环境手册[M]国防工业出版社,1971.

共引文献59

同被引文献30

引证文献4

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部