Design and Implementation of a Distributed Network Log Analysis System
Abstract: As network log data accumulates at massive scale, there is a pressing need to extract valuable information from it. This paper combines distributed systems with current mainstream big data processing technology to design and implement a distributed Web log analysis system. The system combines real-time and offline computing to provide intrusion detection and operational status monitoring for a site. In addition, data mining theory is applied in the system's visitor behavior analysis module to analyze visitors' behavior trajectories, and the results are presented to website operators through a user-friendly visual interface. The system thereby automates log collection, analysis, and visualization of results.
Authors: Li Yahong; Hu Qianzhong (School of Computer and Information Engineering, Nanyang Institute of Technology, Nanyang, Henan 473004, China)
Source: Information & Computer (《信息与电脑》), 2018, No. 21, pp. 163-165 (3 pages)
Keywords: data mining; log analysis; distributed computing; Hadoop; Spark
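
Illustrative sketch (not from the paper): the abstract names three analyses, namely operational status monitoring, intrusion detection, and visitor behavior trajectories. The single-process Python sketch below shows what each could look like on a handful of log lines. It does not reproduce the authors' Hadoop/Spark implementation. The Common Log Format input, the tiny `SUSPICIOUS` signature list, and the names `parse_line`, `analyze`, and `SAMPLE` are all assumptions made for illustration.

```python
import re
from collections import Counter, defaultdict
from datetime import datetime

# Assumption: logs follow the Apache/Nginx Common Log Format (the abstract does not say).
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" (?P<status>\d{3}) \S+'
)

# Hypothetical, deliberately tiny signature list; a real detector would need far more.
SUSPICIOUS = re.compile(r"(\.\./|union(?:%20|\+)+select|<script|%3Cscript)", re.IGNORECASE)


def parse_line(line):
    """Parse one log line into a dict, or return None if it does not match."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    record = match.groupdict()
    record["time"] = datetime.strptime(record["time"], "%d/%b/%Y:%H:%M:%S %z")
    record["status"] = int(record["status"])
    return record


def analyze(lines):
    """Return (status code counts, suspicious requests, per-visitor page sequences)."""
    status_counts = Counter()      # operational status: responses per HTTP status code
    alerts = []                    # intrusion detection: requests matching a signature
    hits = defaultdict(list)       # visitor behavior: (time, path) pairs per client IP
    for line in lines:
        record = parse_line(line)
        if record is None:
            continue               # skip malformed lines
        status_counts[record["status"]] += 1
        if SUSPICIOUS.search(record["path"]):
            alerts.append((record["ip"], record["path"]))
        hits[record["ip"]].append((record["time"], record["path"]))
    # Order each visitor's requests by timestamp to obtain a simple trajectory.
    trails = {ip: [path for _, path in sorted(visits)] for ip, visits in hits.items()}
    return status_counts, alerts, trails


# Made-up sample lines, used only to exercise the sketch.
SAMPLE = [
    '10.0.0.1 - - [12/Oct/2018:10:00:05 +0800] "GET /cart HTTP/1.1" 200 512',
    '10.0.0.1 - - [12/Oct/2018:10:00:01 +0800] "GET /index.html HTTP/1.1" 200 1024',
    '10.0.0.2 - - [12/Oct/2018:10:00:03 +0800] "GET /item?id=1+union+select+1 HTTP/1.1" 404 0',
    'not a log line',
]

if __name__ == "__main__":
    counts, alerts, trails = analyze(SAMPLE)
    print(dict(counts))
    print(alerts)
    print(trails)
```

In a distributed setting, the per-line parsing would plausibly become a map step and the three aggregations would become keyed reductions, but that mapping is a guess rather than something the abstract states.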