摘要
面对当前海量网络日志数据积累的现代社会,人们迫切希望从浩瀚的数据中提炼出有价值的信息。因此,结合分布式系统和当下大数据处理技术,完成了分布式Web日志分析系统的设计和实现。系统结合实时计算和离线计算技术,实现了对站点的入侵检测和运行状态监控分析。同时,将数据挖掘的相关理论应用到系统中的访问者行为分析模块,实现了对访问者行为轨迹的分析,并将分析结果以友好的可视化界面展示给网站运营者,从而达到日志的自动化采集、分析和结果可视化分析处理。
Faced with the modern society that has accumulated huge amounts of online log data,people are eager to extract valuable information from the vast data.This paper combines the distributed system and the current mainstream big data processing technology to complete the design and implementation of a distributed Web log analysis system.The system combines real-time computing and off-line computing technology to achieve site intrusion detection and monitoring of operational status.At the same time,the related theory of data mining is applied to the visitor behavior analysis module in the system to implement the analysis of the visitor's behavior trajectory,and the analysis results are displayed to the operator of the website with a friendly visual interface so As to achieve the automatic analysis of log collection,analysis and visualization of r esults.
作者
李亚红
胡前忠
Li Yahong;Hu Qianzhong(School of Computer and Information Engineering,Nanyang Institute of Technology,Nanyang Henan 473004,China)
出处
《信息与电脑》
2018年第21期163-165,共3页
Information & Computer