期刊文献+

基于分布式平台的实时网络日志分析系统设计 被引量:2

Design of real-time network log analysis system based on distributed platform
下载PDF
导出
摘要 传统的网络日志分析系统面临数据来源的多样化、文件分级优化存储、处理效率低和系统时延较长等问题。文章设计了一种基于分布式平台Hadoop的实时网络日志分析系统原型。首先,利用Filebeat组件采集各节点日志文件,根据文件分级归档管理机制对数据过滤、修剪之后发送至Kafka;然后,Kafka根据不同主题把日志流进行分组,实现分级优化存储;最后,使用Kibana工具实现日志文件的高效搜索、可视化分析等操作。实验结果表明:采用模块化设计方案降低了系统的耦合性,文件分级优化存储可以缩短数据处理时延,Kafka服务集群可以提升日志系统高并发处理能力,满足日志实时分析性能要求。 Traditional network log analysis system has been confronted with issues such as the diversification of data sources,file hierarchical optimization and storage,low processing efficiency and long delay.In this paper,a model with real-time network log analysis system is designed based-on the distributed platform Hadoop.First,the Filebeat component is employed to collect the log files of every node.Then,the data are filtered and pruned according to the file hierarchical archive management mechanism and sent to Kafka.After that,log data flow is grouped according to different topics to realize hierarchical optimization storage by using Kafka.Finally,Kibana tools are used to realize the efficient search of log files and visualized analysis and other operations.The experimental results show that:(1)Coupling between systems is reduced by using modularized design solutions;(2)File hierarchical optimization and storage can shorten data processing delay;(3)Kafka service cluster can improve the high concurrent processing capability of log system,so it can satisfy the performance requirements of real-time log analysis.
作者 何长鹏 He Changpeng(School of Public Security Technology,Gansu University of Political Science and Law,Lanzhou 730070,China)
出处 《江苏科技信息》 2020年第27期63-66,共4页 Jiangsu Science and Technology Information
基金 甘肃省高等学校创新基金项目,项目编号:2020B-164 甘肃政法大学校级科研资助项目,项目编号:GZFXQNLW003。
关键词 日志分析 网络日志 HADOOP 网络安全 log analysis network log Hadoop network security
  • 相关文献

参考文献11

二级参考文献91

  • 1姜传菊.网络日志分析在网络安全中的作用[J].现代图书情报技术,2004(12):58-60. 被引量:19
  • 2王文平,刘希玉,韩杰.基于并行遗传算法的关联规则挖掘[J].山东师范大学学报(自然科学版),2006,21(4):29-31. 被引量:7
  • 3iProspect Search Engine User Behavior Study [EB/OL].[2009-11-17]. http://www. iprospect. com/premiumPDFs/ WhitePaper 2006_SearchEngineUerBehavior.pdf.
  • 4Hawking D,Craswell N. Overview of the TREC-2002 Web Track[C]//Proc of the Eleventh Text Retrieval Conference,Technology, 2003:86-95.
  • 5SEWM-2004中文Web检索测试指南[EB/OL].[2009-11-17]. http://www. cwirf. org/2004WebTrack/ SEWM2004WebTrackGuidelines.pdf.
  • 6SEWM2005中文Web检索评测指南[EB/OL].[2009-11-17].http://www.cwirf.org/2005WebTrack/SEWM2005WebTrackGuidelines.pdf.
  • 7Page L, Brin S, Motwani R, et al. The Pagerank Citation Ranking: Bringing Order to the Web[R]. Technical Report, Stanford Digital Library Technologies Project, 1998.
  • 8Kleinberg J M. Authoritative Sources in a Hyperlinked Environment[J]. Journal of the ACM, 1999, 46(5) :604-632.
  • 9Chakrabarti S, Dom B, Raghavan P, et al. Automatic Re source List Compilation by Analyzing Hyperlink Structure and Associated Text[EB/OL]. [2009-11-17]. http://citese er. ist. psu. edu/chakrabarti98automalie. html.
  • 10Culliss G. User Popularity Ranked Search Engine [EB/OL]. [2009-11-17]. http://www. infonortics. com/searchengines/ bostonl999/culliss/index. htm.

共引文献150

同被引文献14

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部