摘要
网站易成为黑客入侵篡改的对象,网站的实时变更监测对于网站安全尤为重要.针对目前大规模进行网站实时变更监测的难点,设计并实现了一种基于非关系型数据库和消息机制的网站变更监测方案.系统采用爬虫技术进行网站页面实时爬取,通过分布式数据存储和消息机制实现对多网站的实时分析,采用了MD5值与文本对比相结合的算法进行网站内容变更监测,并对监测结果进行可视化.此外,当网站出现异常变更时,支持实时处理告警及紧急切断服务,减少由于网站内容被篡改所带来的不良影响.
Websites are easy to become the target of hacking and tampering.The real-time monitoring of website changes is particularly important for the safety of websites.Regarding the difficulties of large-scale real-time website change monitoring,we design and implement a website change monitoring system based on non-relational database and message mechanism.It uses crawler technology to crawl web pages in real time,and realizes real-time analysis of multiple websites through distributed data storage and message mechanism.An algorithm combining MD5 value and text comparison is designed to monitor website content changes and the results are visualized on the monitoring browser.When abnormal changes occur,it supports real-time alarming and emergency cut-off services in order to reduce the adverse effects caused by website content tampering.
作者
何诗佳
刘晓强
李柏岩
蔡立志
胡芸
He Shijia;Liu Xiaoqiang;Li Baiyan;Cai Lizhi;Hu Yun(College of Computer Science and Technology,Donghua University,Shanghai 201620,China;Shanghai Key Laboratory of Computer Software Testing and Evaluating,Shanghai 201112,China)
出处
《南京师范大学学报(工程技术版)》
CAS
2021年第1期30-35,共6页
Journal of Nanjing Normal University(Engineering and Technology Edition)
关键词
网站内容篡改
网站变更监测
MD5
文本对比算法
分布式存储
消息机制
website content tampering
website change monitoring
MD5
text comparison algorithm
distributed storage
message mechanism