期刊文献+

融合密度聚类与集成学习的数据库异常检测 被引量:6

Database Anomaly Detection Based on Density Clustering and Ensemble Learning
下载PDF
导出
摘要 目前,针对数据库系统内部攻击与威胁的检测方法较少,且已有的数据库异常检测方案存在代价开销高、检测准确率低等问题.为此,将密度聚类和集成学习融合,提出一种基于密度聚类和集成学习的数据库异常检测方法.利用OPTICS(Ordering Points To Identify the Clustering Structure)密度聚类算法对用户产生的数据库SQL操作日志进行聚类,通过对SQL语句中的各属性进行分析,提取用户的异常行为,形成先验知识;将Bagging、Boosting和Stacking进行组合,形成集成学习模型,以OPTICS聚类形成的先验知识为基础,并利用该集成学习模型对用户行为作进一步分析,并创建用户行为特征库.基于用户形成特征库,对用户行为进行检测.给出了方案的详细构建过程,包括数据预处理、训练、学习模型建立以及异常检测;利用相关实验数据进行测试,结果表明本方案能以较高的效率检测出数据库异常行为,并且在准确率方面优于同类方案. At present,there are fewer detection methods for internal attacks and threats in database systems,and most existing database anomaly detection schemes have problems such as high cost and low accuracy.Therefore,we proposed a database anomaly detection scheme based on OPTICS and ensemble learning.Use OPTICS to cluster the database SQL operation log generated by the user,and analyze the attributes of the SQL statement to extract the abnormal behavior.Finally,use ensemble learning model composed of Bagging,Boosting and Stacking to further analyze user behavior and create a feature database.Detect user behavior based on the feature database.The detailed construction process of the scheme is given including data preprocessing,training,learning model construction and anomaly detection.The performance of the proposed scheme are evaluated and compared on different metrics.The results show that the scheme can detect abnormal database user behavior with high efficiency and accuracy.
作者 李勃 寿增 刘昕禹 高明慧 马力 徐剑 LI Bo;SHOU Zeng;LIU Xin-yu;GAO Ming-hui;MA Li;XU Jian(China NARI Group Corporation(State Grid Electronic Power Research Institute),Nanjing 210061,China;Beijing Kedong Electric Power Control System Co.,Ltd.,Beijing 100192,China;State Grid Liaoning Electric Power Supply Co.Ltd.,Shenyang 110003,China;Software College,Northeastern University,Shenyang 110169,China)
出处 《小型微型计算机系统》 CSCD 北大核心 2021年第3期666-672,共7页 Journal of Chinese Computer Systems
基金 国家自然科学基金项目(61872069)资助 中央高校基本科研业务费专项资金项目(N2017012)资助。
关键词 异常检测 数据库系统 用户行为 密度聚类 集成学习 anomaly detection database system user behavior density clustering ensemble learning
  • 相关文献

参考文献3

二级参考文献26

  • 1孙吉贵,刘杰,赵连宇.聚类算法研究[J].软件学报,2008(1):48-61. 被引量:1065
  • 2Bahrololoum A,Nezamabadi-pour H,Saryazdi S.A Data Clustering Approach Based on Universal Gravity Rule[J].Engineering Applications of Artificial Intelligence,2015,45(S1):415-428.
  • 3Zhao Qinpei,Shi Yang,Liu Qin,et al.A Grid-growing Clustering Algorithm for Geo-spatial Data[J].Pattern Recognition Letters,2015,53:77-84.
  • 4Karypis G,Han E H,Kumar V.Chameleon:Hierarchical Clustering Using Dynamic Modeling[J].Computer,1999,32(8):68-75.
  • 5Tu Q,Lu J F,Yuan B.Density-based Hierarchical Clustering for Streaming Data[J].Pattern Recognition Letters,2012,33(5):641-645.
  • 6Ester M,Kriegel H P,Sander J,et al.A Density-based Algorithm for Discovering Clusters in Large Spatial Databases with Noise[C]//Proceedings of the 2nd International Conference on Knowledge Discovering in Databases and Data Mining.Portland,USA:AAAI Press,1996:226-232.
  • 7Chen Xiaoming,Liu Wanquan,Qiu Huining,et al.APSCAN:A Parameter Free Algorithm for Clustering[J].Pattern Recognition Letters,2011,32(7):973-986.
  • 8Cassisi C,Ferro A,Giugno R,et al.Enhancing Density-based Clustering:Parameter Reduction and Outlier Detection[J].Information Systems,2013,38(3):317-330.
  • 9LüYinghua,Ma Tinghuai,Tang Meili,et al.An Efficent and Scalable Density-based Clustering Algorithm[J].Neurocomputing,2015,171(1):1-14.
  • 10Rodriguez A,Laio A.Clustering by Fast Search and Find of Density Peaks[J].Science,2014,344(6191):1489-1492.

共引文献43

同被引文献68

引证文献6

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部