基于加权信息增益的恶意代码检测方法被引量：9

Malicious Code Detection Method Based on Weighted Information Gain

下载PDF

导出

摘要采用数据挖掘技术检测恶意代码,提出一种基于加权信息增益的特征选择方法。该方法综合考虑特征频率和信息增益的作用,能够更加准确地选取有效特征,从而提高检测性能。实现一个恶意代码检测系统,采用二进制代码的N-gram和变长N-gram作为特征提取方法,加权信息增益作为特征选择方法,使用多种分类器进行恶意代码检测。实验结果证明,该方法能有效提高恶意代码的检测率和准确率。 Using data mining technology to detect malicious code, this paper proposes a feature selection method based on weighted intormation gain. This method can select effective features more correctly by combining the advantage of informatiou gain with classwise frequency. A malicious code detection system is implemented which adopts binary N-gram and variable-length N-gram as the feature extraction method, weighted informatinn gain as the feature selection method. Several classifiers are used to detect malicious code in the system. Experimental results prove that this method can effectively improve the detection and accuracy rate.

作者张小康帅建梅史林

机构地区中国科学技术大学自动化系

出处《计算机工程》 CAS CSCD 北大核心 2010年第6期149-151,共3页 Computer Engineering

基金国家"863"计划基金资助项目(2006AA01Z449)

关键词数据挖掘变长N—gram 特征选择信息增益 data mining variable-length N-gram feature selection： information gain

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献5

1Schultz M G, Eskin E, Zadok E, et al. Data Mining Methods for Detection of New Malicious Executabtes[C]//Proc. of the IEEE Symposium on Security and Privacy. Oakland, California, USA: IEEE Press, 2001: 38-49.
2Assaleh T A, Cercone N, Keselj V, et al. Detection of New Malicious Code Using N-grams Signatures[C]//Proc. of the 2nd Annual Conference on Privacy, Security and Trust. Ontario, Canada [s. n.], 2004: 193-196.
3Kolter J Z, Maloof M A. Learning to Detect and Classify Malicious Executables in the Wild[J]. Journal of Machine Learning Research, 2006, 7: 2721-2744.
4Reddy D S, Dash S K, Pujari A K. New Malicious Code Detection Using Variable Lenglb N-grams[C]//Proc. of the 2nd International Conference on Information Systems Security. Kolkata, India: [s. n.], 2006: 276-288.
5Cohen P, Heeringa B, Adams N M. An Unsupervised Algorithm for Segmenting Categorical Time Series into Episodes[C]//Proc. of the ESF Exploratory Workshop on Pattern Detection and Discovery. London, UK: [s. n.], 2002: 49-62.

同被引文献80

1李伟,苏璞睿.基于内核驱动的恶意代码动态检测技术[J].中国科学院研究生院学报,2010,27(5):695-703. 被引量：9
2张波云,殷建平,蒿敬波,张鼎兴.基于多重朴素贝叶斯算法的未知病毒检测[J].计算机工程,2006,32(10):18-21. 被引量：22
3朱裕禄.Linux系统下的ELF文件分析[J].电脑知识与技术,2006(9):111-113. 被引量：5
4王洪春,彭宏.一种基于主成分分析的异常点挖掘方法[J].计算机科学,2007,34(10):192-194. 被引量：14
5MAIRH A, BARIK D, VERMA K, et al. Honeypot in network secur- ity: a survey[ C] //Proceedings of the 2011 ACM International Con- ference on Communication. New York: ACM Press, 2011 : 600 - 605.
6Rinsing. Safty Reports[ EB/OL]. [ 2011 - 07 - 20]. http://www. rising, com. en/about/news/rising/2011 - 07 - 20/9802. html.
7YE Y, CHEN L, LI T, et aL An interpretable string based malware detection system using SVM ensemble with bagging[ J]. Journal of Computer Virolo-, 2009, 5(4) : 283 -293.
8F-Secure. Virus and threats[ EB/OL]. [ 2011 - 05 - 25]. http:// www. f-secure, com/v-descs/cih, shtml.
9Datarescue. IDA Pro[ EB/OL]. [ 2011 - 03 - 10]. http://www. datarescue, com.
10ABOU-ASSALEH T, CERCONE N, KESELJ V, et al. N-gram- based detection of new malicious code[ C] // COMPSAC'04: Pro- ceedings of the 28th Annual International Computer Software and Applications Conference. Washington, DC: IEEE Computer Socie- ty, 2004:41-42.

引证文献9

1张健飞,陈黎飞,郭躬德.检测迷惑恶意代码的层次化特征选择方法[J].计算机应用,2012,32(10):2761-2767. 被引量：3
2沈壮毫.基于白名单的Web应用程序安全防护[J].广州大学学报（自然科学版）,2012,11(6):27-31. 被引量：3
3朱立军,徐玉芬.C4.5算法在未知恶意代码识别中的应用[J].沈阳化工大学学报,2013,27(1):78-82.
4曾键,赵辉.一种基于N-Gram的计算机病毒特征码自动提取方法[J].计算机安全,2013(10):2-5. 被引量：3
5黄一峰,黄俊伟,吴恋.一种应用机器学习和D-S证据理论的Linux病毒检测方案[J].单片机与嵌入式系统应用,2014,14(4):28-31.
6杨燕,蒋国平.基于N-Gram的计算机病毒特征码自动提取的改进方法[J].计算机科学,2017,44(B11):338-341. 被引量：8
7徐久成,黄方舟,穆辉宇,王云,徐战威.基于PCA和信息增益的肿瘤特征基因选择方法[J].河南师范大学学报（自然科学版）,2018,46(2):104-110. 被引量：10
8文伟平,吴勃志,焦英楠,何永强,通信作者.基于机器学习的恶意文档识别工具设计与实现[J].信息网络安全,2018,0(8):1-7. 被引量：3
9马春波,曾坤.一种基于行为分析和KNN算法的恶意软件检测模型[J].计算机科学与应用,2017,7(6):491-498.

二级引证文献30

1冯本慧.一种基于变长指令序列与粗糙集属性约简的恶意代码检测技术[J].科技视界,2013(23):19-19.
2陈琦,马迪,王岩.一种基于人工免疫和代码相关性的计算机病毒特征提取方法[J].网友世界,2014,0(23):6-7.
3石波,王红艳,郭旭东.基于业务白名单的异常违规行为监测研究[J].信息网络安全,2015(9):144-148. 被引量：6
4肖利强.网络安全风险评估与控制研究[J].电子技术与软件工程,2016(7):215-215. 被引量：1
5左栋,张雨心.国内互联网地图POI存在的涉密问题及其解决办法[J].测绘通报,2016(9):108-111. 被引量：11
6张亮.校园网智能恶意软件数据检测研究[J].微型电脑应用,2016,32(10):44-47.
7范宇杰,陈黎飞,郭躬德.软件代码的恶意行为学习与分类[J].数据采集与处理,2017,32(3):612-620. 被引量：4
8杨燕,蒋国平.基于N-Gram的计算机病毒特征码自动提取的改进方法[J].计算机科学,2017,44(B11):338-341. 被引量：8
9唐春明,魏伟明.基于安全两方计算的具有隐私性的回归算法[J].信息网络安全,2018(10):10-16. 被引量：2
10滕予非,冯世林,张真源,何锐,吴杰,高剑,李熠,张宏图.基于数据驱动的500 kV高压并联电抗器过流误报警在线判别方法研究[J].电力系统保护与控制,2019,47(3):146-153. 被引量：10

1苏志同,李晋宏,王俊山.一种改进的决策树算法及其应用[J].微计算机信息,2009,25(30):177-178. 被引量：5

计算机工程

2010年第6期

浏览历史

内容加载中请稍等...

基于加权信息增益的恶意代码检测方法被引量：9

参考文献5

同被引文献80

引证文献9

二级引证文献30

相关作者

相关机构

相关主题

浏览历史

基于加权信息增益的恶意代码检测方法 被引量：9

参考文献5

同被引文献80

引证文献9

二级引证文献30

相关作者

相关机构

相关主题

浏览历史

基于加权信息增益的恶意代码检测方法被引量：9