基于文法推断的协议逆向工程被引量：9

Protocol Reverse Engineering Using Grammatical Inference

下载PDF

导出

摘要要深入了解网络中的各种应用过程,进而对这些应用进行自动分类、识别、跟踪和控制,首先就要获得代表这些应用会话过程的状态机.为此提出一种新的方法从采集的应用层数据中反推协议状态机.它采用基于差错纠正的文法推断方法,利用应用层协议交互过程中出现的标识符状态序列,逆向工程其协议状态机.为充分挖掘和发挥差错纠正的性能,提出了最佳路径匹配标准确定纠正路径,以及基于概率统计的异常入度区分及其剪枝的方法;通过去重的状态合并和相似行为意义的协议结构化简措施解决状态膨胀问题,从而获取最精简的协议状态机.通过在包含多种应用层协议的实际网络中的实验,验证了该方法的有效性. To deeply understand procedures of various network applications, and to automatically classify, recognize, trace and control them, protocol state machine that represents the application sessions have to be obtained in advance. A novel approach is presented to reversely infer protocol state machine from collected application layer data. Protocol state machine is derived with a method of error-correcting grammatical inference based on the state sequences that appear in the application sessions. To richly mine and bring into play the performance of error-collecting, a criterion of best- matching path is presented to solve the difficulty of path selection during the error-correcting process. A method with regard to abnormal indegree discrimination and pruning on the basis of statistical probability is proposed. Moreover, negative example sets with similar tokens are adopted to reinforce the error-collecting performance. In order to solve the state expansion during the reconstruction of the state machine, a simplifying measure to obtain a compact protocol state machine that expresses the internal operating mechanism of the protocol accurately is used based on state merging with removal of the identical token and model reduction with a similar behavioral semantic. The experiments conducted in a real network, containing a number of real applications with several application layer protocols, validate this method.

作者肖明明余顺争

机构地区中山大学信息科学与技术学院仲恺农业工程学院信息科学与技术学院

出处《计算机研究与发展》 EI CSCD 北大核心 2013年第10期2044-2058,共15页 Journal of Computer Research and Development

基金国家"八六三"高技术研究发展计划基金项目(2007AA01Z449) 国家自然科学基金-广东联合基金重点项目(U0735002) 国家自然科学基金项目(60970146 61202271)

关键词协议逆向工程协议状态机推断协议分析差错纠正文法推断网络安全 protocol reverse engineering protocol state machine inference protocol analysis error-correcting grammatical inferences network security

分类号 TP393.08 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献33

1Oehlert P. Violating assumptions with fuzzing [J]. IEEE Security and Privacy, 2005, 3(2): 58-62.
2Roesch M. Snort: Lightweight intrusion detection for networks [C] //Proc of the 13th Systems Administration Conf (LISA). Berkeley: USENIX Association, 1999: 229- 238.
3Paxson Vern. Bro: A system {or detecting network intruders in real-time [J]. Computer Networks, 1999, 31(23/24): 2435-2463.
4Aitel D. MSRPC fuzzing with SPIKE 2006 [R/OL]. Miami: Immunity Inc, 2006 [2011-02 01]. http://xcon, xfocus, net/ XCon2006/archieves/Dave_ Aitel Microsoft _ System_ RPC_Fuzz. pdf.
5李伟明,张爱芳,刘建财,李之棠.网络协议的自动化模糊测试漏洞挖掘方法[J].计算机学报,2011,34(2):242-255. 被引量：67
6陈曙晖,苏金树,范慧萍,侯婕.一种基于深度报文检测的FSM状态表压缩技术[J].计算机研究与发展,2008,45(8):1299-1306. 被引量：16
7Aaraj N, Raghunathan A, Jha NK. Dynamic binary instrumentation-based framework for malware defense [G] // LNCS 5137: Proc of the 5th Int Conf on Detection of Intrusions and Malware, and Vulnerability Assessment. Berlin: Springer, 2008: 64-87.
8Cui W, Paxson V, Weaver N, et al. Protocol-independent adaptive replay of application dialog [C] //Proc of the 13th Symp on Network and Distributed System Security (NDSS 2006). San Diego, CA: Internet Society, 2006:1-15.
9Cui W, Kannan J, Wang H. Discoverer: Automatic protocol reverse engineering from network traces [C] //Proc of the 16th Usenix Security Symp. Berkeley: USENIX Association, 2007: 199-212.
10Ma J, Levchenko K, Kreibich C, et al. Unexpected means of protocol inference [C] //Proc of the 6th ACM SIGCOMM Conf on Internet Measurement. New York: ACM, 2006: 313-326.

二级参考文献43

1刘立芳,霍红卫,王宝树.PHGA-COFFEE:多序列比对问题的并行混合遗传算法求解[J].计算机学报,2006,29(5):727-733. 被引量：11
2李伟男,鄂跃鹏,葛敬国,钱华林.多模式匹配算法及硬件实现[J].软件学报,2006,17(12):2403-2415. 被引量：42
3Aho A V, Corasick M J. Efficient string matching: An aid to bibliographic search [J]. Communications of the ACM, 1975, 18(6): 333-340.
4Tuck N, Sherwood T, Calder B, et al. Deterministic memory efficient string matching algorithms for intrusion detection [C] //Proc of the IEEE INFOCOM 2004. Piscataway, NJ: IEEE Press, 2004:333-340.
5Fang Yu, Randy H Katz, Lakshman T V. Gigabit rate packet pattern-matching using TCAM[C] //Proc of the 12th IEEE Int'l Conf on Network Protocols (ICNP' 04). Washington: IEEE Computer Society, 2004.
6Application Layer Packet Classifier for Linux[OL]. [2007-02-14]. http://17-filter, sourceforge, net.
7IPP2P[OL]. [2007-02-14]. http://www, ipp2p, org.
8Snort. Network Intrusion Detection System[OL]. [2007-02- 14]. http://www, snort, org.
9Bro. Intrusion Detection System [ OL]. [ 2007-02-14 ]. http ://bro ids. org/Overview.
10Fang Yu, Zhifeng Chen, Yanlei Diao. Fast and memory-efficient regular expression matching for deep packet inspection, UCB/EECS-2006-76 [R/OL]. Berkeley: University of California, 2006. [2007-02-14]. http://www. eecs. berkeley, edu/Pubs/TechRpts/2006/EECS-2006-76, html.

共引文献96

1杨毅夫,刘燕兵,刘萍,郭牧怡,郭莉.正则表达式的DFA压缩算法[J].通信学报,2009,30(S1):36-42. 被引量：6
2姚远,刘鹏,单征,田双鹏.面向存储的正则表达式匹配算法综述[J].计算机应用,2009,29(12):3171-3173. 被引量：13
3姚振军,黄德根,纪翔宇.正则表达式在汉英对照中国文化术语抽取中应用[J].大连理工大学学报,2010,50(2):291-295. 被引量：9
4肖武德.一种正则表达式的高效分组算法[J].计算机安全,2010(4):57-59. 被引量：4
5姚远,刘鹏,王辉,笱程成.基于稀疏矩阵存储的状态表压缩算法[J].计算机应用,2010,30(8):2157-2160. 被引量：5
6张树壮,罗浩,方滨兴,云晓春.一种面向网络安全检测的高性能正则表达式匹配算法[J].计算机学报,2010,33(10):1976-1986. 被引量：27
7张树壮,罗浩,方滨兴.大规模复杂规则匹配技术研究[J].高技术通讯,2010,20(12):1217-1223. 被引量：3
8潘璠,吴礼发,杜有翔,洪征.协议逆向工程研究进展[J].计算机应用研究,2011,28(8):2801-2806. 被引量：21
9张树壮,罗浩,方滨兴.面向网络安全的正则表达式匹配技术[J].软件学报,2011,22(8):1838-1854. 被引量：29
10郑天明,王韬,郭世泽,李华,赵新杰.改进的空间协议识别算法[J].通信学报,2012,33(5):183-190. 被引量：6

同被引文献144

1赵咏,姚秋林,张志斌,郭莉,方滨兴.TPCAD:一种文本类多协议特征自动发现方法[J].通信学报,2009,30(S1):28-35. 被引量：10
2蔡罡,冯辉宗.基于协议分析状态机的入侵检测系统[J].重庆邮电学院学报（自然科学版）,2005,17(1):97-101. 被引量：4
3William S. Cryptography and Network Security: Principles and Practice[M]. Englewood Cliffs, NJ: Prentice Hall, 201 1 : 30-35.
4Felix G. , Carsten W, Thorsten H. Automatic identification of cryptographic primitives in binary programs [C] //Proc of the 14th Annual hat Symp on Recent Advances in Intrusion Detection. Ferlin: Springer, 2011:41-60.
5Calvet J, Fernandez J M, Marion J Y. Aligot: Cryptograpbie function identification in obfuscated binary programs [C]// Proc of the 2012 ACM Conf on Computer and Communications Security. New York: ACM, 2012:169-182.
6Wondracek G. Comparetti P M, Kruegel C, et al. Automatic network protocol analysis [C] //Proc of the t5th Annual Network and Distributed System Security Symp. San Diego: Internet Society, 2008: 1-14.
7Juan C, Yin H, Liang Z, et al. Polyglot.. Automatic extraction of protocol message format using dynamic binary analysis [C] //Proc of the 14th ACM Conf on Computer and Communications Security. New York: ACM, 2007:317-329.
8Li X, Wang X, Chang W. CipherXRay: Exposing cryptographic operations and transient secrets from monitored binary execution [J]. IEEE Trans on Dependable and Secure Computing, 2012, 99(1): 1-14.
9Wang Z, Jiang X, Cui W, et al. ReFormat: Automatic reverse engineering of encrypted messages [G]//LNCS 5789: Proc of the 14th European Symp on Research in Computer Security. Berlin: Springer, 2009:200-215.
10Lutz N. Towards revealing attackers' intent by automatically decrypting network traffic [D]. Zurich: Swiss Federal Institute of Technology, 2009.

引证文献9

1魏强,武泽慧,王清贤.基于内存依赖关系度量的解密数据提取方法[J].计算机研究与发展,2014,51(7):1547-1554.
2孟凡治,刘渊,张春瑞,李桐.基于状态相关字段识别的未知二进制协议状态机逆向方法[J].电讯技术,2015,55(4):372-378. 被引量：2
3吴礼发,王辰,洪征,庄洪林.协议状态机推断技术研究进展[J].计算机应用研究,2015,32(7):1931-1936. 被引量：8
4王辰,吴礼发,洪征,赖海光,庄洪林.一种基于状态融合的协议状态机推断方法[J].解放军理工大学学报（自然科学版）,2015,16(4):322-329. 被引量：3
5刘渊,张春瑞,孟凡治,李桐,岳旸.基于网络数据的协议逆向工程研究进展[J].计算机工程与设计,2015,36(11):2915-2920. 被引量：7
6王辰,吴礼发,洪征,郑成辉,庄洪林.一种基于域知识的协议状态机主动推断算法[J].计算机科学,2015,42(12):233-239. 被引量：4
7罗建桢,余顺争,蔡君.基于最大似然概率的协议关键词长度确定方法[J].通信学报,2016,37(6):119-128. 被引量：6
8闫小勇,李青,莫有权.基于状态相关字段的二进制协议状态机推断[J].计算机工程,2019,45(7):126-133. 被引量：2
9王晓晨,沈晶,刘海波,于爱民,蔡利君.自动协议逆向工程研究综述[J].计算机应用研究,2020,37(9):2561-2570. 被引量：2

二级引证文献24

1彭博一,张钊,蒋鸿宇.一种基于改进自编码器的二进制协议聚类方法[J].太赫兹科学与电子信息学报,2021,19(4):712-716. 被引量：1
2邓志森.电网工控网络流量分析[J].信息网络安全,2020(S01):127-130. 被引量：1
3罗建桢,余顺争,蔡君.基于最大似然概率的协议关键词长度确定方法[J].通信学报,2016,37(6):119-128. 被引量：6
4姬胜凯,刘仁辉,董伟,许凤凯.协议安全测试在工业DCS系统测评中的应用[J].微型机与应用,2017,36(20):10-13. 被引量：6
5付光远,刘津霖,李海龙.基于HMM的私有协议自主学习方法[J].计算机应用研究,2017,34(12):3779-3783. 被引量：1
6孟博,鲁金钿,王德军,何旭东.安全协议实施安全性分析综述[J].山东大学学报（理学版）,2018,53(1):1-18. 被引量：4
7薛开平,柳彬,李威,洪佩琳.一种面向未知链路帧的格式特征提取与分类算法[J].中国科学院大学学报（中英文）,2018,35(4):521-528. 被引量：1
8洪征,田益凡,张洪泽,吴礼发.基于扩展前缀树的协议格式推断方法[J].计算机工程与应用,2018,54(12):14-20. 被引量：2
9潘思远,王轶骏,薛质,林祥.APT木马网络协议逆向自动化分析[J].计算机应用与软件,2018,35(4):317-324.
10高巍伟,周纯杰.未知工控协议语义逆向研究方法[J].信息通信,2019,0(5):44-47. 被引量：1

1潘璠,吴礼发,杜有翔,洪征.协议逆向工程研究进展[J].计算机应用研究,2011,28(8):2801-2806. 被引量：21
2肖明明,余顺争,张世龙.文法推断网络协议状态机[J].科学技术与工程,2014,22(19):100-105. 被引量：1
3盛立东.关于有限状态文法推断的实用算法[J].北京邮电学院学报,1990,13(3):84-88.
4李志圣,陈永生.上下文无关文法推断中的几条启发规则及其应用[J].计算机工程与科学,2006,28(9):64-66.
5张钊,温巧燕,唐文.协议规范挖掘研究综述[J].计算机工程与应用,2013,49(9):1-9. 被引量：9
6张钊,唐文,温巧燕.一种基于长度语义约束的报文格式挖掘方法[J].北京邮电大学学报,2012,35(6):55-59. 被引量：4
7卢正鼎,董泽锋.文法推断与HMM相结合的信息提取[J].计算机工程与科学,2005,27(8):1-3. 被引量：1
8张瑞岭.文法推断研究的历史和现状[J].软件学报,1999,10(8):850-860. 被引量：4
9颜蕾,吴斌,宋宇波.基于状态机比对的状态机推断方案[J].江苏通信,2015,31(5):63-65.
10吴艳彬,鲜继清,郭艳荣,谢昊飞.EPA协议状态机与服务的一致性测试方法研究[J].电信快报（网络与通信）,2009(4):42-45.

计算机研究与发展

2013年第10期

浏览历史

内容加载中请稍等...

基于文法推断的协议逆向工程被引量：9

参考文献33

二级参考文献43

共引文献96

同被引文献144

引证文献9

二级引证文献24

相关作者

相关机构

相关主题

浏览历史

基于文法推断的协议逆向工程 被引量：9

参考文献33

二级参考文献43

共引文献96

同被引文献144

引证文献9

二级引证文献24

相关作者

相关机构

相关主题

浏览历史

基于文法推断的协议逆向工程被引量：9