An Improved Transport Layer Identification of Peer-to-Peer Traffic (改进的对等网络流量传输层识别方法)

Cited by: 13

Abstract: Peer-to-peer (P2P) traffic identification has been a hot topic in network measurement research in recent years. Identification methods based on transport layer behavior do not depend on the signature strings of P2P applications and therefore scale well. However, the transport layer behavior of network applications is easily affected by the network environment, so the accuracy of these methods differs considerably between network environments in China and abroad. To remedy the shortcomings of existing transport layer identification methods in the Chinese network environment, this paper proposes three improvements: 1) a filtering mechanism based on non-P2P well-known ports; 2) a counting mechanism based on valid data flows; 3) an FTP filtering mechanism based on reverse flows. The improvements are then validated on traffic traces collected in China. The experimental results show that the flow accuracy and byte accuracy of the improved method approach 95% and 99%, respectively. Finally, the improved method is used, for the first time in China, to analyze backbone traffic traces of the China Education and Research Network. The measurements show that the share of P2P traffic on the domestic backbone has risen from 0.76% in the past to roughly 70%.
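The abstract names a port-based filter and two accuracy figures but does not give their exact definitions. The following is a minimal sketch under stated assumptions, not the authors' implementation: the port list `NON_P2P_KNOWN_PORTS`, the `Flow` record, and the reading of "accuracy" as the fraction of reported P2P flows (or bytes) that are truly P2P are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass

# Hypothetical well-known ports of non-P2P services (an assumption, not from
# the paper): FTP, SSH, SMTP, DNS, HTTP, POP3, IMAP, HTTPS.
NON_P2P_KNOWN_PORTS = frozenset({21, 22, 25, 53, 80, 110, 143, 443})


@dataclass(frozen=True)
class Flow:
    src_port: int
    dst_port: int
    n_bytes: int
    is_p2p: bool        # ground-truth label, e.g. from payload inspection
    flagged_p2p: bool   # verdict of some transport-layer heuristic


def passes_port_filter(flow: Flow) -> bool:
    """Return False when either endpoint uses a non-P2P well-known port."""
    return not ({flow.src_port, flow.dst_port} & NON_P2P_KNOWN_PORTS)


def accuracies(flows):
    """Return (flow_accuracy, byte_accuracy) over flows finally reported as P2P.

    Assumed definition: correct reported flows (or bytes) divided by all
    reported flows (or bytes).
    """
    reported = [f for f in flows if f.flagged_p2p and passes_port_filter(f)]
    if not reported:
        return 0.0, 0.0
    correct = [f for f in reported if f.is_p2p]
    flow_acc = len(correct) / len(reported)
    total_bytes = sum(f.n_bytes for f in reported)
    byte_acc = sum(f.n_bytes for f in correct) / total_bytes if total_bytes else 0.0
    return flow_acc, byte_acc
```

Under these assumptions, byte accuracy can exceed flow accuracy when the misclassified flows are small, which is consistent with the gap between the 95% and 99% figures reported in the abstract.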

Authors: Xu Peng (徐鹏), Liu Qiong (刘琼), Lin Sen (林森)

Affiliations: Institute of Software, Chinese Academy of Sciences; Graduate University of Chinese Academy of Sciences

Source: Journal of Computer Research and Development (《计算机研究与发展》), 2008, No. 5, pp. 794-802 (9 pages). Indexed in: EI, CSCD, Peking University Core Journals

Funding: National Basic Research Program of China (973 Program) (2007CB307100, 2007CB307106)

Keywords: network measurement; peer-to-peer; traffic identification; transport layer; network behavior

Classification: TP393 [Automation and Computer Technology: Computer Application Technology]
