期刊文献+

基于随机森林的IP地址城市级定位方法研究

Research on IP City-Level Geolocation Based on Random Forest
下载PDF
导出
摘要 针对传统IP地址定位方法准确度低的问题,提出了一种基于随机森林的IP地址城市级定位方法。该方法分析了IP自身特点及IP间存在关系,分别对经过路由特征和地域触发特征进行了定义,并通过主动测量提取IP多维度特征,然后结合机器学习的思想训练构建分类器,以实现对IP所处城市的定位。在此基础上,利用真实的IP地址数据集对所提定位方法进行实证分析。实验结果表明,基于随机森林的IP地址城市级定位方法可以实现IP在物理空间上到城市级的精准定位,且定位结果与主流IP定位工具一致。此外,该方法模型训练时间短且无需进行复杂调优,在保证定位可信度的前提下运行耗时少,具有更高的实际应用价值。 Aiming at the low accuracy of traditional IP address location method,this paper proposes a IP city-level geoloca⁃tion method based on random forest.This method analyzes the characteristics of IP itself and the relationship between IPs,and de⁃fines the routing characteristics and geographical triggering characteristics respectively.Then,the IP multi-dimensional feature is extracted by active measurement,and the classifier is constructed in combination with the idea of machine learning to realize the po⁃sitioning of the city where the IP is located.On this basis,the proposed localization method is empirically analyzed using real IP ad⁃dress data sets.Experimental results show that the IP city-level geolocation method based on random forest can achieve accurate IP geolocation to city level,and the positioning results are highly consistent with the mainstream IP positioning tools.In addition,the model training time of this method is short and no complex tuning is required,so it takes less time to run under the premise of ensur⁃ing the credibility of positioning and has higher practical application value.
作者 蔡颖 张琨 尹魏昕 张云纯 蒋彤彤 方悦 CAI Ying;ZHANG Kun;YIN Weixin;ZHANG Yunchun;JIANG Tongtong;FANG Yue(School of Computer Science and Engineering,Nanjing University of Science and Technology,Nanjing 210094;National Computer Network and Information Security Management Center Jiangsu Sub Center,Nanjing 210019)
出处 《计算机与数字工程》 2021年第6期1163-1170,共8页 Computer & Digital Engineering
基金 江苏省研究生科研与实践创新计划项目(编号:SJCX19_0053)资助。
关键词 IP地址定位 主动测量 网络拓扑 机器学习 随机森林 IP geolocation active measurement network topology machine learning random forest
  • 相关文献

参考文献4

二级参考文献92

  • 1陆介平,刘月波,倪巍伟,陈耿,孙志挥.基于投影数据库的序列模式挖掘增量式更新算法[J].东南大学学报(自然科学版),2006,36(3):457-462. 被引量:5
  • 2张坤,朱扬勇.无重复投影数据库扫描的序列模式挖掘算法[J].计算机研究与发展,2007,44(1):126-132. 被引量:17
  • 3戴英霞 连一峰 王航.系统安全与入侵检测[M].北京:清华大学出版社,2002..
  • 4Wong B, Stoyanov I, Sirer EG. Octant: A comprehensive framework for the geolocalization of Internet hosts. In: Proc. of the NSDI 2007 Symp. Cambridge: ACM Press, 2007.313-326. http://www.cs.comell.edu/People/egs/papers/octant-nsdi,pdf.
  • 5Azureus Software, Inc. Vuze bittorrent client. 2014. http://www.vuze.com/.
  • 6Katz-Bassett E, John J, Krishnamurthy A, Wetherall D, Anderson T, Chawathe Y. Towards IP geolocation using delay and topology measurements. In: Proc. of the IMC 2006. Rio de Janeiro: ACM Press, 2006. 71-84. http://homes.cs.washington.edu/ -arvind/papers/geoloe.pdf [doi: 10.1145/1177080.1177090].
  • 7Gill P, Ganjali Y, Dude WB. Dude, Where's that IP? Circumventing measurement-based IP geolocation. In: Proc. of the Usenix Security Symp. 2010. https://es.uwaterloo.ca/-bernard/sec 10.pdf.
  • 8CAIDA. CAIDA's geolocation tools comparison. 2010. http://www.caida.org/projects/cybersecurity/geolocation/.
  • 9IETF Geopriv Workgroup. draft-ietf-geopriv-uncertainty. 2014. http://tools.ietf.org/wg/geopriv/.
  • 10Robin B, Travis L, Silvia P, Erika DN, Edward O. Html5 draft. 2012. http://dev.w3.org/latml5/spec-author-view/.

共引文献42

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部