期刊文献+

使用程序分析和特征识别方法过滤网页广告 被引量:4

Filtering Web Advertisements by Program Analysis and Feature Identification
下载PDF
导出
摘要 网页广告已经成为互联网生态系统的主要经济来源,但随着恶意广告的传播和泛滥,越来越多的网页广告跟踪、窃取用户的隐私数据,对用户的信息安全构成了威胁.因此,大多数用户倾向于在浏览器上安装广告过滤器.现有的广告过滤器,如Ad Block Plus,通过维护一个大的黑名单列表来过滤广告,需要不定期更新列表,运行、维护的代价很大.为此,本文提出了一种结合网页代码静态分析和特征识别的方法来实现广告的自动过滤.该方法首先通过一个简短的过滤列表来过滤确定的广告(Java Script文件),然后对其余的Java Script文件进行静态分析并提取出一系列特征向量,进而训练分类器得到广告识别模型.最后实现了一个Chrome浏览器插件Tri Filter,来完成广告的识别与屏蔽.实验结果表明本文方法在准确率和召回率方面优于现有的广告屏蔽工具. Web advertisements have been the main economic sources of Internet ecosystem. However,more and more web advertisements track and steal user′s private data with the widespread of malware,which is a threat of user′s information security.Thus,most users choose to install Ad Blocker in the browser to avoid these annoying advertisements (short as ads).Existing Ad Blockers,such as AdBlock Plus,filter ads through maintaining a large black list,which needs constant updates of this list and causes great overheads of operation and maintenance.Therefore,we present an approach combining static analysis of web pages and feature identification to filter web ads.First,we filter some determined ads (e.g.JavaScript files) through the match with a small list.Second,we perform static analysis on other JavaScript files to extract a series of feature vectors.Then,we use a classifier to train the feature vectors and generate a model to identify ads.We implement a chrome plugin (TriFilter) to block web ads.The experimental results show that this method is better than existing ad blocking tools in precision and recall.
作者 何欣程 查春柳 许蕾 HE Xin-cheng;ZHA Chun-liu;XU Lei(Department of Computer Science and Technology,Nanjing University,Nanjing 210023,China;State Key Laboratory for Novel Software Technology,Nanjing University,Nanjing 210023,China)
出处 《小型微型计算机系统》 CSCD 北大核心 2018年第9期1998-2002,共5页 Journal of Chinese Computer Systems
基金 国家自然科学基金项目(91418202 61272080 61472076 61403187)资助
关键词 网页广告 程序分析 特征识别 广告屏蔽 Web advertisement program analysis feature identification advertisement blocking
  • 相关文献

同被引文献34

引证文献4

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部