Abstract
This paper presents a method for extracting Ajax information items from forms. The method embeds a JavaScript engine in a program that runs independently of the browser and constructs the DOM objects and Ajax application objects locally. It uses the JavaScript engine to trace and execute the script code, simulating the operations a user would perform in the browser, and thereby automatically obtains the data of the form's Ajax information items. Experimental results show that the method can obtain the complete form information of Deep Web query interfaces and improves search accuracy.
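The idea summarized in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the stand-in classes (FakeSelect, FakeDocument, FakeXhr), the canned responses, the /cities URL, and the sample page script are all hypothetical, and the host runtime's own JavaScript engine plays the role of the embedded engine. The sketch builds DOM and Ajax objects locally, runs the page script against them, fires the change event a user would trigger, and records the options that the Ajax callback fills in.

```typescript
// Hypothetical sketch: every name below (FakeSelect, FakeDocument, FakeXhr,
// cannedResponses, pageScript, the /cities URL) is an illustrative assumption.

interface OptionItem { value: string; text: string; }

// Locally built stand-in for a <select> element (no browser involved).
// Simplification: `options` is a plain writable array here.
class FakeSelect {
  id: string;
  value = "";
  options: OptionItem[] = [];
  onchange: (() => void) | null = null;
  constructor(id: string) { this.id = id; }
}

// Locally built stand-in for the page's document object.
class FakeDocument {
  private elements: Record<string, FakeSelect> = {};
  getElementById(id: string): FakeSelect {
    if (!this.elements[id]) this.elements[id] = new FakeSelect(id);
    return this.elements[id];
  }
}

// Canned server replies, so the sketch runs without a network.
const cannedResponses: Record<string, string> = {
  "/cities?province=A": JSON.stringify(["A1", "A2"]),
  "/cities?province=B": JSON.stringify(["B1"]),
};

// Locally built stand-in for the Ajax object; it answers synchronously.
class FakeXhr {
  responseText = "";
  status = 0;
  readyState = 0;
  onreadystatechange: (() => void) | null = null;
  private url = "";
  open(_method: string, url: string): void { this.url = url; }
  send(): void {
    const body: string | undefined = cannedResponses[this.url];
    this.status = body === undefined ? 404 : 200;
    this.responseText = body === undefined ? "[]" : body;
    this.readyState = 4;
    if (this.onreadystatechange) this.onreadystatechange();
  }
}

// Sample page script: choosing a province loads its cities via Ajax.
const pageScript = `
  var province = document.getElementById("province");
  var city = document.getElementById("city");
  province.onchange = function () {
    var xhr = new XMLHttpRequest();
    xhr.open("GET", "/cities?province=" + province.value);
    xhr.onreadystatechange = function () {
      if (xhr.readyState === 4 && xhr.status === 200) {
        city.options = JSON.parse(xhr.responseText).map(function (c) {
          return { value: c, text: c };
        });
      }
    };
    xhr.send();
  };
`;

// Run the script against the local objects, then simulate the user picking
// each trigger value and collect what the Ajax callback wrote to the target.
function extractAjaxItems(
  script: string,
  triggerId: string,
  triggerValues: string[],
  targetId: string,
): Record<string, string[]> {
  const doc = new FakeDocument();
  // Bind the stand-ins to the global names the page script expects.
  new Function("document", "XMLHttpRequest", script)(doc, FakeXhr);
  const trigger = doc.getElementById(triggerId);
  const target = doc.getElementById(targetId);
  const result: Record<string, string[]> = {};
  for (const v of triggerValues) {
    trigger.value = v;
    if (trigger.onchange) trigger.onchange();
    result[v] = target.options.map((o) => o.value);
  }
  return result;
}

const extracted = extractAjaxItems(pageScript, "province", ["A", "B"], "city");
console.log(extracted);
```

With the canned data above, the collected items map province A to A1 and A2, and province B to B1. A real extractor would need far fuller DOM and XMLHttpRequest objects and real HTTP requests; the sketch only shows the control flow.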
Source
Computer Engineering (《计算机工程》)
CAS
CSCD
Peking University Core Journals (北大核心)
2011, Issue 3, pp. 44-46 (3 pages)
Funding
Supported by the National Science and Technology Major Project of China (2009ZX03001-019-01)