期刊文献+

正则表达式在网络蜘蛛抽取问答系统信息中的应用

The Application of Regular Expressions in Web Spider Extracting the Information from Q and A System
下载PDF
导出
摘要 问答系统是信息检索系统的一种高级形式,为了提高网络蜘蛛在抽取问答系统信息时的爬行效率,从问答系统所特有的布局结构特点出发,结合正则表达式,设计了一个针对问答系统的网络蜘蛛爬行策略。实验证明,该爬行策略提高了网络蜘蛛爬行效率,节省了网络带宽和本地存储空间,有效地提高了答案抽取的精度和效率。 Q and A system has gradually become a new information retrieval technology by returning directly the precise answers to users. In order to improve the web spider's crawl efficiency in the extraction of information from Q and A system,considering the unique characteristics of Q and A system's layout structure and combined with regular expression, a web spider crawling strategy for Q and A system is designed. The experiment results show that this crawling strategy can greatly improve web spider crawl efficiency and save network bandwidth and local storage space to improve the accuracy and efficiency of the answer extraction.
作者 汪材印
出处 《宿州学院学报》 2012年第5期32-35,共4页 Journal of Suzhou University
基金 宿州学院智能信息处理实验室开放课题"用户提问与问答系统中问答对之间的语义相似度研究"(2012YKF36) 安徽省高校自然科学研究一般项目"P2P环境下基于本体的资源语义共享和检索研究"(KJ2011B173)
关键词 正则表达式 网络蜘蛛 问答系统)DOM树 regular expression Web Spider Question Answering System Document Object Model Tree
  • 相关文献

参考文献5

二级参考文献23

  • 1杨桢,赵燕平,朱东华.基于正则表达式的信息抽取系统在国防技术监测中的应用[J].北京理工大学学报,2006,26(z1):74-78. 被引量:9
  • 2崔继馨,张鹏,杨文柱.基于DOM的Web信息抽取[J].河北农业大学学报,2005,28(3):90-93. 被引量:12
  • 3王志晓,牛强.语义Web环境下的信息检索机制研究[J].计算机工程与设计,2007,28(12):2842-2844. 被引量:4
  • 4[8]Ulf Hermjakob. Parsing and Question Classification for Question Answering. Proceeding of the workshop on Open-Domain Question Answering at ACL-2001
  • 5[9]Eugene Agichtein, Steve Lawrence, Luis Gravano. Learning Search Engine Specific Query Transformations for Question Answering. ACM 2001,169- 178
  • 6[10]Soo-Min Kim, ae-Ho Baek, Sang-Beom Kim, Hae-Chang Rim Question Answering Considering Semantic Categories and Co-occurrence Density. Proceedings of the night Text Retrieval Conference (TREC-9)
  • 7[11]Marius Pasca, Sanda Harabagiu. High-Performance Question/Answering. 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( Sigir-01 ). New Orleans, LA. September 9 - 13,2001
  • 8[1]Ittycheriah,M. Franz,W-J Zhu,A. Ratnaparkhi. IBM's Statistical Question Answering System. Proceedings of the night Text Retrieval Conference (TREC-9)
  • 9[2]D. Elworthy. Question Answering Using a Large NLP System. Proceedings of the night Text Retrieval Conference (TREC-9)
  • 10[3]L. Wu,X-j Huang,Y. Guo,B. Liu,Y. Zhang. FDU at TREC-9:CLIR,Filtering and QA Tasks. Proceedings of the night Text Retrieval Conference(TREC-9)

共引文献214

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部