期刊文献+

一种改进的自适应文本信息过滤模型 被引量:18

An Improved Model for Adaptive Text Information Filtering
下载PDF
导出
摘要 自适应信息过滤技术能够帮助用户从Web等信息海洋中获得感兴趣的内容或过滤无关垃圾信息.针对现有自适应过滤系统的不足,提出了一种改进的自适应文本信息过滤模型.模型中提供了两种相关性检索机制,在此基础上改进了反馈算法,并采用了增量训练的思想,对过滤中的自适应学习机制也提出了新的算法.基于本模型的系统在相关领域的国际评测中取得良好成绩.试验数据说明各项改进是有效的,新模型具有更高的性能. The information filtering technology is usually used to track favorite topics and eliminate garbage content from information stream. The adaptive information filtering, which requires little initial training resource and can actively improve itself in filtering process, provides a better performance and convenience than the old way. But there are still some difficulties in training and adaptive learning. In this paper, an improved filtering model for adaptive text filtering is proposed. In this model, two retrieval/feedback mechanisms are used respectively. One is based on vector space model and Rocchio feedback algorithm, and another mechanism is derived from a latest language model IR system. Based on them, an incremental learning method using multi-step pseudo feedback is introduced in profile training to keep a minimal bias to the original topic, and an adaptive profile adjusting mechanism in filtering process, which newly takes into account the document distribution and the decay rate of the topic feature, is also developed. The running system constructed using the new model got a high evaluation score in related international contest, indicating that the improvements in the filtering model are effective.
出处 《计算机研究与发展》 EI CSCD 北大核心 2005年第1期79-84,共6页 Journal of Computer Research and Development
基金 国家"八六三"高技术研究发展计划基金项目(2001AA14040)
关键词 信息检索 WEB 自适应信息过滤 LANGUAGE MODEL 相关性反馈 information retrieval Web adaptive information filtering language model relevance feedback
  • 相关文献

参考文献15

  • 1Ricardo Baeza-Yates, Berthier Ribeiro-Neto. Modern Information Retrieval. Reading, MAz Addison-Wesley, 1999.
  • 2Chris Buckley, et al.. The smart/empire TIPSTER IR system.Inz Proc. of TIPSTER Phase 3 Workshop. San Francisco..Morgan Kaufmann Publishers, 1999. 107--121.
  • 3Stephen Roberson, S. Walker. Okapi/keenbow at TREC-8. The 8th Text Retrieval Conf., Gathersburg, USA, 1999. http://tree. nist. gov/pubs/tree8/t8_proceedings, html.
  • 4J. Rocchio. Relevance feedback in information retrieval. In: The SMART Retrieval System. Englewood Cliffs, NJ : Prentice-Hall,1971. 313-323.
  • 5Wu Lide, Huang Xuanjing, et al. Filtering, QA, Web and video tasks. The 10th Text Retrieval Conf. , Gathersburg, USA,2001. http://trec, hist. gov/pub.s/trecl0/t10_proceedings, html.
  • 6Zhai Chengxiang, Peter Jansen, Norbert Roma, et al.Optimization in CLAR1T TREC-8 Adaptive Filtering. The 8th Text Retrieval Conf., Gathersburg, USA, 1999. http://trec. nist. gov/pubs/trec8/t8_proceedings.html.
  • 7Avi Arampatzis. Unbiased S-D threshold optimization, initial query degradation, incrementality, for adaptive filtering. The 10th Text Retrieval Conf., Gathersburg, USA, 2001. http://trec. nist. gov/pubs/trecl 0/t10_proceedings. html.
  • 8Stephen Robertson, Ian Soboroff. The TREC 2001 filtering track report. The 10th Text Retrieval Conf., Gathersburg, USA,2001. http://trec, nist. gov/pubs/trecl0/t10_proceedings, html.
  • 9Stephen Robertson, lan Soboroff. The TREC 2002 filtering track report. The llth Text Retrieval Conf., Gathersburg, USA,2002. http://trec. nist. gov/pubs/trecl 1/t11_proceedings, html.
  • 10Robert E. Schapire, Yoram Singer Amit Singhal. Boosting and Rocchio applied to text filtering. In: Proc. of 21st ACM SIGIR Conf. on Research and Development in Information Retrieval.New York: ACM Press, 1998. 215--223.

同被引文献182

引证文献18

二级引证文献80

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部