Web Information Retrieval in TREC 2002

Abstract: The Text REtrieval Conference (TREC) is the most important forum for academic exchange and system evaluation in the international information retrieval community. TREC provides its participants with standard data collections, topics, and relevance judgments so that they can run and evaluate their systems against a common standard. We took part in the Web Track of TREC 2002 on behalf of the Chinese Academy of Sciences. For TREC 2002 we built an information retrieval system that handles large amounts of data and whose content retrieval algorithm performs well on different test collections. Beyond the relevance score from the traditional IR system, we also take into account relevance evidence from anchor text and document structure, and we obtained good results. The approach performed well in both Web Track tasks.

Authors: Yang Zhifeng (杨志峰), Liu Yue (刘悦), Yang Zhe (杨哲), Wang Bin (王斌), Cheng Xueqi (程学旗)

Affiliation: Software Laboratory, Institute of Computing Technology, Chinese Academy of Sciences

Source: Computer Engineering and Applications (《计算机工程与应用》), 2003, No. 26, pp. 37-39, 80 (4 pages). Indexed in CSCD and the Peking University Core Journals list.

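Funding: National Key Basic Research Development Program of China (973 Program; Nos. G1998030413, G1998030510); Frontier Youth Fund of the Institute of Computing Technology (No. 20026180-24)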

Keywords: Information Retrieval, TREC (Text REtrieval Conference), Web Track, Evaluation

Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]
