期刊文献+

基于少量示例的个性化Web信息自动获取系统(英文) 被引量:1

A Personalized Web Information Auto-retrieval System Based on Small Samples
下载PDF
导出
摘要 基于关键词的搜索引擎满足了人们一定的需要,但由于其通用的性质,并不能满足用户的个性化需求,为此,设计并实现了一个基于示例的个性化Web信息自动获取系统.该系统采用了一种新的基于少量Web示例网页和语料库词频统计的特征抽取算法和过滤阈值设定方法.实验结果表明,较基于关键词的搜索引擎而言,该系统能充分考虑用户的兴趣偏好(示例),长期、主动地向用户提供更加准确的Web信息获取服务. current search engines based on keywords satisfy some users' need, they can't meet users' personalized demands for their all purpose characteristics. The design and implementation of a novel personalized Web information auto-retrieval system based on small samples is presented. This system adopts a new algorithm of fea- ture extraction and a new method to determine filtering threshold based on small webpage training sets and term-frequency statistics of corpus. Experimental results show that this system can long-termly and on its own initiative provide more accurate Web information-obtaining service to a user according to his interest than the search engines based on keywords.
出处 《郑州大学学报(理学版)》 CAS 2006年第4期44-49,共6页 Journal of Zhengzhou University:Natural Science Edition
关键词 个性化Web信息获取 WEB信息过滤 特征抽取 少量Web文档示例 personalized Web information retrieval Web document filtering feature extraetion small samples ofWeb documents
  • 相关文献

参考文献8

  • 1STEFAN K,ARMIN H,MARKUS J.Improving document retrieval by automatic query expansion using collaborative learning of term-based concepts[J].Lecture Notes in Computer Science,2002,2423:376-387.
  • 2LI X M,YAN H F,WANG J M.The principle,technique,and system of search engine[M].Beijing:China Science Press,2005.
  • 3Baidu search engine.http://www.baidu.com.
  • 4Zhongsou search engine.http://www.zhongsou.com.
  • 5RICARDO B Y,BERTHIER R N.Modern information retrieval[M].Beijing:China Machine Press,2004.
  • 6Institute of Computational Linguistics of Peking University.PFR People's Daily corpus[EB/OL].http://www.icl.pku.edu.cn/icl-groups/corpus/dwldform1.asp.
  • 7FRANCOIS D,REMI G,MARC T.Text classification from positive and unlabeled examples[C]//IPMU'02,9th International Conference on Information Processing and Management of Uncertainty in Knowledge-based Systems.Annecy,France,2002:1927-1934.
  • 8夏迎炬,黄萱菁,胡恬,吴立德.自适应信息过滤中使用少量正例进行阈值优化(英文)[J].软件学报,2003,14(10):1697-1705. 被引量:6

二级参考文献14

  • 1Salton G. Develovments in automatic text retrieval. Science, 1991,253:974-979
  • 2Zhai C, Jansen P,Roma N, Stoica E, Evans DA. Optimization in CLARIT adaptive filtering. In:Voorhees EM, Harman DK, eds.Proceedings of the 8th Text Retrieval Conference. 1999.253-258.
  • 3Zhang Y, Callan J. Yfilter at TREC9. In: Voorhees EM, Harman DK, eds, Proceedings of the 9th Text Retrieval Conference.Gaithersburg. 2000. 154-161.
  • 4Allan J. Incremental relevance feedback for information filtering. In:Frei HP, Harman D, Schiuble P, Wilkinson R, eds.Proceedings of the 19th annual international ACM SIGIR conference on Research and Development in Information Retrieval 1996.Zurich, Switzerland. 1996. 270-278.
  • 5Arampatzis A, Beney J, Koster CHA, van der Weide TP. KUN on the TREC9 filtering track: Incrementality, decay, and theshold optimization for adaptive filtering systems. In:Voorhees EM, Harman DK, eds. Proceedings of the 9th Text Retrieval Conference.Gaithersburg, 2000. 87-109.
  • 6Bucldey C, Salton G, Allan J. The effect of adding relevance information in a relevance feedback enviroment.ln: Croft WB, van Rijsbergen CJ, eds. Proceedings of the 17th Annual International ACM-SIGIR Conference on Research md Development in Information Retrieval. Dublin, ACM/Springer, 1994. 292-300.
  • 7Voorhees EM, et al. Overview of TREC 2001. In: Voorhees EM, Harman DK, eds. Proceedings of the 9th Text Retrieval Conference. Gaithersburg, 2001. 1 - 12.
  • 8Sebastiani F. Macrame learning in automated text categorization, ACM Computing Surveys, 2002,34(1): 1--47.
  • 9Wu LD, et al. FDU at TREC--9: CLIR, filtering and QA tasks. In: Voorhees EM, Harman DK, eds. Proceedings of the 9th Text Retrieval Conference. Galthersburg, 2000. 202-219.
  • 10Robertson SE, Walker S. Microsoft cambridge at TREC9: Filtering track. In:Voorhees EM, Harman DK, eds. Proceedings of the 9th Text Retrieval Conference. Gaithersburg, 2001. 117-131.

共引文献5

同被引文献12

引证文献1

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部