期刊文献+

基于频繁链接的Web权威资源挖掘 被引量:6

Mining Authoritative Web Resources Based on Frequent Hyperlinks
下载PDF
导出
摘要 如何有效地利用Web这个巨大的信息库 ?传统的基于关键字的搜索引擎取得了一定的成绩 ,但是存在着查准率不高的问题 Web页面间链接结构事实上隐含地表达着权威的信息 ,这已被许多研究者用来试图改善Web信息检索(包括搜索引擎 )的性能 ,取得了较好的效果 ,但依然存在很大的改善空间 为此 ,提出了FARMING(基于频繁度的Web图的权威资源挖掘 )算法 诠释了新的权威页面定义 ,提出了带阶的频繁子图和权威社团等概念 。 How to utilize the Web resources more efficiently? One of the noteworthy approaches is
出处 《计算机研究与发展》 EI CSCD 北大核心 2003年第7期1095-1103,共9页 Journal of Computer Research and Development
基金 国家自然科学基金 ( 6993 3 0 10 ) 国家"八六三"高技术研究发展计划基金 ( 2 0 0 2AA4Z3 43 0 )
关键词 信息检索 频繁子图 权威页面 权威社团 WEB挖掘 based search engine However, this approach brings negative influence on searching and weighting the perfect (i e the most authoritative) results Actually, hyperlinks between Web pages represent authority implicitly And the power of hyp
  • 相关文献

参考文献19

  • 1G Slaton. Automatic Text Processing: The Transformation,Analysis, and Retrieval of Information by Computer. Reading,MA: Addison Wesley, 1989.
  • 2E Voorhees, N gupta, B Johnson-Laird. Learning collection fusion strategies. ACM SIGIR Conf, Seattle, 1995.
  • 3J Kleinberg. Authoritative sources in a hyperlinked environment.In: Proc of the 9th Annual ACM-SIAM Symposium on Discrete Algorithms. New York: ACM Press, 1998. 668--677.
  • 4P K Reddy, M Kitsuregawa. Inferring Web communities through relaxed cocitation and dense bipartite graphs. 2001. http: //www. tkl. iis. u-tokyo, ac. jp/Kilab/Research/Paper/2001/reddy/6a6.pdf.
  • 5D Florescu, A Levy, A Mendelzon. Database techniques for the World-Wide Web: A survey. ACM SIGMOD Record, 1998, 27(3): 59--74.
  • 6J Cho, N Shivakumar, H Garcia-Molina. Finding replicated Web collections. The 2000 ACM SIGMOD on Managenment of Data,Dallas, 2000.
  • 7K Wang, H Liu. Discovering typical structures of documents: A road map approach. The ACM SIGIR Conf on Research and Development in Information Retrieval, Melbourrne, 1998.
  • 8L Katz. A new status index derived from sociometric analysis.Psychometrika, 1953, 18:39-43.
  • 9C H Hubbel. An input-output approach to clique identification.Sociometry, 1965, 28 : 377- 399.
  • 10E Garfield. Citation analysis as a tool in journal evaluation.Science, 1972, 178(4060): 471-479.

同被引文献59

  • 1胡建武,何贞铭,张贻权.WEB日志挖掘及其实现[J].计算机工程与应用,2004,40(14):156-158. 被引量:13
  • 2陈安龙,唐常杰,陶宏才,元昌安,谢方军.基于极大团和FP-Tree的挖掘关联规则的改进算法[J].软件学报,2004,15(8):1198-1207. 被引量:30
  • 3Jia-WeiHan,JianPei,Xi-FengYan.From Sequential Pattern Mining to Structured Pattern Mining: A Pattern-Growth Approach[J].Journal of Computer Science & Technology,2004,19(3):257-279. 被引量:18
  • 4ShuChing Chen. Identifying topics for Web Documents through fuzzy association learning[J]. International Journal of Computational Intelligence and Applications, 2002, 2(3) : 277-285.
  • 5Arash Rakhshan, Lawrence B Holder, Diane J Cook. Structural Web search engine[J].Intemational Journal on Artificial Intelligence Tools, 2004,13 (1): 27-44.
  • 6Diane J Cook, Nitish Manocha, Lawrence B Holder. Using a graph-based data mining system to perform web search[J] .International Journal of Pattern Recognition and Artificial Intelligence, 2003,17(5): 705-720.
  • 7Supriya Kumar D E, Radha Krishna E Mining Web data usingclustering technique for Web personalization [J]. International Journal of Computational Intelligence and Applications, 2002, 2(3): 255-265.
  • 8CHEN Yu-ru, HUNG Ming-chuan, Don-lin YANG. Using data mining to construct an intelligent web search system[J]. International Journal of Computer Processing of Oriental Languages,2003,16 (2):143-170.
  • 9Gordon S Linoff,Michael J A Berry.Mining the web:transforming customer data into customer value[M].北京:电子工业出版社.2004.
  • 10Wen Gao, Shi Wang,Bin Liu. A dynamic recommendation system based on log mining[J]. International Journal of Foundation of Computer Science, 2002,13 (4): 521-530.

引证文献6

二级引证文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部