期刊文献+

大型Web站点逻辑域挖掘算法

Large Scale Website Logical Domain Mining Algorithm
下载PDF
导出
摘要 通过进一步发展Wen-SyanLi等人提出的Web站点逻辑域理论,该文提出Web站点逻辑域核模型及建立在其上的逻辑域挖掘算法。该算法通过对Web站点超链接的图结构进行运算,得到Web站点逻辑域。与Wen-SyanLi算法对比测试,结果表明在获得相同逻辑域个数的情况下,克服了其采用启发式方法所带来的效率问题。在对4个大型Web站点的单独测试中,平均能够达到85%的逻辑域挖掘精度。 By developing Wen-Syan Li's website logical domain theory, the paper proposes a website logical domain core model and logical domain mining algorithm based upon it. The algorithm computes website's hyperlink graph structure to obtain its logical domain. In comparative test with Wen-Syan Li's algorithm, it overcomes the efficiency defect of Wen-Syan Li's huristic method while obtaining the same quantity of logical domain. In separate test of 4 large scale websites, the logical domain core mining precision can averagely reach 85%.
作者 郑皎凌
出处 《计算机工程》 CAS CSCD 北大核心 2008年第9期101-102,105,共3页 Computer Engineering
关键词 Web站点结构挖掘 逻辑域 逻辑域核 website structure mining logical domain logical domain core
  • 相关文献

参考文献4

  • 1Crescenzi V,Merialdo P,Missier P.Discovering the Structure of Large Web Sites Valter Cresenzi[C]//Proceedings of the 27th International Conference on very Large Data Bases.Washington D.C.,USA:[s.n.],2001.
  • 2Henzinger M R,Motwani R,Silverstein C.Challenges in Web Search Engines[J].ACM SIGIR Forum,2002,36(2):102-118.
  • 3Candan K S,Li Wen-Syan.Reasoning for Web Document Associations and Its Applications in Site Map Construction[J].Data & Knowledge Engineering,2002,43(2):121-150.
  • 4Li Wen-Syan,Kolak O,Vu Q,et al.Defining Logical Domains in a Web Site[C]//Proceedings of the 11th ACM Conference on Hypertext.San Antonio,TX,USA:[s.n.],2000.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部