期刊文献+

Mining Interesting Knowledge from Web-Log 被引量:1

Mining Interesting Knowledge from Web-Log
下载PDF
导出
摘要 Web-log contains a lot of information related with user activities on the Internet. How to mine user browsing interest patterns effectively is an important and challengeable research topic. On the analysis of the present algorithm’s advantages and disadvantages we propose a new concept: support-interest. Its key insight is that visitor will backtrack if they do not find the information where they expect. And the point from where they backtrack is the expected location for the page. We present User Access Matrix and the corresponding algorithm for discovering such expected locations that can handle page caching by the browser. Since the URL-URL matrix is a sparse matrix which can be represented by List of 3-tuples, we can mine user preferred sub-paths from the computation of this matrix. Accordingly, all the sub-paths are merged, and user preferred paths are formed. Experiments showed that it was accurate and scalable. It’s suitable for website based application, such as to optimize website’s topological structure or to design personalized services. Key words Web Mining - user preferred path - Web-log - support-interest - personalized services CLC number TP 391 Foundation item: Supported by the National High Technology Development (863 program of China) (2001AA113182)Biography: ZHOU Hong-fang (1976-), female.Ph. D candidate, research direction: data mining and knowledge discovery in databases. Web-log contains a lot of information related with user activities on the Internet. How to mine user browsing interest patterns effectively is an important and challengeable research topic. On the analysis of the present algorithm’s advantages and disadvantages we propose a new concept: support-interest. Its key insight is that visitor will backtrack if they do not find the information where they expect. And the point from where they backtrack is the expected location for the page. We present User Access Matrix and the corresponding algorithm for discovering such expected locations that can handle page caching by the browser. Since the URL-URL matrix is a sparse matrix which can be represented by List of 3-tuples, we can mine user preferred sub-paths from the computation of this matrix. Accordingly, all the sub-paths are merged, and user preferred paths are formed. Experiments showed that it was accurate and scalable. It’s suitable for website based application, such as to optimize website’s topological structure or to design personalized services. Key words Web Mining - user preferred path - Web-log - support-interest - personalized services CLC number TP 391 Foundation item: Supported by the National High Technology Development (863 program of China) (2001AA113182)Biography: ZHOU Hong-fang (1976-), female.Ph. D candidate, research direction: data mining and knowledge discovery in databases.
出处 《Wuhan University Journal of Natural Sciences》 EI CAS 2004年第5期569-574,共6页 武汉大学学报(自然科学英文版)
基金 theNationalHighTechnologyDevelopment(863programofChina)(2001AA113182)
关键词 Web Mining user preferred path Web-log support-interest personalized services Web Mining user preferred path Web-log support-interest personalized services
  • 相关文献

同被引文献11

  • 1Hochbaum D S,Pathria A.Analysis of the Greedy Approach in the Problems of Maximum k-Coverage[].Naval Research Logistics.1998
  • 2Kannan R,Vempala S,Vetta A.On Clusterings: Good, Bad, and Spectral[].ACM.2004
  • 3Pan Feng,Wang Wei,Anthony K H, et al.Finding Represen- tative Set from Massive Data[].Proceedings of the Fifth IEEE International Conference on Data Mining.2005
  • 4Kantardzic M.Data Mining Concepts, Models, Methods, and Algorithms[]..2003
  • 5Zhou Hongfang,Feng Boqin,Lv Lintao, et al.LQRA: A New Method to Improve Web Searching Quality[].Proceedings of the th Joint International Computer Conference.2005
  • 6Zhou Hongfang,Feng Boqin,Lv Lintao, et al.A New Integrated Personalized Recommendation Algorithm[].Proceedings of Computational Intelligence and Security.2005
  • 7Ali K,Manganaris S,Srikant R.Partial Classification Using Association Rules[].Proc of the rd Int’l Conf on Knowledge Discovery in Databases and Data Mining.1997
  • 8Clark P,Boswell P.Rule Induction with CN2: Some Recent Improvements[].Machine Learning: Proc of the Fifth European Conference.1991
  • 9Dhar V,Tuzhilin A.Abstract-Driven Pattern Discovery in Databases[].IEEE Transactions on Knowledge and Data Engineering.1993
  • 10Mobasher B,Srivastava J.Data Preparation for Mining World Wide Web Browsing Patterns[].Knowledge and Information Systems.1999

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部