期刊文献+

基于文字链接比的网页分类的研究 被引量:1

Research on Web Page Classification Based on Text Link Ratio
下载PDF
导出
摘要 对于Web内容挖掘来说,对挖掘对象进行初步的识别是非常重要的,首先必须把含有具体内容的网页识别出来,才能进一步进行有效的分析。论文提出了链接比的概念,以此来分析网页的特征,然后进行有监督的学习,从而导出相关的规则,再用该规则对新的网页进行分类。 To Simply Classify the Web page is very important to Web Mining.Firstly,it should identify the Web page which content s the text message.Then it can analyse the page efficiently.This paper puts forward the concept of Link Ratio,and analyzes the character of Web page with it.By supervised learning,it can extract the rule of classification.Finally,the rule can be used to classify the new Web page.
出处 《计算机工程与应用》 CSCD 北大核心 2004年第27期151-153,共3页 Computer Engineering and Applications
关键词 Hub网页 内容网页 链接比 网页分类 Hub page,content page,link rate,Web page classification
  • 相关文献

参考文献10

二级参考文献41

  • 1王建勇,谢正茂,雷鸣,李晓明.近似镜像网页检测算法的研究与评价[J].电子学报,2000,28(z1):130-132. 被引量:21
  • 2[1]Cooley,R.,Srivastava,J.Data preparation for mining World Wide Web browsing patterns.Journal of Knowledge and Information Systems,1999,1(1):5~32.
  • 3[2]Fayyad,U.M.,Piatetsky-Shapiro,G.,Smyth,P.The KDD process for extracting useful knowledge from volumes of data.Communications of the ACM,1996,39(11):27~34.
  • 4[3]Mobasher,B.,Jain,N.,Han,E.H.,et al.Web mining:pattern discovery and from World Wide Web transactions.Technical Report,96-050,University of Minnesota,1996.
  • 5[4]Wu,K.L.,Yu,P.S.,Ballman,A.SpeedTracer:a web usage mining and analysis tool.IBM System Journal,1998,37(1):89~105.
  • 6[1]Mark A.C.Overmeer.My personal search engine.Computer Networks,1999,31:2271~2279
  • 7[2]S.Lawrence,C.Lee Giles.Accessibility of information on the Web.Nature,1999,400
  • 8[3]M.Koster.Robots in the web:threat or treat.Conne Xions,1995,9(4) http://info.webcrawler.com/mak/projects/robots/threat-or-treat.html
  • 9[4]Krishan Bharat,Andrei Broder,Monika Henzinger,etc..The connectivity derver:fast access to linkage information on the web.Proc.7th International World Wide Web Conference,1998
  • 10[5]Soumen Chakrabarti.Mining the Web's link structure.Computer,IEEE,1999,August:60~67

共引文献662

同被引文献7

  • 1GIMSON R.Device Independence Priciples[EB/OL].http://www.w3.org/TR/di-princ/.
  • 2佚名.解读电视的分辨率和清晰度[EB/OL].http://www.yesky.com/379/1932879.shtml.
  • 3岳玮宁,王悦,谭继志,等.移动计算中的小屏幕网络浏览策略及其实现[EB/OL].http//www.docin.com/p-20176384.html.
  • 4WOBBROCK J O,FORLIZZI J,HUDSON S E,et al.Web Thumb:interaction techniques for small-screen browsers[C].Paris,France:UIST'02,2002:205-208.
  • 5HEIDI L,PATRICK B.Summary thumbnails:readable overviews for small screen web browsers[EB/OL].http://www.patrickbaudisch.com/publications/2005-Baudisch-CH105-SummaryThumbnails.pdf.
  • 6王琦犇.基于信息抽取的手持智能终端网页显示技术研究与实现[D].上海:华东师范大学,2006.
  • 7DONG S F,TENG G F,WANG D,et al.Research on the transformation of display format for web informaiton[C] //CCTA2007.USA:Springer,2007.

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部