
Automatic Classification of Deep Web Data Sources Based on the K-Nearest Neighbor (KNN) Algorithm
Abstract: To meet the need for querying the Deep Web, this paper proposes an automatic classification method for Deep Web data sources based on the K-nearest neighbor (KNN) algorithm. The algorithm extracts form features from Deep Web pages and normalizes the resulting feature vectors, then determines the target topic of each page by computing distances. Experimental results show that KNN-based classification can classify Deep Web data sources fairly effectively and achieves relatively high recall and precision.
Authors: Zhang Zhi (张智), Gu Yunhua (顾韵华)
Source: Information Technology (《信息技术》), 2011, No. 5, pp. 108-111 (4 pages)
Keywords: Deep Web; query interface; K-nearest neighbor (KNN) algorithm; web page classification
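
The abstract describes three steps: extracting form features, normalizing them, and assigning a topic by distance. The following is a minimal sketch of the last two steps, not the authors' implementation. The abstract does not state which features, distance measure, or value of k the paper uses, so the term-count vectors, L2 normalization, Euclidean distance, majority vote, and k=3 below are all assumptions made for illustration.

```python
import math
from collections import Counter


def normalize(vec):
    """Scale a feature vector to unit length (L2 normalization is an assumption)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else list(vec)


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_classify(query, training, k=3):
    """Return the majority topic among the k training forms nearest to `query`.

    `training` is a list of (feature_vector, topic_label) pairs.
    """
    q = normalize(query)
    neighbors = sorted(
        training, key=lambda item: euclidean(q, normalize(item[0]))
    )[:k]
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]


# Hypothetical counts of form-label terms per query interface:
# [title, author, price, departure, destination]
training = [
    ([3, 2, 1, 0, 0], "books"),
    ([2, 3, 0, 0, 0], "books"),
    ([1, 1, 2, 0, 0], "books"),
    ([0, 0, 1, 3, 3], "flights"),
    ([0, 0, 2, 2, 3], "flights"),
    ([0, 0, 0, 3, 2], "flights"),
]

print(knn_classify([2, 1, 0, 0, 0], training, k=3))  # prints "books"
```

Normalizing before measuring distance keeps a form with many fields from looking far from a short form on the same topic merely because its raw counts are larger.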

