
Construction Method of a Webpage Semantic Concept Tree Based on a Covariance-Feature Crawler (协方差特征爬虫网页语义概念树构建方法)

Cited by: 1
Abstract: A method for constructing a webpage semantic concept tree based on a covariance-feature crawler is proposed. A semantic concept decision tree algorithm is introduced to model the principal features. Following the probability-regularized training and transfer rule of a semantic ternary feature decision tree, the effective feature probability of the data set most recently obtained by each decision tree network node is computed, and a covariance-feature web crawler is used to improve the concept tree construction algorithm. The crawler separates autocorrelated components quickly and independently, yielding a semantic autocorrelation retrieval code, so that the constructed concept tree can guide information retrieval. Simulation results show that the algorithm performs data mining and concept tree construction effectively and provides an optimal branching path for locating information, enabling accurate retrieval and location of hot-topic information. The algorithm shows good webpage recall and location retrieval performance, with a marked improvement in data recall rate, which indicates good application value.
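Authors: Liang Wu (梁武), Su Yan (苏燕)
Source: Bulletin of Science and Technology (《科技通报》), Peking University Core Journal, 2015, No. 4, pp. 85-87 (3 pages)
Funding: Guangxi Higher Education Teaching Reform Project (No. 2012JGB404)
Keywords: covariance; feature crawler; webpage; semantic concept tree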
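
The abstract names two quantities without giving formulas: covariance features computed over crawled pages, and the recall rate used to evaluate retrieval. The sketch below only illustrates those two standard definitions and is not the authors' algorithm. The function names `covariance_matrix` and `recall`, and the choice of per-page term counts as features, are assumptions made for illustration.

```python
from statistics import mean


def covariance_matrix(features):
    """Sample covariance between feature columns.

    `features` is a list of rows, one row per crawled page (assumed here
    to hold term counts); each column is one feature.
    """
    n = len(features)
    if n < 2:
        raise ValueError("need at least two pages to estimate covariance")
    dims = len(features[0])
    col_means = [mean(row[j] for row in features) for j in range(dims)]
    return [
        [
            sum((row[i] - col_means[i]) * (row[j] - col_means[j])
                for row in features) / (n - 1)
            for j in range(dims)
        ]
        for i in range(dims)
    ]


def recall(retrieved, relevant):
    """Fraction of the relevant pages that the crawler actually retrieved."""
    relevant = set(relevant)
    if not relevant:
        return 0.0
    return len(relevant & set(retrieved)) / len(relevant)


if __name__ == "__main__":
    # Hypothetical term counts for three pages and two terms.
    pages = [[1, 2], [3, 6], [5, 10]]
    print(covariance_matrix(pages))
    print(recall(["a", "b", "c"], ["a", "c", "d", "e"]))
```

A large off-diagonal entry in the covariance matrix indicates two features that vary together across pages, which is the kind of correlation structure the abstract says the crawler separates.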
