期刊文献+

基于半结构化分割的Web热点数据挖掘算法 被引量:3

Algorithm of Web Hot Data Mining Based on Structured Segmentation
下载PDF
导出
摘要 随着大数据信息技术的发展,数据在线监测和数据挖掘成为计算机信息领域研究的热点。通过对Web热点数据分割挖掘,提高信息热点追踪和Web数据分类能力。传统算法采用非结构化数据挖掘算法,无法有效对Web热点数据进行准确定位和分层挖掘。提出一种基于半结构化分割的Web热点数据挖掘算法。采用半结构化数据进行特征分割,基于优秀基因位进行差分进化,使寻优曲线不断趋于平缓,在多个节点上并行的运行比较脚本,采用半结构化分割,使得Web热点特征挖掘实现自适应寻优,得到Web热点数据的分配因子,提高了挖掘性能。仿真结果表明,该算法获得了良好的效率和精度,提高了Web热点数据挖掘的自适应寻优能力。 With the development of big data information technology, online monitoring data and data mining has become a hot research field of computer information. The segmentation of Web hot data mining, improve the classification ability of information focus and Web data. Using the traditional algorithm of unstructured data mining algorithms, it is not valid for Web hot data for accurate positioning and layered mining. The paper proposed a mining algorithm Web hot data structured based on segmentation, feature segmentation using semi structured data, excellent genes are based on differential evolution,make the optimization curve tends to be gentle, parallel on multiple nodes running script, through the code makes the unstructured data mapped to the data block, make the data stored in the database relational data model, to get the distribution factor Web hot data, improve the mining performance.The simulation results show that the high efficiency and accuracy, it improved adaptive Web hotspot of data mining optimization ability.
作者 阮梦黎
出处 《科技通报》 北大核心 2015年第4期115-117,共3页 Bulletin of Science and Technology
关键词 数据挖掘 半结构化 热点数据 大数据 data mining semi-structured hot data big data
  • 相关文献

参考文献6

二级参考文献181

  • 1周志华,何萍,尹建伟,陈刚.基于Web的海量存储柔性分布式文件服务器设计[J].计算机应用研究,2002,19(11):14-16. 被引量:5
  • 2段盛,李仁发.嵌入式建模中带有时间扩展的UML状态图的形式化方法[J].计算技术与自动化,2007,26(1):108-110. 被引量:4
  • 3Gdc Jong.A UML-based design methodology for real-time and embedded systems[C] //Proceeding of DATE 02.
  • 4Van Der Straeten R.Jonckers V.Mens T.Supporting model refactorings through behaviour inheritance consistencies[C] //Proc of the Unified Modeling Language (UML 2004).LNCS 3273,Heidelberg:Springer-Verlag,2004.305-319.
  • 5Trung PT,Thang HQ.Building the reliability prediction model of component -based software architectures[J].Journal of Information Technology.2009,5(1):18-25.
  • 6Kourad S,Cheng B H C,Campbell L A.Object Analysis Patterns for Embedded Systems[J].IEEE Trans on Software Engineering,2004,30(12):970-992.
  • 7The Generic Modeling Environment,http://www.isis.vanderbilt.edu/Projects/gme/.
  • 8Carl Adam Petri.Communication with automata[D].Darmstadt:Darmstadt University of Technology,1962.
  • 9Yuan Chongyi.The principle and application of Petri nets[M].Beijing:Publishing House of Electronics.
  • 10Ledeczi A,Maroti M,Karsai G,Garrett J,Sprinkle J.The generic modeling environment[C] // Workshop on Intelligent Signal Processing Budapest,Hungary,2001.

共引文献2398

同被引文献21

引证文献3

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部