摘要
本文分析了网络文档内容页面分块的提取方法,同时引入了一个层次化关键数据挖掘思想,自顶向下将网页进行划分,这样就可以划分为多个物理模块,从简单的分块操作中获取一个准确的分块决策方法,进而进一步提高分块数据挖掘的准确度。
This paper analyzes the method of extracting the block of network document content pages,and introduces a hierarchical key data mining idea,which divides the webpage from top to bottom,so that it can be divided into multiple physical modules,from simple partitioning.Obtain an accurate block decision method in operation to further improve the accuracy of block data mining.
作者
曹宇逢
CAO Yu-feng(The First High School in Puyang City,Puyang Henan 457000)
出处
《数字技术与应用》
2018年第9期231-231,233,共2页
Digital Technology & Application
关键词
网络文档
分块
数据挖掘
network document
block
data mining