摘要
提出了一种基于页面语义的分层迭代划分方法,并将其运用于网页挖掘,通过把网站页面迭代划分为不同数目节点的多层,选取符合要求的层来进行数据挖掘处理,便于快速定位到该层中的某个节点,该节点就是需要的主要内容。
This paper points out a segmentation iterative method based on web semantics and applies this method to web mining. By classifying web iteration into different numbers of hierarchy and by choosing the segmented hierarchy which accords with the requirement to be treated by data mining, some nodes of this hierarchy are rapidly positioned and the contents of this nodes are the main contents required.
出处
《重庆工商大学学报(自然科学版)》
2007年第5期477-480,498,共5页
Journal of Chongqing Technology and Business University:Natural Science Edition
关键词
网页挖掘
网页分层迭代
页面区域
web mining
web segmented iterative anlysis
web region