摘要
在网页坐标系中运用VIPS(Vision-based page segmentation)理论,对网页中信息块的重要性进行判定.该方法利用网页创建过程中的设计习惯和人类浏览信息过程中的视觉焦点判定,按九宫格划分页面区域分布并在此基础上识别主题信息,论文最后选取新闻类型网站网页,按不同页面分割比例检测了网页信息块空间层次和主题信息块提取间的关系.
Applying the theory of VIPS in the coordinate system of Webpage,the way of identifying key information in Webpage is developed.The method focuses on judging the visual focus of people in designing or browsing the Webpage,for identifying key information within distributing nine-square grid.At last,with several dissection ratios,News websites are discussed that spatial level of information block and extracting key information block are linked in web pages.
出处
《湛江师范学院学报》
2014年第6期106-113,共8页
Journal of Zhanjiang Normal College
关键词
网页
九宫格
VIPS
关键信息识别
信息去噪
webpage
nine-square grid
VIPS
identifying key information
eliminate noise