摘要
为了提高用户的网络体验和页面检索性能,提出一种基于核心词汇和语义匹配相结合的在线社会网络近似网页识别方法研究.首先从网页中提取特征核心词汇,对文档概念进行分类和合并,并建立相似度特征模型;基于相似度特征模型,构建特征向量,并对特征向量进行检索和存储,实现社会网络中近似网页的识别.实验证明提出的方法能够有效地降低网络噪声,具有良好的识别准确率和召回率.
In order to improve the user's Web experience and page retrieval performance, a new method based on key words and semantic matching for online social network is proposed. The first feature extraction of keywords from web pages, classify and merge of document concept, and establish the similarity model; similarity model based on feature vector is constructed, and the retrieval and storage of feature vector, identify the approximate pages in social networks. The experimental results show that the proposed method can effectively reduce the network noise, and has good recognition accuracy and recall rate.
出处
《微电子学与计算机》
CSCD
北大核心
2017年第2期141-144,共4页
Microelectronics & Computer
基金
河南省高等学校重点科研项目(教科技[2015]1120号)"现代化跟踪系统目标成像轨迹模拟研究"(16A520093)
河南省科技厅科技攻关项目"基于WiFi的无线存储测试系统设计"(162102210367)
河南省科技攻关项目"基于机器人视觉下的运动目标成像特征提取技术研究"
关键词
在线社会网络
近似网页
识别方法
online social network
approximate web page
identification method