期刊文献+

基于向量空间模型的个性化信息过滤系统研究与开发 被引量:3

Implementation of Personalized Information Filtering System Based on Vector Space Model
下载PDF
导出
摘要 论文提出了一种基于向量空间模型的用户个性化需求建模方法。对关键词权重算法作出改进,将网页分为四类逻辑段,通过计算关键词在各类逻辑段中的权重而加权得到综合权重。采用基于内容的构建原则和反馈原则,将用户模型构建分为训练阶段和自适应学习阶段。在训练阶段由用户给出的样本文档与关键词采用类重心分类算法训练得到初始用户模型;在自适应学习阶段,提出了基于Rocchio算法的周期性自适应学习机制,根据用户对过滤结果的评价,调整用户模型,以提高对用户个性化需求的动态追踪能力。开发了个性化信息过滤原型系统。以中国服装网为实验数据源,对比百度搜索引擎,测试系统的信息过滤性能。实验结果表明,系统索引更新及时,响应速度快,返回的信息更精确,更合理,更加符合用户的实际需求。 A profiling method of user's personalized requirements based on vector space model is proposed .The algo-rithm for calculating keyword's weight is improved .Web page is divided into four kinds of logic sections ,and the keyword's weight is integrated by calculating weight in all kinds of logic sections .The user profile building process is divided into the training stage and the adaptive learning stage ,adopting the content-based constructing principle and feedback principle .In the training stage ,the initial user profile is gained from the sample documents and keywords given by the user through the al-gorithm of Text Categorization based on Category Centric .In the adaptive learning stage ,a periodic adaptive learning mecha-nism based on Rocchio algorithm is put forward .The user profile is adjusted according to user's evaluation about the filtering result to improve the dynamic tracking ability of user's personalized requirements .Personalized information filtering proto-type system is developed .Taking Chinese Clothing Network as the experimental data ,its performance of information filte-ring is tested with the comparison of Baidu search engine .The experimental results show that the system's index updates more timely ,responds faster ,and returns the information more accurate ,reasonable ,and more in line with the user's actual needs .
作者 许琦
出处 《计算机与数字工程》 2014年第10期1940-1944,1990,共6页 Computer & Digital Engineering
基金 浙江省哲学社会科学规划课题"基于专利引证网络的知识基因提取方法探索"(编号:13NDJC19YBM) 浙江省软科学研究计划项目"技术标准下提升企业自主创新能力--基于专利池的组建与管理"(编号:2013C35064) 浙江省人力资源和社会保障科学研究课题"技术标准下面向知识创新的公共信息服务平台研究"(编号:R2013A012)资助
关键词 信息过滤 个性化需求 用户模型 向量空间模型 ROCCHIO 算法 information filtering personalized requirements user profile vector space model rocchio algorithm
  • 相关文献

参考文献13

二级参考文献70

共引文献45

同被引文献38

  • 1黄春毅,邓红军.一种自适应搜索引擎的构建研究[J].情报杂志,2006,25(2):118-120. 被引量:4
  • 2马文峰,杜小勇.关于知识组织体系的若干理论问题[J].中国图书馆学报,2007,33(2):13-17. 被引量:27
  • 3CNNIC.中国互联网络发展状况统计报告[R],北京:中国互联网信息中心,2014.
  • 4Jeong H Y,Park J H,Jeong Y S,et al. Petri-Net Based User Profile Data Ontology for SNS[C]//Advanced Infor- mation Networking and Applications Workshops (WAINA ),2013 27th International Conference on.2013:744-748.
  • 5Gao Q, Cho Y I. A multi-agent personalized ontology profile based query refinement approach for information retrieval[C]//Control, Automation and Systems ( IC- CAS),2013 13th International Conference on.2013:543-547.
  • 6Gao Q, Xi S M,Im Cho Y. A multi-agent personalized ontology profile based user preference profile construction method[C]//Robotics (ISR ) ,201344th International Symposium on.2013:1-4.
  • 7Kawamae N. Author interest topic model[C]//Proceed- ings of the 33rd international ACM SIGI Rconference on Research and development in information retrieval.2010:887-888.
  • 8Tang J, Meng Z, Nguyen X,et al. Understanding the limiting factors of topic modeling via posterior contrac- tion analysis[C]//Proceedings of The 31st International Conference on Machine Learning.2014:190-198.
  • 9Frederic Morin, Yoshua Bengio. Hierarchical probabi- listic neural network language model[C]// Proceedings of the international workshop on artificial intelligence and statistics.2005:246-252.
  • 10Mikolov T, Sutskever I,Chen K, et al. Distributed rep- resentations of words and phrases and their composition- ality[C]//Advances in Neural Information Processing Systems.2013:3111-3119.

引证文献3

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部