
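A feature calculation method for microblog short texts based on the analytic hierarchy process (cited by: 9)

Published English title: Calculating the feature method of short text based on analytic hierarchy process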
Abstract: To build accurate interest models of microblog users and to discover groups of users with similar interests, this paper proposes a feature calculation method for microblog short texts, intended for use with clustering algorithms. The method combines several key indicators of each post (the number of retweets, comments and likes) to measure the importance of short-text features, and applies the analytic hierarchy process (AHP) to improve on the traditional tf-idf feature calculation so that it suits short text. Experiments with a classic text clustering algorithm show that, compared with traditional tf-idf, the improved method achieves better intra-cluster compactness (average scattering within clusters) and inter-cluster separation (total separation between clusters).
Authors: ZOU Xue-qiang (邹学强), BAO Xiu-guo (包秀国), HUANG Xiao-jun (黄晓军), MA Hong-yuan (马宏远), YUAN Qing-sheng (袁庆升). Affiliations: Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China; National Computer Network Emergency Response Technical Team/Coordination Center of China, Beijing 100029, China; University of Chinese Academy of Sciences, Beijing 100049, China; School of Information and Communication Engineering, Beijing University of Posts and Tel, Beijing 100876, China
Source: Journal on Communications (《通信学报》), indexed in EI, CSCD and the Peking University Core Journals list; 2016, No. 12, pp. 50-55 (6 pages)
Funding: National High Technology Research and Development Program of China ("863" Program) (No. SS2014AA012303); National Natural Science Foundation of China (No. 61300206, No. 61402123)
Keywords: analytic hierarchy process; feature calculation; text clustering; short text
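The abstract gives only the outline of the method: AHP-derived weights over retweets, comments and likes are used to adjust tf-idf. The Python sketch below illustrates one way such a scheme could look; it is not the paper's formula. The pairwise judgments in `PAIRWISE`, the log scaling of the counts, and the multiplicative `1 + importance` boost are all assumptions made for illustration, and every function name is hypothetical.

```python
import math
from collections import Counter

# Hypothetical pairwise comparison matrix (Saaty 1-9 scale) over
# (retweets, comments, likes). These judgments are assumptions,
# not values taken from the paper.
PAIRWISE = [
    [1.0, 2.0, 4.0],
    [1 / 2, 1.0, 2.0],
    [1 / 4, 1 / 2, 1.0],
]


def ahp_weights(matrix):
    """Approximate AHP priority weights via normalized row geometric means."""
    n = len(matrix)
    geo = [math.prod(row) ** (1.0 / n) for row in matrix]
    total = sum(geo)
    return [g / total for g in geo]


def consistency_ratio(matrix, weights):
    """Saaty consistency ratio for a 3x3 matrix (random index 0.58)."""
    n = len(matrix)
    aw = [sum(matrix[i][j] * weights[j] for j in range(n)) for i in range(n)]
    lambda_max = sum(aw[i] / weights[i] for i in range(n)) / n
    ci = (lambda_max - n) / (n - 1)
    return ci / 0.58


def post_importance(retweets, comments, likes, weights):
    """Weighted sum of log-damped interaction counts (assumed scaling)."""
    signals = [math.log1p(retweets), math.log1p(comments), math.log1p(likes)]
    return sum(w * s for w, s in zip(weights, signals))


def weighted_tfidf(posts, weights):
    """posts: list of (tokens, retweets, comments, likes) tuples.

    Returns one {term: score} dict per post, where the score is plain
    tf-idf multiplied by (1 + post importance). The boost form is an
    assumption made for this sketch.
    """
    n_docs = len(posts)
    df = Counter()
    for tokens, *_ in posts:
        df.update(set(tokens))
    vectors = []
    for tokens, rt, cm, lk in posts:
        tf = Counter(tokens)
        boost = 1.0 + post_importance(rt, cm, lk, weights)
        vectors.append({
            term: (count / len(tokens)) * math.log(n_docs / df[term]) * boost
            for term, count in tf.items()
        })
    return vectors


# Toy data: two tokenized posts with made-up interaction counts.
posts = [
    (["ahp", "weibo", "cluster"], 120, 30, 400),
    (["weibo", "food"], 0, 1, 2),
]
weights = ahp_weights(PAIRWISE)
vectors = weighted_tfidf(posts, weights)
```

With the assumed matrix the judgments are perfectly consistent, so the consistency ratio is zero up to rounding; a real comparison matrix would normally be accepted only if the ratio stays below 0.1. The resulting vectors could then be passed to any standard clustering algorithm in place of plain tf-idf vectors.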