期刊文献+

一种基于动态步长的微博搜索排序算法

A microblog search sort algorithm based on dynamic stepsize
下载PDF
导出
摘要 微博搜索主要是计算文档与查询词之间的相关性,通过统计方法确定词量的权重,再用向量空间模型计算相关度.然而使用词量搜索方法,搜索精度并不高,检测到某条微博的信息含量有限,难以保证用户查询的关注度.针对这一问题,提出基于动态步长的微博搜索排序算法.该算法的主要实现过程:首先对微博已有的特征进行分析,然后用信息熵的方法计算微博信息含量,不使用词量为计算单位,而以词性为单位计算微博的相关度.最后把动态步长加入到List Net排序算法中,并用Armijo-Goldstein准则对步长进行优化.通过仿真实验表明,本算法排序效果更优. Microblog search is mainly calculation the relevance between the document and query,these weight of words are determined by the statistical method,and the relevance degree is calculated by vector space model. However,searching by words is not enough accuracy,the information content of microblog unit detection through this method is limited,thus inadequate to show the true attention paid by users in their query. Aiming to this problem,we proposed a sort algorithm for microblog search based on dynamic stepsize. The main process of algorithm: firstly,the existing features of microblog were analyzed. Secondly,the information content of microblog were calculated by using information entropy method,words were not as the calculating unit,but calculation the relevance of microblog based on part of speech. Finally,the dynamic stepsize was introduced to the List Net sort algorithm,and it was optimized by Armijo-Goldstein principle. The simulation experiment results show that the algorithm sort effect is better.
出处 《湖北大学学报(自然科学版)》 CAS 2016年第3期258-266,共9页 Journal of Hubei University:Natural Science
基金 国家自然科学基金(61202248)资助
关键词 微博 搜索排序 LIST Net算法 Armijo-Goldstein准则 特征值 动态步长 microblog search sort List Net algorithm Armijo-Goldstein principle eigenvalue dynamic stepsize
  • 相关文献

参考文献33

  • 1Sina Tech. The numbers of registered users of twitter is over 500 million:rank only second to Facebook [ EB/OL ]. http :// tech. sina. com. cn/i/m/2012-O7-31/O0387445367, shtml.
  • 2Liu X H, Wei F R, Duan Y J, et al. Semantic search of microblogs [ J ]. Journal of Shandong University ( Natural Science), 2012,47 (5) :39-42.
  • 3PhoenixNet. The numbers of registered users of sina microblog is nearly 500 milhon,75% of active users login in with mobile terminals[ EB/OL]. http ://tech. sina. com. cn/i/2012-02-06/15246687778, shtml.
  • 4Asur S,Huberman BA, Szab6 G, et al. Trends in social media: Persistence and decay [ C ]. Adamic LA, Baeza-Yates RA, Counts S. Proc of the 5th Int' 1 AAAI Conf. on Weblogs and Social Media. Menlo Park:The AAAI Press,2011:434-437.
  • 5Wang C X, Guan X H, Qin T, et al. Who are active? An in-depth measurement on user activity characteristics in sina microblogging[ J]. Proc of the GLOBECOM Piscataway:IEEE,2012:2083-2088.
  • 6Nagmoti R,Teredesai A, De Cock M. Ranking approaches for microblog search[ C ]. Proc of the 2010 IEEE/ACM Int' 1 Confon Web Intelligence-lntelligent Agent Technology (WI-IAT),New York:IEEE Press,2010:153-157.
  • 7Wu S, Hofman J M, Mason W A, et al. Who says what to whom on Twitter[ C]. Srinivasan S, Ramamritham K, Kumar A, Ravindra MP, Bertino E, Kumar R. Proc of the 20th Int' 1 Conf on World Wide Web, New York :ACM Press,2011:705-714.
  • 8Yu L,Asur S, Huberman B A. What trends in Chinese social media[ C]. Proc of the 5th SNA-KDD Workshop' 11 (SNA- KDD 2011 ) ,New York:ACM Press,2011:81-87.
  • 9Cha M, Mislove A, Gummadi KP. A measurement-driven analysis of information propagation in the flickr social network [ C ]. Proc of the 18th Int'l Conf on World Wide Web, New York:ACM Press,2009:721-730.
  • 10Meij E, Weerkamp W, Rijke M D. Adding semantics to microblog posts [ C ]. Adar E, Teevan J, Agichtein E, Maarek Y. Proc ofthe 5th ACM Int'l Conf on Web Search and Data Mining,New York :ACM Press,2012:563-572.

二级参考文献23

  • 1樊兴华,孙茂松.一种高性能的两类中文文本分类方法[J].计算机学报,2006,29(1):124-131. 被引量:70
  • 2Lazarsfield P et al. The People's Choice. New York= Columbia University Press, 1948.
  • 3Weimann Gabriel, Tustin Deon Harold, van Vuuren I)aan, Joubert J P R. Looking for opinion leaders Traditional vs. modern measures in traditional societies. International Journal of Public ()pinion Research, 2007, 19(2).- 173 190.
  • 4Matsumura Naohiro, Ohsawa Influence diffusion model in Transactions of the Japanese genee, 2002, 17(3): 259 267 Yukio, Ishizuka Mitsuru text based communication Society for Artificial Intelli.
  • 5Zhou Hengmin, Zeng Daniel, Zhang Changli. Finding leaders from opinion networks//Proceedings of the 2009 IEEE International Conference on Intelligence and Security Informatics(ISI09). DallasTX, USA, 2009 266 268.
  • 6Zhang Jun, Ackerman Mark S, Adamic Lada. Expertise net works in online communities: Structure and algorithms// Proceedings of the 16th International World Wide Web Con [erenee Committee(IW3C2). Ban[f, Canada, 2007:221 230.
  • 7Song Xiaodan, Chi Yun, Hino Koji, Belle Tseng. Identifying opinion leaders in the blogosphere//Proeeedings of the 16th ACM Conference on Information and Knowledge Manage ment(CIKM07). New York, USA, 2007:971-974.
  • 8Zhai Zhongwu, Xu Hua, Jia Peifa. Identifying opinion lead ers in BBS//Proceedings of the IEEE/WIC/ACM Interna tional Conference on Web Intelligence and Intelligent Agent Technology(WI IAT'08). Sydney, NSW, Australia, 2008: 398 401.
  • 9Fan Xing-Hua, Nie Jian-Yun. lank distribution dependency model for document retrieval. Journal of Information Computational Science, 2009, 6(3): 1553-1564.
  • 10MISLOVE A,MARCON M,GUMMADI K P. Measurement and analysis of online social networks[A].New York,USA:ACM,2007.29-42.

共引文献111

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部