基于微博行为数据的不活跃用户探测被引量：2

Detecting Inactive Users from Behavior Data Based on Weibo

下载PDF

导出

摘要随着微博注册用户的增长,探测不活跃账号,自动判定用户活跃度有重要的商业价值。该文提出了一种自动检测算法并通过实验验证。算法核心是提出的影响用户活跃度的4个判定因子,可由用户行为计算得到。算法包含用户活跃度概率层次模型(ADPHM)和用户评分模型(USM)。ADPHM模型计算用户是不活跃用户的概率;USM模型计算用户活跃度得分。实验数据集包含了新浪微博2 316 281个用户信息和141 322 019条微博内容。实验结果表明,该算法能在线性时间复杂度下自动检测出不活跃账号,完善用户可信度评估体系。 With the growth of registered users in microblog, how to detect inactive accounts and automatically judge the user activity have an important commercial value. To meet this need, an automatic detection algorithm is proposed and experimentally tested. The kernel of automatic detection algorithm is four determining factors of inactive users we defined, which can be calculated by user’s behavior. The algorithm contains User Active Degree Probability Hierarchical Model （ADPHM） and User Scoring Model （USM）. The ADPHM is employed to estimate the probability of inactive user;the USM is used to give a user＆#39;s activity score. Experiment data contains 2 316 281 users’ information and their 141 322 019 tweets crawled from Sina-Weibo. Experimental results show that this method can detect inactive users automatically and improve user confidence evaluation system in linear time complexity.

作者刘晶王峰胡亚慧李石君

机构地区武汉大学计算机学院中南民族大学计算机科学学院空军预警学院计算机教研室

出处《电子科技大学学报》 EI CAS CSCD 北大核心 2015年第3期410-414,444,共6页 Journal of University of Electronic Science and Technology of China

基金国家自然科学基金(61272109) 中央高校基本科研业务费专项资金(CZY15006)

关键词活跃度自动识别不活跃用户微博社交网络 activity automatic identification inactive users microblog social network

分类号 TP182 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献11

1丁兆云,周斌,贾焰,汪祥.微博中基于统计特征与双向投票的垃圾用户发现[J].计算机研究与发展,2013,50(11):2336-2348. 被引量：11
2STRINGHINI Cg KRUEGEL C, VIGNA G. Detecting spammers on social networks[C]//Proceedings of the 26th Annual Computer Security Applications Conference. New York, USA: ACM, 2010: 1-9.
3WANG A H. Don't follow me: Spam detection in twitter[C]//Proceedings of the 2010 International Conference on Security and Cryptography. Washington, USA: IEEE Press, 2010: 1-10.
4SOFUS A. MACSKASSY. On the study of social interactions in twitter[C]//Proceedings of the 6th International AAAI Conference on Weblogs and Social Media. Dublin, USA: AAAI Press, 2012.
5REZA Z, MOHAMMAD-AMIN J, HAMIDREZA B, et al. A novel approach for social behavior analysis of the blogosphere[C]//Proceedings of the 21st Conference of the Canadian Society for Computational Studies of Intelligence. Windsor: Springer-Verlag, 2008: 356-367.
6BOYD D, GOLDER S, LOTAN G. Tweet, tweet, retweet: Conversational aspects of retweeting on twitter[C]// Proceedings of the 43rd Hawaii International Conference on System Sciences. Honolulu, USA: IEEE Press, 2010.
7LAS-CASAS PHB, GUEDES D, ALMIDA J M, et al. SpaDeS: Detecting spammers at the source network[J]. Computer Networks, 2012, 57(2): 526-539.
8莫倩,杨珂.网络水军识别研究[J].软件学报,2014,25(7):1505-1526. 被引量：55
9LIM E P, NGUYEN V A, JINDAL N, et al. Detecting product review spammers using rating behaviors[C]// Proceedings of the 19th ACM International Conference on Information and Knowledge Management (CIKM 2010). New York, USA: ACM Press, 2010: 939-948.
10AKOGLU L, CHANDY R, FALOUTSOS C. Opinion fraud detection in online reviews by network effects[C]// Proceedings of the 7th International Conference on Weblogs and Social Media (ICWSM 2013). Menlo Park: AAAI Press, 2013: 2-11.

二级参考文献26

1张泽明,罗文坚,王煦法.一种基于人工免疫的多层垃圾邮件过滤算法[J].电子学报,2006,34(9):1616-1620. 被引量：16
2中国互联网络信息中心.中国互联网络发展状况统计报告[EB/OL].http://www.cnnic net.cn,2003—07-01.
3Kwak H. Lee C. Park H. et al. What is twitter. a social network or a news media? [C] / /Proc of the 19th Int World Wide Web Conf. New York, ACM. 2010, 591-600.
4Yin D. Hong L. Xiong X. et al. Link formation analysis in microblogs [C] / /Proc of the 34th Annual Int ACM SIGIR Conf on Information Retrieval. New York, ACM. 2011, 1235-1236.
5Becchetti L. Boldi P. Castillo C. er al. Efficient semistreaming algorithms for local triangle counting in massive graphs [C] / /Proc of the 14th ACM SIGKDD Int Conf On Knowledge Discovery and Data Mining. New York, ACM. 2008, 16-24.
6Tsourakakis C. Fast counting of triangles in large real networks without counting, Algorithms and laws [C] / /Proc of the 8th IEEE Int Conf on Data Mining. Piscataway. NJ, IEEE. 2008, 608-617.
7Gyongyi Z, Garcia-Molina H. Pedersen J. Combating Web sparn with TrustRank [C] / /Proc of the 30th Int Conf on Very Large Data Bases. San Franciso . Morgan Kaufmann, 2004, 576-587.
8Sobek M. PRO-Google's PageRank 0 penalty [EB/OL]. (2003-01-31) [2012-07-28]. http://pr. efactory. dele-prO. shtml.
9Wu B. Goel V. Davison B. Propagating trust and distrust to demote Web sparn [C] / /Proc of Models of Trust for the Web Workshop of 15th Int World Wide Web Conf. New York, ACM. 2006, 29-37.
10Chu Z. Gianvecchio S. Wang H. et al. Who is tweeting on twitter, Human. bot. or cyborg? [C] / /Proc of the 26th Annual Computer Security Applications Conf. New York, ACM. 2010, 21-30.

共引文献63

1李丹珉,谢耘耕.政治传播视角下社交机器人的研究现状及发展趋势——基于SCI和SSCI文献的计量分析[J].新媒体与社会,2023(2):140-156.
2罗云松,黄慕宇,贾韬.重采样在微博机器人识别中的应用研究[J].中文信息学报,2021,35(12):133-148. 被引量：1
3刘肖凡,吴晔,许小可.融媒体环境下的受众计算:途径与挑战[J].中国传媒大学学报（自然科学版）,2021(1):64-70. 被引量：4
4杨柳青.浅析淘宝网络刷单中的“水军”现象[J].新西部（中旬·理论）,2015(2):72-72. 被引量：11
5刘勘,袁蕴英,刘萍.基于随机森林分类的微博机器用户识别研究[J].北京大学学报（自然科学版）,2015,51(2):289-300. 被引量：19
6张进,刘琰,罗军勇,董雨辰.基于特征分析的微博炒作账户识别方法[J].计算机工程,2015,41(4):48-54. 被引量：3
7吴敏,尹芳,陈慧安.电子商务网络水军产业链分析[J].中国管理信息化,2015,18(8):204-205. 被引量：1
8程晓涛,刘彩霞,刘树新.基于关系图特征的微博水军发现方法[J].自动化学报,2015,41(9):1533-1541. 被引量：25
9张玉清,吕少卿,范丹.在线社交网络中异常帐号检测方法研究[J].计算机学报,2015,38(10):2011-2027. 被引量：26
10石文华,高羽,胡英雨.基于情感倾向和观察学习的在线评论有用性影响因素研究[J].北京邮电大学学报（社会科学版）,2015,17(5):32-39. 被引量：5

同被引文献32

1张彩虹,曹和安.护理文化探讨的现状及展望[J].护理学杂志（外科版）,2005,20(4):68-70. 被引量：20
2杜琳,王桂生.护理人员与患者对护理文化认知的差异性[J].护理学杂志（综合版）,2007,22(3):17-19. 被引量：6
3张红,贾琦,姚荷英.护理文化概念及建设的研究进展[J].护理管理杂志,2010,10(9):646-648. 被引量：22
4李海芳,何海鹏,陈俊杰.性格、心情和情感的多层情感建模方法[J].计算机辅助设计与图形学学报,2011,23(4):725-730. 被引量：19
5孙宜君,王建磊.论新媒体对文化传播力的影响与提升[J].当代传播,2012(1):46-48. 被引量：124
6周莉.浅谈科室护理文化建设[J].内蒙古中医药,2013,32(9):110-111. 被引量：1
7李永凤.微信用户增长原因探微[J].传媒,2014(5):54-56. 被引量：25
8董玉红,章静,章海燕.微信群在护理单元业务学习的应用效果[J].护士进修杂志,2014,29(8):700-701. 被引量：92
9邓芬,王秀菊,邓牡红.微信＋QQ群在现代临床护理管理中的应用[J].中国急救复苏与灾害医学杂志,2014(4):382-384. 被引量：56
10HE Li,JIA Yan,HAN Weihong,DING Zhaoyun.Mining User Interest in Microblogs with a User-Topic Model[J].China Communications,2014,11(8):131-144. 被引量：17

引证文献2

1牛振军,周红.新媒体对建设护理文化的意义与启示[J].护理实践与研究,2016,13(16):16-18.
2黄发良,冯时,王大玲,于戈.基于多特征融合的微博主题情感挖掘[J].计算机学报,2017,40(4):872-888. 被引量：62

二级引证文献62

1李玉强,黄瑜,孙念,李琳,刘爱华.基于性格情绪特征的改进主题情感模型[J].中文信息学报,2020(7):96-104. 被引量：1
2王勇,马钰,徐胜华,王艳东,罗安,刘万增,狄琳.兴趣点推荐方法研究进展与展望[J].测绘科学,2023,48(12):217-224. 被引量：1
3童丽萍,李明.风荷载作用下玻璃幕墙结构的受力分析与计算[J].工业建筑,2000,30(4):27-30. 被引量：13
4王娟丽.网络社会公共危机影响因素的实证分析[J].图书馆,2017(5):40-46. 被引量：3
5苏兵杰,周亦鹏,梁勋鸽.基于XGBoost算法的电商评论文本情感识别模型[J].物联网技术,2018,8(1):54-57. 被引量：11
6张强,陶皖,王海燕.微博情感分析综述[J].安庆师范大学学报（自然科学版）,2017,23(4):68-74. 被引量：2
7金志刚,胡博宏,张瑞.基于深度学习的多维特征微博情感分析[J].中南大学学报（自然科学版）,2018,49(5):1135-1140. 被引量：14
8刘秋慧,柴玉梅,刘箴.中文微博情感分析模型SR-CBOW[J].小型微型计算机系统,2018,39(8):1693-1699. 被引量：4
9刘纳,王新.基于主题流与深度学习的情感分析算法[J].软件导刊,2018,17(8):28-30. 被引量：1
10赵乐,张兴旺.面向LDA主题模型的文本分类研究进展与趋势[J].计算机系统应用,2018,27(8):10-18. 被引量：8

1王晓堤,王屾,赵旭.基于用户可信度聚类的协同过滤推荐模型[J].微计算机信息,2010,26(30):219-221. 被引量：2
2傅若岩.从内容到社交:新闻客户端挖掘流量变现新商机[J].IT时代周刊,2013(16):28-29. 被引量：7
3潘骏驰,张兴明,汪欣.一种结合用户可信度与相似度的鲁棒性推荐算法[J].计算机应用研究,2016,33(10):2988-2991. 被引量：2
4王峰,余伟,李石君.新浪微博平台上的用户可信度评估[J].计算机科学与探索,2013,7(12):1125-1134. 被引量：9
5王锦坤,姜元春,孙见山,孙春华.考虑用户活跃度和项目流行度的基于项目最近邻的协同过滤算法[J].计算机科学,2016,43(12):158-162. 被引量：12
6梁宏,许南山,卢罡.新浪微博用户及其微博特征分析[J].计算机工程与应用,2015,51(7):141-148. 被引量：5
7宋福英.电子政务集成用户可信度的PKI/PMI安全机制的研究[J].智能计算机与应用,2016,6(3):81-83. 被引量：1
8陈平华,陈传瑜,洪英汉.一种结合关联规则的协同过滤推荐算法[J].小型微型计算机系统,2016,37(2):287-292. 被引量：15
9杨卫华.构建更开放的微博平台[J].程序员,2011(3):43-45.
10Google欲推新的身份认证技术[J].中国自动识别技术,2016,0(3):30-30.

电子科技大学学报

2015年第3期

浏览历史

内容加载中请稍等...

基于微博行为数据的不活跃用户探测被引量：2

参考文献11

二级参考文献26

共引文献63

同被引文献32

引证文献2

二级引证文献62

相关作者

相关机构

相关主题

浏览历史

基于微博行为数据的不活跃用户探测 被引量：2

参考文献11

二级参考文献26

共引文献63

同被引文献32

引证文献2

二级引证文献62

相关作者

相关机构

相关主题

浏览历史

基于微博行为数据的不活跃用户探测被引量：2