期刊文献+

基于SVM算法的微博用户识别和分类研究

Research on MicroBlog User Recognition and Classification Based on SVM Algorithm
下载PDF
导出
摘要 基于广受欢迎的微博平台,利用新浪微博提供的API接口及网络爬虫技术从微博中提取用户数据,通过支持向量机算法(SVM)将微博用户分为水军用户和非水军用户两类。再利用改进的支持向量机算法(SVM)从大量的用户数据中提取特征值,实现多分类支持向量机模型,将用户分为正常用户、炒作型水军、营销型水军、谣言型水军四类。研究结果表明,构建的模型可以较为准确地识别出用户的类型,识别误差率较低。 Based on the popular MicroBlog platform,the user data is extracted from MicroBlog by using the API interface provided by Sina MicroBlog and Web crawler technology,and the MicroBlog users are divided into two categories of water army users and non water army users by support vector machine (SVM) algorithm.Then the improved support vector machine (SVM) algorithm is used to extract feature values from a large number of user data to realize a multi classification support vector machine model.Users are divided into four categories:normal users,hyped water army,marketing water army and rumor water army.The study results show that the constructed model can accurately identify the types of users,and the recognition error rate is low.
作者 李新焕 黄伟力 LI Xinhuan;HUANG Weili(Jiangxi Engineering Vocational College of Jiangxi Open University,Nanchang 330046,China)
出处 《现代信息科技》 2022年第16期107-109,共3页 Modern Information Technology
基金 江西省教育厅科技项目(GJJ205702)。
关键词 新浪微博 特征提取 网络爬虫 支持向量机算法 识别误差率 Sina MicroBlog feature extraction Web crawler SVM algorithm recognition error rate
  • 相关文献

参考文献8

二级参考文献76

  • 1郑智斌,邓兰花.网络个人信源及其可信度分析[J].情报理论与实践,2008,31(6):857-859. 被引量:8
  • 2韩忠明,许峰敏,段大高.面向微博的概率图水军识别模型[J].计算机研究与发展,2013,50(S2):180-186. 被引量:10
  • 3周树德,孙增圻.分布估计算法综述[J].自动化学报,2007,33(2):113-124. 被引量:209
  • 4WenE,SunV新浪微博研究报告[EB/OL].[2011-05-20].http://www.techweb.corn.cn.data/2011-02-25/916941.shtml.
  • 5Zi Chu, Gianvecchio S, Wang Haining, et al. Detecting Automation of Twitter Accounts: Are You a Human, Bot, or Cyborg?[J]. Dependable and Secure Computing, 2010, 9(6): 811-824.
  • 6Charles-Antoine J, John E, France B. Controlled User Evaluations of Information Visualization Interfaces for Text Retrieval: Literature Review and Meta-analysis[J]. Journal of the American Society for Information Science and Technology, 2008, 59(6): 1012-1024.
  • 7Haruechaivasak C J, Wittawat S. Implementing News Article Category Browsing Based on Text Categorization Technique[C]//Proc. of IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology. Washington D. C., USA: Is. n.], 2008: 143-146.
  • 8Jang N, Peter S, Michal H. Conditional Mutual Information Based Feature Task[C]//Proc. of CIARP'07 Verlag, 2007: 417-426. Selection for Classification Berlin, Germany: Springer-.
  • 9Mishra S, Bhende C N, Panigrahi B K. Detection and Classification of Power Quality Disturbances Using S-transform and Probabilistic Neural Network[J]. Power Delivery, 2008, 23(1): 280-287.
  • 10Abdeslam D O, Wira P, Merckle J, et al. A Unified Artificial Neural Network Architecture for Active Power Filters[J]. Industrial Electronics, 2007, 54(1): 61-76.

共引文献42

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部