期刊文献+

重大公共卫生事件中的舆情分析方法研究——以新冠肺炎疫情为例 被引量:17

Research on Public Opinion Analysis Methods in Major Public Health Events: Take COVID-19 Epidemic as an Example
原文传递
导出
摘要 2019年12月以来,新冠肺炎疫情迅速席卷全球,截至北京时间2020年5月10日16时40分,全球累计确诊病例4 115 662例,已成为全球聚焦的主要话题。微博等社交媒体平台成为此次疫情相关信息传播的重要渠道和公众情绪的有效传感器之一。对微博信息进行深入挖掘分析不但能研判舆情特点,更有助于政府对公众的情绪进行针对性疏导,合理管控舆情。因此,本文采集了2020年1月18日到2020年1月28日期间关于新冠肺炎的33万余条新浪微博数据,基于Louvain和Kmeans的空间聚类、改进的BTM主题词提取等算法,将用户关注热点信息和情感特征作为地域标签,构建了反映情感特征、地域关联与热点关注在内的舆情评价方法,实现了基于位置的信息融合,能够分析不同区域的舆情特点与关注主题差异。研究表明:基于BERT词向量的BTM主题词提取方法可以有效弥补传统主题词提取的计算量大、数据冗余等缺点,在热点挖掘时具有更强的表达能力;不同区域关注热点具有一定的差异性,结合省级、市级及基于Louvain-Kmeans的空间聚类的多尺度舆情分析方法,可以全方位展现不同区域舆情特点。本文提出的舆情分析方法可以有效反映不同区域的舆情特征,为重大公共卫生事件的舆情分析提供参考。 Since December 2019, COVID-19 has rapidly swept the world. As of May 10, 2020, 16:40 PM, Beijing time, the global confirmed COVID-19 cases reached 4,115,662, which has become a major global issue. Social media platforms such as microblog have become the important channel for information transmission and an effective sensor of public sentiment. In-depth mining and analysis of microblog information can not only characterize the public opinion, but also help the government to conduct targeted guidance on public sentiment and properly control public opinion. Therefore, this study collected more than 330,000 Sina Weibo data about COVID-19 from January 18, 2020 to January 28, 2020. Based on the spatial clustering method using Louvain and Kmeans and an improved BTM subject word extraction algorithm, users’ attention information and emotional characteristics are labeled with their locations. Thus, the evaluation method of public opinion is constructed by integrating user’s location information, which is able to analyze the characteristics of public opinion and the difference in the topics concerned at different regions. Our results show that the characteristics of public opinion in different regions can be comprehensively evaluated using the spatial clustering method based on Louwain and Kmean. The BTM subject word extraction method based on BERT word vector can effectively make up the disadvantages of traditional subject word extraction methods that need large computation and have data redundancy,and thus has stronger expression ability in user data mining. The hot topics concerned in different regions have certain differences. The public opinion analysis method proposed in this paper can effectively reflect the public opinion characteristics of different regions and provide reference for the public opinion analysis of major public health events.
作者 韩珂珂 邢子瑶 刘哲 刘峻明 张晓东 HAN Keke;XING Ziyao;LIU Zhe;LIU Junming;ZHANG Xiaodong(College of Land Science and Technology,China Agriculture University,Beijing 100083,China;Key Laboratory of Remote Sensing for Agri-Hazards,Ministry of Agriculture and Rural Affairs,Beijing 100083,China)
出处 《地球信息科学学报》 CSCD 北大核心 2021年第2期331-340,共10页 Journal of Geo-information Science
基金 国家重点研发计划项目(2018YFC1508901-3)。
关键词 新冠肺炎 微博 情感分析 空间聚类 舆情 主题词提取 热点挖掘 爬虫 COVID-19 Weibo sentiment analysis spatial clustering public opinion subject word extraction hot mining the crawler
  • 相关文献

参考文献13

二级参考文献161

  • 1李健,曹垚,王宗敏,王广印.融合k-means聚类和Hausdorff距离的散乱点云精简算法[J].武汉大学学报(信息科学版),2020,45(2):250-257. 被引量:16
  • 2朱嫣岚,闵锦,周雅倩,黄萱菁,吴立德.基于HowNet的词汇语义倾向计算[J].中文信息学报,2006,20(1):14-20. 被引量:325
  • 3刘毅.略论网络舆情的概念、特点、表达与传播[J].理论界,2007(1):11-12. 被引量:311
  • 4刘毅.简析舆情变动规律[J].天津社会科学,2007(3):63-65. 被引量:28
  • 5Juha M.Investigations on Event Evolution in TDT[C].Proceed-ings of the 2003 Conference of the North American Chapter ofthe Association for Computational Linguistics on Human Lan-guage Technology.Edmonton,Canada,2003.
  • 6Blei DM,Ng A Yj Jordan M I.Latent Dirichletal Allocation[J].Journal of Machine Learning Research,2003(3); 993-1022.
  • 7Wang X R,McCallum A.Topic Over Time:a Non-Markov Con-tinuous-time Model of Topical Trends[C].Proceedings of the12th ACM SIGKDD International Conference on Knowledge Dis-covery and Data Mining,Philadelphia,PA,USA,2006:424-433.
  • 8Blei D M,Lafferty J D.Dynamic Topic Model[C].Proceedingsof the 23 rd International Conference on Machine Learning,Pitts-burgh,Pennsylvania,2006:113-120.
  • 9Wei X,Sun J,Wang X.Dynamic Mixture Models for MultipleTime Series[C].Proceedings of the 20th International JointCon-ference on Artifical Intelligence,Hyderabad,India,2007:2909-2914.
  • 10Nallapati R M,Cohen W,Ditmore S,et.al.Multi-scale TopicTomography[C].Proceedings of the 13th ACM SIGKDD Inter-national Conference on Knowledge Discovery and Data Mining,San Jose,California,USA,2007:520-529.

共引文献243

同被引文献266

引证文献17

二级引证文献132

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部