摘要
针对互联网数据快速增长和舆情信息飞速传播的问题,提出一种基于大数据的网络舆情分析系统。该系统包括数据采集、预处理、分析和报告汇总四个模块,实现舆情信息的全网自动搜索与采集,大规模舆情数据的格式化存储以及舆情信息的分析、统计汇总等功能。该系统还使用Hadoop平台进行数据处理,并使用HDFS分布式文件系统存储舆情数据,使用MapReduce技术完成舆情分析和报告。仿真结果表明,该系统有助于及时、准确地分析网络舆情,能较好地满足网络舆情分析的需求。
In allusion to the rapid growth of Internet data and the rapid spread of public opinion information, a network public opinion analysis system based on big data is proposed. Four modules of data collection, preprocessing, analysis and re- port aggregation are included in the system to realize the automatic search and collection of the overall network public opinion in- formation, the formatted storage of large-scale public opinion data, and the analysis and statistical summary of public opinion in- formation. In the system, the Hadoop platform is used for data processing, the HDFS distributed file system is used to store pub- lic opinion data, and the MapReduce technology is used to complete public opinion analysis and report. The simulation results show that the system can help analyze network public opinion timely and accurately, and meet the requirement of network pub- lic opinion analysis well.
出处
《现代电子技术》
北大核心
2017年第24期15-17,共3页
Modern Electronics Technique