期刊文献+

Twitter数据采集方案研究 被引量:4

Research of Twitter data collection
原文传递
导出
摘要 为了能够实时、高效地获取Twitter数据,在分析了传统采集方法的缺陷后,提出了基于Twitter List API和Lookup API的用户数据采集方案。该方案通过对用户进行分类,进而精确控制API的调用频率。经在超过26万Twitter用户和600万条消息的一系列实验证明,通过两套方案的结合可以实现Twitter用户数据高效实时的获取。 In order to achieve real-time and efficient access to the data of Twitter,two different methods based on Twitter List API and Lookup API were presented after analyzing the shortcomings of traditional collection methods.By classi-fying users,this method can precisely control the frequency of calling API.A series of experiments on over 260,000 users and over 6 million messages were carried out,and the results show that the combination of the two methods can be efficiently used to collect Twitter data in real-time.
出处 《山东大学学报(理学版)》 CAS CSCD 北大核心 2012年第5期73-77,共5页 Journal of Shandong University(Natural Science)
基金 国家信息安全专项项目(2010F032) 国家"八六三"高技术研究发展计划基金项目(2010AA012500) 自然科学基金重点项目(60933005)
关键词 TWITTER LIST API LOOKUP API 数据采集 Twitter List API Lookup API data collection
  • 相关文献

参考文献8

  • 1Stina Westman, Luanne Freund. Information interaction in 140 characters or less: genres on twitter[ C]//Proceedings of the 3rd Symposium on Information Interaction in Context. New York: ACM Press, 2010: 323-328.
  • 2Pieter Noordhuis, Michiel Heijkoop, Alexander Lazovik. Mining Twitter in the cloud: a case study cloud computing (CLOUD) [ C ]//2010 IEEE 3rd International Conference on Cloud Computing. Washington: IEEE Computer Society, 2010:107-114.
  • 3Abraham Ronel, Martinez Teutle. Twitter: network properties analysis [ C ]//Proceedings of the 20th International Conference on Electronics, Communications and Computer. Washington : IEEE Computer Society, 2010:180-184.
  • 4Ahn Yong-Yeol, Han Seungyeop, Kwak Haewoon. Analysis of topological characteristics of huge online social networking services[ C ]//Proceedings of the 16th International Conference on World Wide Web. New York: ACM Press, 2007:835-844.
  • 5Haewoon Kwak, Changhyun Lee, Hosung Park, et al. What is Twitter, a social network or a news media? [ C ]//Proceedings of the 19th International Conference on World Wide Web. New York, NY, USA: ACM Press, 2010:591-600.
  • 6Daniel M Romero, Brendan Meeder, Jon Kleinberg. Differences in the mechanics of information diffusion, across topics : idioms, political hash tags, and complex contagion on Twitter[ C ]//Proceedings of the 20th International Conference on World Wide Web. New York: ACM Press, 2011:695-704.
  • 7WU Shaomei, Jake M Hofman. Who Says What to Whom on Twitter[ C]//Proceedings of the 20th Interna- tional Conference on World Wide Web. New York: ACM Press, 2011 : 705-714.
  • 8Carlos Castillo, Marcelo Mendoza, Barbara Poblete. Information credibility on Twitter [ C ]//Proceedings of the 20th international conference on World Wide Web. New York: ACM Press, 2011:675-684.

同被引文献26

引证文献4

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部