
Micro-blogging Hotspot Discovery Method Based on Centralization (基于中心化的微博热点发现方法)

Cited by: 17
Abstract: To address the massive volume of fragmented information on micro-blogging platforms, this paper designs a centralization-based mechanism for discovering micro-blog hotspots, taking into account the features of micro-blog content: short text, wide-ranging sources, and diverse means of dissemination. Using the structured metadata recorded through the platforms' open APIs, it designs a metadata model for micro-blogs and treats hotspot discovery as a value-adding production process that turns a raw corpus into clusters of hotspot content. For initial processing, it designs a method centered on data pre-processing techniques; for deep processing, it designs centralization methods based on short-text clustering and on dissemination paths and user behavior. Together these form a complete production and processing model, and the theoretical results are validated through a practical example.
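Source: Chinese Journal of Management (《管理学报》), indexed in CSSCI and the Peking University Core Journals list, 2012, No. 6, pp. 874-879 (6 pages)
Funding: National Natural Science Foundation of China (71071066); Ministry of Education Humanities and Social Sciences Research Project (11YJA630098)
Keywords: hotspot discovery; micro-blogging; centralization; metadata model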


