期刊文献+

面向社会文本流数据探测爆发主题方法浅析

A Survey of Burst Topic Detection Towards Social Text Stream Data
原文传递
导出
摘要 社会文本流数据富含上下文环境信息、语言不规范且参与用户数量庞大。针对这类数据开展爆发主题探测需要寻找新的思路。本文对社会文本流数据的概念、特点以及爆发主题表达形式进行系统性梳理,从文本内容、时间、社会三个维度阐述探测爆发主题的主要研究思路和基本流程,分析利用社会特征(如用户参与、上下文环境、社团结构)进行爆发主题探测的主要技术方法。 Social text streams have rich contextual information and huge participants who communicate with informal steams. It needs to find suitable solutions to detect burst topics from this kind of data. In this paper, the authors comb through the concepts, the characteristics of social text stream data and the presentation forms of burst topics. It also sum- marizes the main research ideas and the basic procedures of burst topic detection towards social text stream data in three dimensions : textual content, social, and temporal. The principal approaches to make use of social features, such as user participation, social context and community structure evolution, for burst topic detection are generally discussed.
作者 乐小虬 洪娜
出处 《现代图书情报技术》 CSSCI 北大核心 2012年第10期21-27,共7页 New Technology of Library and Information Service
基金 国家社会科学基金项目"网络科技信息中爆发主题的监测与分析方法研究"(项目编号:09BTQ035)的研究成果之一
关键词 社会文本流 爆发主题探测 社会网络 Social text stream Burst topic detection Social network
  • 相关文献

参考文献22

  • 1Matsumura N, Goldberg D E, Llora X. Mining Directed Social Network from MessageBoard [ OL ]. [ 2012 - 09 - 20 ]. http ://de- livery, acre. org/10.1145/1070000/1062884/p1092 - matsumura. pdf? ip = 159. 226. 100. 225&acc = ACTIVE%20SERVICE&CFID = l18941016&CFTOKEN = 80199382& acre = 1348453917_ c173 fc996f56a0c611739bal 00e23712.
  • 2Mathioudakis M, Koudas N. TwitterMonitor: Trend Detection over the Twitter Stream [ OL ]. [ 2012 - 03 - 11 ]. http://delivery. acre. org/10. 1145/1810000/1807306/pl 155 - mathioudakis, pdf? ip = 159. 226. 100. 225&CFID = 35057597&CFTOKEN = 95005305& acm =1310711607_fd4454ee954f38cla4c767c8b 5047820.
  • 3Zhao Q, Mitra P, Chen B. Temporal and Information Flow Baseq Event Detection fk'om Social Text Streams [ C ] In : Proceedings the 22nd AAAI Conference on Artificial Intelligence. 2007:1501 - 1506.
  • 4Kleinberg J. Bursty and Hierarchical Structure in Streams[ J]. Da- ta Mining and Knowledge Discovery, 2003,7(4) : 373 -397.
  • 5Fujiki T, Nanno T, Suzuki Y, et al. Identification of Bursts in a Document Stream[ OL]. [ 2012 - 10 - 12 ]. http ://citeseerx. ist. psu. edu/viewdoc/download? doi = 10.1.1. 106. 6773.
  • 6Chen Y, Yang S, Cheng X Q. Bursty Topics Extraction for Web Fo- rums [ C/OL ]. In : Proceedings of the 11 th International Workshop on Web Information and Data Management. 2009 : 55 - 58. [ 2012 -03 -09 ]. http ://portal. acre. org/citation, cfm? id = 1651587. 1651600&coll = DL&dl = ACM&CFID = 19983188&CFTOKEN = 89593705.
  • 7Yao J, Cui B, Huang Y, et al. Temporal and Social Context Based Burst Detection from Folksonomies [ C ] In: Proceedirrgs of the 24th AAAI Conference on Artificial Intelligence. 2010 : 1474 - 1479.
  • 8陈友,程学旗,杨森.面向网络论坛的突发话题发现[J].中文售息学报,2010,24(3):29-36.
  • 9徐戈,王厚峰.自然语言处理中主题模型的发展[J].计算机学报,2011,34(8):1423-1436. 被引量:233
  • 10Diao Q, Jiang J, Zhu F, et al. Finding Bursty Topics from Microb- logs[ OL]. [2012 - 10 - 02 ]. http ://www. mysmu, edu/faculty/ jingjiang/papers/ACL' 12. pdf.

二级参考文献78

  • 1Del Corso G M and Gulli A,et al..Ranking a stream of news[C].Proceedings of the 14th international conference on World Wide Web,Chiba,Japan,May 10-14,2005:97-106.
  • 2He T T,Qu G Z,and Li S W,et al..Semi-automatic hot event detection[C].Proceedings of the 2nd International Conference on Advanced Data Mining and Applications 2006,LNAI 4093:1008-1016.
  • 3Yao J Y,Wang J,and Li Z W,et al..Ranking web news via homepage visual layout and cross-site voting[C].Proceedings of the 28th annual European Conference on Information Retrieval,2006,LNCS 3936:131-142.
  • 4Hu Y,Li M J,and Li Z W,et al..Discovering authoritative news sources and top news stories[C].Proceedings of 3rd Asia Information Retrieval Symposium,2006,LNCS 4182:230-243.
  • 5Chen K Y,Luesukprasert Luesak,and Chou Seng-cho T.Hot topic extraction based on timeline analysis and multidimensional sentence modeling[J].IEEE Transactions on Knowledge and Data Engineering,2007,19(8):1016-1025.
  • 6Wang C H,Zhang M,and Ru L Y,et al..Automatic online news topic ranking using media focus and user attention based on aging theory[C].Proceeding of the 17th ACM conference on Information and knowledge management,Napa Valley,California,USA,October 26–30,2008:1033-1042.
  • 7Liu Y T,Gao B,and Liu T Y,et al..BrowseRank:Letting Web users vote for page importance[C].Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval,Singapore,July 20-24,2008:451-458.
  • 8Ma H,Yang H X,and King Irwin,et al..Learning latent semantic relations from clickthrough data for query suggestion[C].Proceeding of the 17th ACM Conference on Information and Knowledge Management,Napa Valley,California,USA,October 26–30,2008:709-718.
  • 9Yang H X,King Irwin,and Lyu Michael R.DiffusionRank:A possible penicillin for Web spamming[C].Proceedings of the 30th annual international ACM SIGIR conference on Research and Development in Information Retrieval,Amsterdam,The Netherlands,July 23-27,2007:431-438.
  • 10Song X D,Chi Y,and Hino Koji,et al..Information flow modeling based on diffusion rate for prediction and ranking[C].Proceedings of the 16th international conference on World Wide Web,Banff,Alberta,Canada,May 8-12,2007:191-200.

共引文献237

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部