
论坛数据形式化表示技术研究 (Research on Formal Representation Techniques for Forum Data) Cited by: 2

Formal Representation of BBS Data
Abstract: With the growth of the Internet, online forums (BBS) have become an important place for people to express their opinions. Formal representation of forum data is a prerequisite for forum content mining. Based on the characteristics of forum data, the paper represents forum data with the vector space model (VSM) and proposes a feature-weight calculation method based on a multi-factor weighting strategy. Experimental results show that the method can effectively solve the problem of formally representing forum data.
Source: Journal of Information Engineering University (《信息工程大学学报》), 2011, No. 6, pp. 734-737, 744 (5 pages)
Funding: National Social Science Fund of China project (09&ZD014)
Keywords: BBS (forum); text representation; vector space model; multi-factor weight
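
The abstract names the vector space model and a multi-factor weighting strategy but does not state the factors or the formula. The following is only a minimal sketch of the general idea, assuming a single hypothetical factor (the field a term appears in, with made-up values in `FIELD_FACTOR`) multiplied into a standard TF-IDF weight. The function names `build_vectors` and `cosine` are illustrative and do not come from the paper.

```python
import math
from collections import Counter

# Hypothetical factor values: the abstract does not list the paper's actual factors.
FIELD_FACTOR = {"title": 2.0, "body": 1.0}


def build_vectors(posts):
    """Turn each post (dict: field name -> list of tokens) into a unit-length
    sparse vector (dict: term -> weight)."""
    n = len(posts)

    # Document frequency: in how many posts each term occurs, in any field.
    df = Counter()
    for post in posts:
        terms = set()
        for tokens in post.values():
            terms.update(tokens)
        df.update(terms)

    vectors = []
    for post in posts:
        weights = Counter()
        for field, tokens in post.items():
            factor = FIELD_FACTOR.get(field, 1.0)
            for term, tf in Counter(tokens).items():
                # Field factor * term frequency * inverse document frequency.
                idf = math.log(1.0 + n / df[term])
                weights[term] += factor * tf * idf
        # L2-normalise so that a dot product equals cosine similarity.
        norm = math.sqrt(sum(w * w for w in weights.values()))
        vectors.append({t: w / norm for t, w in weights.items()} if norm else {})
    return vectors


def cosine(u, v):
    """Cosine similarity of two unit-length sparse vectors."""
    return sum(w * v.get(t, 0.0) for t, w in u.items())


posts = [
    {"title": ["vsm"], "body": ["forum", "data"]},
    {"title": ["forum"], "body": ["vsm", "data"]},
]
vectors = build_vectors(posts)
similarity = cosine(vectors[0], vectors[1])
```

With these assumed factors, a term in the title contributes twice as much as the same term in the body, so the two example posts end up with different vectors even though they contain the same words.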