期刊文献+

基于PLSA的即时通信取证方法

A PLSA Based Forensic Method of Instant Messenger
原文传递
导出
摘要 面对大量繁杂的即时通信数据,司法取证人员很难快速从中找到与案件相关的数据.本文提出一种基于PLSA(probability latent semantic analysis)算法的即时通信取证方法,即利用PLSA算法进行主题挖掘,快速获取与案件相关的可疑数据.通过建立自定义词库和动态调整词库中词项的矢量权重,提高PLSA算法主题挖掘的准确性,对聊天会话中主题的矢量值进行可视化.实验结果表明,该方法的准确率,召回率及F1值比单纯用PLSA算法都有提高. Because of amounts of miscellaneous instant message(IM)data,the data related to the case can't be found quickly by judicial forensic.A method of IM forensics based PLSA(probability latent semantic analysis)algorithm was presented in this paper.Using PLSA algorithm,the topic was mined to get suspicious crime-related data rapidly.By creating custom thesaurus and adjusting the weight vector of term dynamically,we can improve the accuracy of PLSA algorithm in topic mining and visualize the vector value of topic.The experiments showed the method is feasibility and accuracy.
出处 《武汉大学学报(理学版)》 CAS CSCD 北大核心 2016年第2期122-126,共5页 Journal of Wuhan University:Natural Science Edition
基金 国家自然科学基金资助项目(60903220) 郑州市科技攻关项目(10PTGG341-5)
关键词 即时通信 取证 主题挖掘 PLSA算法 矢量权重 instant message forensics topic mining PLSA(probability latent semantic analysis)algorithm vector weight
  • 相关文献

参考文献1

二级参考文献16

  • 1张玉芳,彭时名,吕佳.基于文本分类TFIDF方法的改进与应用[J].计算机工程,2006,32(19):76-78. 被引量:120
  • 2宋惟然.中文文本分类中的特征选择和权重计算方法研究[D].北京:北京工业大学,2013.
  • 3Salton G, McGill M J. Introduction to Modem Information Retrieval[M]. McGraw-Hill, 1983.
  • 4Luhn H P. Auto-encoding of Documents for Information Re- trieval Systems [ M ]// Modem Trends in Documentation. New York: Pergamon Press, 1959:68-95.
  • 5Salton G, Wong A, Yang C S. A vector space model for automate indexing[ J ]. Communications of ACM, 1975,18 ( 11 ) :613-620.
  • 6Lewis D D. Naive Bayes at forty: The independence assump- tion in information retrieval [ C ]// Proceedings of the lOth European Conference on Machine Learning. 1998:4-15.
  • 7Hsu C, Lin C. A comparison on methods for multi-class support vector machines[ J]. IEEE Transactions on Neural Networks, 2002,13 (2) :415-425.
  • 8候敏.计算语言学与汉语自动分析[M].北京:北京广播学院出版社,1999.
  • 9Salton G. On the construction of effective vocabularies for information retrieval[ C ]// Proceedings of the 1973 Meet- ing on Programming Languages and Information Retrieval. 1973 : 48-60.
  • 10Cohen W, Singer Y. Context-sensitive learning methods for text categorization [ J ]. ACM Trans. Information Systems, 1996,17 (2) : 146-173.

共引文献25

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部