期刊文献+

跨模LDA融合的多模态数据主题分析方法

Multimodal data topic analysis method based on cross-modal LDA fusion
原文传递
导出
摘要 随着互联网的高速发展,社会大众可以通过网络对医疗事件以及医患关系自由地发表个人意见和观点言论,这对于引导公众正确的价值导向有着重大研究意义.然而,仅考虑单模态数据的主题分析算法不能精准地把握整个舆情事件的真相,存在主题提取不准确、个人情感先入为主等问题.提出一种基于LDA的多模态数据主题分析算法MD_LDA(multimodal data topic analysis based on LDA).通过对各模态主题分析结果进行决策级融合来计算多模态的主题分析结果,进而解决传统方法对多模态数据考虑不全面的缺陷.实验结果表明,针对多模态舆情事件,在主题词的提取效果上,所提出的MD_LDA算法优于单一模态数据进行主题分析的算法.而相对于传统的关键词提取算法TF_IDF与TextRank和MD_LDA算法的准确率以及主题词提取效率均有所提高,验证了结合多模态数据进行主题分析的MD_LDA算法的有效性. With the rapid development of the Internet,the public can freely express personal opinions on medical events and doctor-patient relationships through the Internet,which are of the correct value for guiding the public.Orientation has great research significance.However,the topic analysis algorithm that only considers single-modal data cannot accurately grasp the truth of the entire public opinion event,and there are problems such as inaccurate topic extraction and preconceived personal emotions.To solve this problem,this paper proposes a LDA-based multimodal data topic analysis algorithm,named MD_LDA(multimodal data topic analysis Based on LDA).The multimodal topic analysis is calculated by the decision-level fusion of the results of each modal topic analysis.As a result,it further solves the defect that traditional methods do not fully consider multimodal data.The experimental results show that for multimodal public opinion events,the proposed MD_LDA algorithm is better than the algorithm for topic analysis of single-modal data in terms of the extraction effect of topic words.Compared with the traditional keyword extraction algorithms TF_IDF and TextRank,the accuracy of the MD_LDA algorithm and the extraction efficiency of subject words are improved,which proves the effectiveness of the MD_LDA algorithm for subject analysis combined with multimodal data.
作者 赵越 郝琨 时彩云 解胜震 王之琼 信俊昌 ZHAO Yue;HAO Kun;SHI Cai-yun;XIE Sheng-zhen;WANG Zhi-qiong;XIN Jun-chang(College of Medicine and Biological Information Engineering,Northeastern University,Shenyang 110169,China;School of Computer Science and Engineering,Northeastern University,Shenyang 110169,China)
出处 《控制与决策》 EI CSCD 北大核心 2024年第4期1325-1332,共8页 Control and Decision
基金 国家自然科学基金项目(62072089) 中央高校基本科研业务费专项资金项目(N2116016,N2104001,N2019007) 东软集团股份有限公司开放课题项目(NCBETOP2102)。
关键词 主题分析 多模态 LDA主题模型 网络舆情 topic analysis multimodality LDA topic model network public opinion
  • 相关文献

参考文献6

二级参考文献36

  • 1Koike A, Takagi T. Classifying biomedical figures using combination of bag of keypoints and bag of words[C]//Internationai Conference on Complex, In- telligent and Software Intensive Systems. Kokubunji, Tokyo.. Dept of Comput Biol Ltd, 2009: 848-853.
  • 2Weizman L, Goldberger J. Detection of urban zones in satellite images using visual words[J]. Geoseience and Remote Sensing Symposium, 2008, 5 : 160-163.
  • 3Aucouturier J J. The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music[J]. Journal of Acoustical Society of America, 2007, 122 (2) : 881- 891.
  • 4Zeng Z, Zhang S. A novel approach to musical genre classification using probahilistic latent semantic analy- sis model [C]//Multimedia and Expo. New York: ICME, 2009:486-489.
  • 5Kastner T, Allamanche E. MPEG-7 scalable robust audio fingerprinting[C]//Fraunhofer Institute for Inte- grated Circuits IIS-A. Germany: Erlangen, 2002: 5511-5520.
  • 6Jiang Yu-gang, Ngo C W. Towards optimal bag-of- features for object categorization and semantic video retrieval[C]//Conference on Image and Video Retriev- al. New York: ACM, 2007: 494-501.
  • 7Yang Jun, Jiang Yu-gang. Evaluating bag-of-visual- words representations in scene classification[C]//Mul- timedia Information Retrieval. New York: ACM, 2007:197-206.
  • 8Gong Yu, Wang Wei-qiang. Detecting violent scenes in movies by auditory and visual cues[J]. Pacific Rim Conference on Multimedia, 2008,5353 : 317-326.
  • 9LI Juanzi FAN Qi'na ZHANG Kuo.Keyword Extraction Based on tf/idf for Chinese News Document[J].Wuhan University Journal of Natural Sciences,2007,12(5):917-921. 被引量:24
  • 10方俊,郭雷,王晓东.基于语义的关键词提取算法[J].计算机科学,2008,35(6):148-151. 被引量:39

共引文献36

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部