跨模LDA融合的多模态数据主题分析方法

Multimodal data topic analysis method based on cross-modal LDA fusion

导出

摘要随着互联网的高速发展,社会大众可以通过网络对医疗事件以及医患关系自由地发表个人意见和观点言论,这对于引导公众正确的价值导向有着重大研究意义.然而,仅考虑单模态数据的主题分析算法不能精准地把握整个舆情事件的真相,存在主题提取不准确、个人情感先入为主等问题.提出一种基于LDA的多模态数据主题分析算法MD_LDA(multimodal data topic analysis based on LDA).通过对各模态主题分析结果进行决策级融合来计算多模态的主题分析结果,进而解决传统方法对多模态数据考虑不全面的缺陷.实验结果表明,针对多模态舆情事件,在主题词的提取效果上,所提出的MD_LDA算法优于单一模态数据进行主题分析的算法.而相对于传统的关键词提取算法TF_IDF与TextRank和MD_LDA算法的准确率以及主题词提取效率均有所提高,验证了结合多模态数据进行主题分析的MD_LDA算法的有效性. With the rapid development of the Internet,the public can freely express personal opinions on medical events and doctor-patient relationships through the Internet,which are of the correct value for guiding the public.Orientation has great research significance.However,the topic analysis algorithm that only considers single-modal data cannot accurately grasp the truth of the entire public opinion event,and there are problems such as inaccurate topic extraction and preconceived personal emotions.To solve this problem,this paper proposes a LDA-based multimodal data topic analysis algorithm,named MD_LDA(multimodal data topic analysis Based on LDA).The multimodal topic analysis is calculated by the decision-level fusion of the results of each modal topic analysis.As a result,it further solves the defect that traditional methods do not fully consider multimodal data.The experimental results show that for multimodal public opinion events,the proposed MD_LDA algorithm is better than the algorithm for topic analysis of single-modal data in terms of the extraction effect of topic words.Compared with the traditional keyword extraction algorithms TF_IDF and TextRank,the accuracy of the MD_LDA algorithm and the extraction efficiency of subject words are improved,which proves the effectiveness of the MD_LDA algorithm for subject analysis combined with multimodal data.

作者赵越郝琨时彩云解胜震王之琼信俊昌 ZHAO Yue;HAO Kun;SHI Cai-yun;XIE Sheng-zhen;WANG Zhi-qiong;XIN Jun-chang(College of Medicine and Biological Information Engineering,Northeastern University,Shenyang 110169,China;School of Computer Science and Engineering,Northeastern University,Shenyang 110169,China)

机构地区东北大学医学与生物信息工程学院东北大学计算机科学与工程学院

出处《控制与决策》 EI CSCD 北大核心 2024年第4期1325-1332,共8页 Control and Decision

基金国家自然科学基金项目(62072089) 中央高校基本科研业务费专项资金项目(N2116016,N2104001,N2019007) 东软集团股份有限公司开放课题项目(NCBETOP2102)。

关键词主题分析多模态 LDA主题模型网络舆情 topic analysis multimodality LDA topic model network public opinion

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献6

1王涛,李明.改进的关键词提取算法研究[J].重庆师范大学学报（自然科学版）,2019,36(3):98-104. 被引量：8
2蒋雨肖,丁晟春,吴鹏.基于BiLSTM-VGG16的多模态信息特征分类研究[J].情报理论与实践,2021,44(11):180-186. 被引量：15
3龚志,邵曦.基于多模态的音乐推荐系统[J].南京信息工程大学学报（自然科学版）,2019,11(1):68-76. 被引量：3
4吴悦,雒江涛,刘锐,胡钟尹.基于感知哈希和切块的视频相似度检测方法[J].计算机应用,2021,41(7):2070-2075. 被引量：3
5李荣杰,蒋兴浩,孙锬锋.一种基于音频词袋的暴力视频分类方法[J].上海交通大学学报,2011,45(2):214-218. 被引量：4
6冯霞,胡志毅,刘才华.跨模态检索研究进展综述[J].计算机科学,2021,48(8):13-23. 被引量：9

二级参考文献36

1Koike A, Takagi T. Classifying biomedical figures using combination of bag of keypoints and bag of words[C]//Internationai Conference on Complex, In- telligent and Software Intensive Systems. Kokubunji, Tokyo.. Dept of Comput Biol Ltd, 2009: 848-853.
2Weizman L, Goldberger J. Detection of urban zones in satellite images using visual words[J]. Geoseience and Remote Sensing Symposium, 2008, 5 : 160-163.
3Aucouturier J J. The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music[J]. Journal of Acoustical Society of America, 2007, 122 (2) : 881- 891.
4Zeng Z, Zhang S. A novel approach to musical genre classification using probahilistic latent semantic analy- sis model [C]//Multimedia and Expo. New York: ICME, 2009:486-489.
5Kastner T, Allamanche E. MPEG-7 scalable robust audio fingerprinting[C]//Fraunhofer Institute for Inte- grated Circuits IIS-A. Germany: Erlangen, 2002: 5511-5520.
6Jiang Yu-gang, Ngo C W. Towards optimal bag-of- features for object categorization and semantic video retrieval[C]//Conference on Image and Video Retriev- al. New York: ACM, 2007: 494-501.
7Yang Jun, Jiang Yu-gang. Evaluating bag-of-visual- words representations in scene classification[C]//Mul- timedia Information Retrieval. New York: ACM, 2007:197-206.
8Gong Yu, Wang Wei-qiang. Detecting violent scenes in movies by auditory and visual cues[J]. Pacific Rim Conference on Multimedia, 2008,5353 : 317-326.
9LI Juanzi FAN Qi＇na ZHANG Kuo.Keyword Extraction Based on tf/idf for Chinese News Document[J].Wuhan University Journal of Natural Sciences,2007,12(5):917-921. 被引量：24
10方俊,郭雷,王晓东.基于语义的关键词提取算法[J].计算机科学,2008,35(6):148-151. 被引量：39

共引文献36

1沙尔旦尔·帕尔哈提,阿布都热合曼·卡的尔,阿力木江·亚森.多字体印刷体维-哈-柯文关键词图像识别[J].计算机科学,2022,49(S02):615-620. 被引量：1
2李建平.手法治疗骶髂关节错缝52例[J].按摩与导引,2000,16(3):52-53.
3彭太乐,张文俊,丁友东,郭桂芳.基于时序上下文的视频场景分类[J].计算机工程与应用,2014,50(9):103-106. 被引量：2
4谷学汇.基于信息融合算法的暴力视频内容识别[J].济南大学学报（自然科学版）,2019,33(3):224-228. 被引量：4
5李伟,李硕.理解数字声音——基于一般音频/环境声的计算机听觉综述[J].复旦学报（自然科学版）,2019,58(3):269-313. 被引量：30
6丁祎姗,杜彦辉,朱衍丞,聂世民.基于知识图谱的国内关键词抽取技术研究[J].软件导刊,2020,19(2):273-277. 被引量：6
7洛桑嘎登,仁增多杰,索南尖措,才让叁智,布加.藏文问句分类及关键词提取[J].电子技术与软件工程,2020(6):126-127. 被引量：3
8张亚娜,高子婷,胡溢,杨成.融媒体新闻生产中的中文评论关键词提取[J].人工智能,2020(2):57-66. 被引量：4
9陈志泊,李钰曼,许福,冯国明,师栋瑜,崔晓晖.基于TextRank和簇过滤的林业文本关键信息抽取研究[J].农业机械学报,2020,51(5):207-214. 被引量：15
10熊华煜,余勤,任品,雒瑞森.基于机器学习的音频分类[J].计算机工程与设计,2021,42(1):156-160. 被引量：1

1张玉莹,朱广丽,张友强,孙争艳,张顺香.基于情感信息预处理和Bi-GRU的虚假评论识别模型[J].广西科学,2023,30(1):169-176. 被引量：2
2赵晓翠,康昭,田玲,惠孛,曾曦.基于组合赋权的暴恐转向风险预测研究[J].广西科学,2023,30(1):89-99. 被引量：2
3冯余.单元整理课的设计问题及对策——以七年级上册第二单元整理课为例[J].中学语文,2024(10):103-106.
4罗骏,庞建华.“互联网+”双创大赛信息推荐集成模型研究[J].科技创业月刊,2024,37(2):69-73.
5宋阳.基于LoRa通信技术的智能校园信息控制平台建设[J].长江信息通信,2023,36(9):115-117.
6王加桂.大单元视域下文学阅读与创意表达学习任务群的情境创设[J].小学教学参考,2024(10):39-41.
7Jielin Feng,Kehao Wu,Siming Chen.TopicBubbler:An interactive visual analytics system for cross-level fine-grained exploration of social media data[J].Visual Informatics,2023,7(4):41-56.
8高浩容,图虫创意(题图).后疫情时代:修复裂痕回归信任[J].现代家庭（下半月）,2023(3):50-51.
9类成阳.关于中国学术语境下“教育史学”概念使用的考察[J].教育学文摘,2023(2):99-100.
10王梓衡,沈继锋,左欣,武小红,孙俊.基于特征级与决策级融合的农作物叶片病害识别[J].江苏大学学报（自然科学版）,2024,45(3):286-294. 被引量：1

控制与决策

2024年第4期

浏览历史

内容加载中请稍等...

跨模LDA融合的多模态数据主题分析方法

参考文献6

二级参考文献36

共引文献36

相关作者

相关机构

相关主题

浏览历史