摘要
目的 应用LDA模型及其变形对PUBMED数据库中埃博拉病毒相关文献摘要进行主题识别并分类,了解埃博拉病毒研究文献的领域分布及发展趋势。方法 以PUBMED数据库中含有摘要信息的文献为数据源,使用Python、Endnote 20.1.0等工具,借助LDA模型及其变形对文献进行主题识别与演化分析。结果 应用LDA模型识别出监测预警、感染机制、影响因素、检测诊断、临床治疗、疫苗及致病机制7类主题;得到检测诊断、感染机制及疫苗研发领域主题词随时间发展频率变化。结论 疫情是埃博拉病毒研究的主要驱动力,疫情发生后本领域发文量迅速上升并在疫情消失后回归,预测未来发文量将随着疫情再暴发继续上升;预计未来研究将主要围绕疫情防控以及临床转化开展,而基础研究仍具有巨大发展空间。
s in PUBMED database were selected as data sources,and the topic identification and evolution analysis were carried out by using Python,Endnote 20.1.0 and other tools with the help of LDA model and its deformation.Results LDA model was used to identify 7 themes including surveillance and early warning,infection mechanism,influencing factors,detection and diagnosis,clinical treatment,vaccine and pathogenic mechanism.The frequency changes of detection and diagnosis,infection mechanism and vaccine research and development subject words over time were obtained.Conclusions The epidemic is the main driving force of Ebola virus research.The number of published papers in this field increased rapidly after the outbreak of the epidemic and returned after the epidemic disappeared.It is predicted that the number of published papers will continue to increase with the recurrence of the epidemic in the future.Future research will focus on epidemic prevention and clinical transformation,while basic research still has huge room for development.
作者
杨启帆
朱志华
王磊
YANG Qifan;ZHU Zhihua;WANG Lei(Institute of Health Service and Transfusion Medicine,Academy of Military Medical Sciences,Academy of Military sciences,Beijing 100850,China.)
出处
《预防医学情报杂志》
CAS
2023年第11期1409-1414,共6页
Journal of Preventive Medicine Information
关键词
埃博拉病毒
LA模型
主题识别
演化分析
Ebolavirus
The LDA model
topics identification
evolutionary analysis.