摘要
目前基于病历的疾病特征提取方法有很多,但是这些方法存在病历数据离散化程度高、描述语言标准不统一、疾病特征的提取困难等问题。鉴于此,提出一种基于LDA模型的疾病特征识别方法。该方法能同时建模病历、疾病、特征三者之间的关系,得出病历-疾病和疾病-特征两个分布矩阵,从而达到疾病特征识别的目的。实验表明该方法的疾病特征识别准确率高于ID3算法和C4.5算法,达到了良好的疾病特征识别效果。
There are many ways of extracting disease characteristics based on disease records, but these ways have many problems such as high discrete of disease record data, non - unified description language and difficulty in extracting disease features, etc.. How to i- dentify disease features from electronic medical records to assist doctors in diagnosing diseases is an important research in the field of computer - aided diagnosis. This paper proposes a LDA model - based disease feature recognition method, which can model medical re- cords, diseases and characteristics at the same time, and obtain the two distributions of disease record - disease and disease - feature so as to achieve the goal of disease feature recognition. Experiments show that the method has higher recognition accuracy than ID3 algo- rithm and C4.5 algorithm, and achieves good feature recognition results.
出处
《新余学院学报》
2017年第3期23-26,共4页
Journal of Xinyu University
基金
安徽省教育厅自然科学一般项目"安全关键嵌入式系统安全性评价关键技术研究"(KJ2015B061by)
安徽省高校人文社科重点项目"基于对比挖掘的医疗卫生网络舆情的发现
跟踪及倾向性分析"(sk2015A405)
安徽省高校人文社科重点项目"皖北地区独居老人身心健康状况调查与多维度关联分析"(sk2016A0607)
关键词
LDA模型
电子病历
辅助诊断
疾病特征
LDA Model
electronic medical records
auxiliary diagnosis
disease feature