Algorithmic Research on Automatic Coding of Clinical Diagnoses Based on Semantic Similarity Calculation (基于语义相似度计算的临床诊断自动编码算法研究)
Cited by: 11
Abstract: This paper proposes an algorithm that automatically assigns ICD-10 codes to clinical diagnoses written in Chinese. It uses a distributional semantic similarity method to compute the semantic similarity between texts. To account for the characteristics of the Chinese language, vectors are built not only from words but also from individual Chinese characters, and the effect of each choice on precision and recall is tested. The results show that the algorithm achieves relatively high accuracy on the test set.
Authors: 宁温馨, 于明
Source: 《医学信息学杂志》 (Journal of Medical Informatics), CAS, 2016, No. 2, pp. 52-56 (5 pages)
Keywords: automated code assignment; semantic similarity; distributional semantics; ICD-10
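The abstract's comparison of word-based and character-based vectors can be illustrated with a small sketch. This is not the authors' implementation: plain token-count vectors stand in for the distributional vectors, the three-entry code table is a hand-picked illustration, and `LEXICON`, `char_tokens`, `word_tokens`, and `assign_code` are hypothetical names introduced here. Word segmentation is approximated by forward maximum matching over the toy lexicon.

```python
import math
from collections import Counter

# Illustrative subset of ICD-10 codes with short Chinese descriptions
# (assumption: chosen for the example, not taken from the paper's data).
ICD10 = {
    "I10": "原发性高血压",   # essential (primary) hypertension
    "E11": "2型糖尿病",      # type 2 diabetes mellitus
    "J18.9": "肺炎",         # pneumonia, unspecified
}

# Hypothetical mini-lexicon used only by the toy word segmenter below.
LEXICON = {"原发性", "高血压", "糖尿病", "肺炎", "2型"}


def char_tokens(text):
    """Character-level units: every non-space character is a token."""
    return [ch for ch in text if not ch.isspace()]


def word_tokens(text, lexicon=LEXICON, max_len=4):
    """Word-level units via forward maximum matching over the lexicon."""
    tokens, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # Fall back to a single character when no lexicon entry matches.
            if size == 1 or piece in lexicon:
                tokens.append(piece)
                i += size
                break
    return tokens


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def assign_code(diagnosis, tokenize):
    """Return the (code, score) whose description is most similar."""
    query = Counter(tokenize(diagnosis))
    scores = {code: cosine(query, Counter(tokenize(desc)))
              for code, desc in ICD10.items()}
    best = max(scores, key=scores.get)
    return best, scores[best]


if __name__ == "__main__":
    for tokenize in (char_tokens, word_tokens):
        print(tokenize.__name__, assign_code("高血压病", tokenize))
```

With the diagnosis "高血压病", both tokenizers pick I10, but the character-level vector also gives E11 a small nonzero score through the shared character "病", while the word-level vector does not. That difference in granularity is the kind of effect on precision and recall the abstract says was tested.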