期刊文献+

基于组合语义相似度计算的疾病术语自动编码 被引量:3

Automatic Coding Method for Disease Term Based on Combined Semantic Similarity Calculation
下载PDF
导出
摘要 当前国内医疗机构疾病编码主要采用国际疾病分类ICD-10标准并由人工完成,人工工作量大、时间成本高。提出了一种基于组合语义相似度技术进行疾病术语自动编码的方法,其基于领域知识库结合分词、实体识别和词向量表示技术进行术语相似度计算。通过在妇产科疾病中的应用表明,该方法在术语自动编码精度能达到80%以上,可以有效辅助临床医生书写诊断编码,减少病案科审核工作量,提升总体工作效率。 Currently, the national medical institutions mainly adopt the ICD-10 standard for the disease code,and it iscompleted manually. The large amount of manual work and time cost are the main problems that we are facing. This paper proposes an automatic coding disease terms methodbased on domain knowledge, entity recognition and Word2 vec technology for term similarity calculation. The application in the digestive diseases shows that the method can achieve more than 80% accuracy in the term automatic coding, which can effectively assist the clinician to write the diagnosis code, reduce the workload of the medical record review, and improve the overall work efficiency.
作者 黄嘉俊 HUANG Jiajun(Information Department,Shanghai Changning Maternity&Infant Health Hospital,Shanghai 200050,China)
出处 《微型电脑应用》 2020年第8期157-160,共4页 Microcomputer Applications
关键词 自动编码 语义相似度 实体识别 词向量 automatic encoding semantic similarity entity identification word embedding
  • 相关文献

参考文献4

二级参考文献16

  • 1张锋,许云,侯艳,樊孝忠.基于互信息的中文术语抽取系统[J].计算机应用研究,2005,22(5):72-73. 被引量:36
  • 2Bourigault D. Surface Grammatical Analysis for the Extraction of Terminological Noun Phrases [C]. In: Proceedings of the 14th Conference on Computational Linguistics. Association for Computational Linguistics, 1992: 977-981.
  • 3Justeson J S, Katz S M. Technical Terminology: Some Linguistic Properties and an Algorithm for Identification in Text [J]. Natural Language Engineering, 1995, 1 (1): 9-27.
  • 4Ananiadou S. A Methodology for Automatic Term Recognition [C]. In: Proceedings of the 15th Conference on Computational Linguistics. Association for Computational Linguistics, 1994: 1034-1038.
  • 5Frantzi K, Ananiadou S, Mima H. Automatic Recognition of Multi-word Terms: The C-value/NC-value Method [J]. International Journal on Digital Libraries, 2000, 3(2): 115-130.
  • 6ManningCD,SchutzeH.统计自然语言处理基础[M].范春法译.第4版.北京:电子工业出版社,2005:95-97.
  • 7Takeuchi K, Collier N. Use of Support Vector Machines in Extended Named Entity Recognition [C]. In: Proceedings of the 6th Conference on Natural Language Learning. Stroudsburg, PA, USA: Association for Computational Linguistics, 2002: 1-7.
  • 8Lafferty J D, McCallum A, Pereira F C. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data [C]. In: Proceedings of the 18th International Conference on Machine Learning. San Francisco, CA, USA: Morgan Kaufmann Publishers, 2001:282-289.
  • 9Lee C M, Huang C K, Tang K M, et al. IterativeMachine-Learning Chinese Term Extraction [C]. In: Proceedings of the 14th International Conference on Asia-Pacific Digital Libraries (1CADL 2012), Taipei, China. Berlin: Springer, 2012:309-312.
  • 10Mikolov T. Word2vec Code [CP/OL]. [2015-09-18]. http:// word2vec.googlecode.com/svn/trunk/.

共引文献50

同被引文献23

引证文献3

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部