
Named Entity Recognition Method for Process Planning Text
Abstract To efficiently recognize key information in unstructured process planning text, a named entity recognition model based on a machining-domain dictionary and neural networks is established. First, the machining-domain dictionary is combined with jieba word segmentation to annotate the dataset automatically; when process parameters are annotated, each number and its identifying letters are treated as a single segmentation unit, which improves subsequent feature extraction. Second, on top of word2vec word embeddings, a bidirectional long short-term memory (BiLSTM) network extracts features from the text. Finally, a conditional random field (CRF) incorporates contextual logic to improve the recognition accuracy of key process information. The model is evaluated on a dataset of 431 work-step descriptions. It achieves a precision of 90.20%, a recall of 93.88%, and an F1 score of 92.00%, which gives it a certain advantage over traditional models in the field. Tests on three different process planning datasets further verify the robustness of the model.
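The automatic annotation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the dictionary entries, the entity labels (`OPERATION`, `FEATURE`, `PARAM`), the parameter pattern, and the `B-`/`O` tag scheme are all assumptions made for illustration, and a simple forward maximum-matching loop stands in for jieba loaded with the domain dictionary. The point it shows is that a number and its identifying letters (for example `Φ50mm` or `Ra3.2`) are kept together as one token before tagging.

```python
import re

# Hypothetical machining-domain dictionary: term -> entity label.
# Both the terms and the label names are illustrative assumptions.
DOMAIN_DICT = {
    "粗车": "OPERATION",  # rough turning
    "外圆": "FEATURE",    # outer cylindrical surface
}

# Assumed shape of a process parameter: optional identifying letters,
# a number, then optional unit letters (e.g. "Φ50mm", "Ra3.2").
PARAM_RE = re.compile(r"[A-Za-zΦφ]*\d+(?:\.\d+)?[A-Za-z]*")


def annotate(text):
    """Return (token, tag) pairs for one work-step description."""
    max_len = max(len(term) for term in DOMAIN_DICT)
    pairs = []
    i = 0
    while i < len(text):
        # 1) Keep a number and its identifying letters as a single token.
        match = PARAM_RE.match(text, i)
        if match:
            pairs.append((match.group(), "B-PARAM"))
            i = match.end()
            continue
        # 2) Forward maximum matching against the domain dictionary.
        for n in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + n]
            if candidate in DOMAIN_DICT:
                pairs.append((candidate, "B-" + DOMAIN_DICT[candidate]))
                i += n
                break
        else:
            # 3) Anything else is a single-character non-entity token.
            pairs.append((text[i], "O"))
            i += 1
    return pairs


if __name__ == "__main__":
    for token, tag in annotate("粗车外圆至Φ50mm,Ra3.2"):
        print(token, tag)
```

Token and tag sequences produced this way could then serve as training data for a sequence labeller such as the word2vec, BiLSTM, and CRF stack named in the abstract.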
Authors Dong Hanxiao (董含笑); Li Yuhu (李豫虎); Qiao Lihong (乔立红); Huang Zhicheng (黄志成) (School of Mechanical Engineering & Automation, Beihang University, Beijing 100191)
Source Journal of Computer-Aided Design & Computer Graphics (《计算机辅助设计与图形学学报》), 2024, No. 2, pp. 313-320 (8 pages). Indexed in EI, CSCD, and the Peking University Core Journals list.
Funding National Key Research and Development Program of China.
Keywords bidirectional long short-term memory network; conditional random field; named entity recognition; knowledge extraction

