期刊文献+

面向计算机科学领域的专业实体识别 被引量:1

Professional entity recognition for computer science
下载PDF
导出
摘要 为获取科研学术论文中涉及的专家研究领域等专业实体信息,给学术论文或科技项目评审专家的推荐提供理论参考,面向计算机科学领域,提出了一种基于RoBERTa-wwm的实体识别模型对专家学术论文中包含的专业实体进行识别。首先,以已有的专家基本信息数据表为参照,利用中国知网高级检索功能和爬虫技术获取表中列举专家的学术论文摘要数据;接着,将摘要数据经人工标注后,通过RoBERTa-wwm预训练模型获取具有语义特征的字符向量作为下游模型的输入;最后,将上游的语义字符向量输入BiLSTM-CRF模型中实现对文本中的专业实体识别。通过实验验证,提出的模型在自主标注的数据集中取得了更好的效果。其中,模型F1值达到了89.94%,高于实验中的对比模型,具有良好的识别专业实体的能力。 To obtain professional entity information including expert research fields in academic papers and provide theoretical references for academic paper or technology project review experts,an entity recognition model based on RoBERTa-wwm is proposed to identify professional entities in academic papers in the field of computer science.First,with the reference of the available experts’ basic information table,the abstract data of these experts’ academic papers are obtained through the advanced search of the China National Knowledge Infrastructure(CNKI)and crawler technology.Next,the abstrac data are manually annotated and the RoBERTa-wwm pre-training model is employed to obtain character vectors with semantic features as inputs for downstream models.Finally,the semantic character vectors are put into the BiLSTM-CRF model to identify professional entity recognition in the text.The experiments show the proposed model achieves better results in the self-labeled dataset.The F1 score of model reaches 89.94%,higher than all other comparison models in the experiment,demonstrating its excellent ability to identify professional entities.
作者 陈祥 张仰森 李尚美 胡昌秀 成琪昊 CHEN Xiang;ZHANG Yangsen;LI Shangmei;HU Changxiu;CHENG Qihao(Institute of Intelligent Information,Beijing Information Science&Technology University,Beijing 100101,China;National Economic Security Early Warning Engineering Beijing Laboratory,Beijing 100044,China)
出处 《重庆理工大学学报(自然科学)》 北大核心 2023年第11期205-212,共8页 Journal of Chongqing University of Technology:Natural Science
基金 国家自然科学基金项目(62176023)。
关键词 专业实体识别 RoBERTa-wwm 专家研究领域 计算机科学 professional entity identification RoBERTa-wwm expert research field computer science
  • 相关文献

参考文献15

二级参考文献60

共引文献360

同被引文献13

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部