期刊文献+

基于深度学习的中文网络招聘文本中的技能词抽取方法 被引量:4

Skill word extraction from Chinese online recruitment texts based on deep learning
下载PDF
导出
摘要 为了能够充分利用领域知识来提升技能词的抽取性能,提出了一种基于深度学习与语料特征相结合的技能词抽取方法。将技能词抽取转化为序列标注问题,以序列标注的基本模型Bi-LSTM-CRF为基础,在输入层中加入语料特征,并将输入层的输出与Bi-LSTM输出连接在一起作为CRF层的输入。实验结果表明,提出的技能词抽取方法效果提升明显,加入的语料特征有利于提升技能词抽取的准确率,并能够缓解标注数据的稀缺。 In order to make full use of domain knowledge to improve the performance of skill word extraction,a method of skill word extraction based on the combination of deep learning and corpus features is proposed.Skill word extraction is transformed into a sequence tagging problem.Based on the basic model of sequence tagging,Bi-LSTM-CRF,which adds corpus features to the input layer,and connects the output of the input layer with the output of the Bi-LSTM layer as the input of the CRF layer.The results of a large number of experiments show that the effect of the proposed skill word extraction method has improved significantly.The added corpus features can help improve the accuracy of skill word extraction and reduce the effort of data annotations.
作者 文益民 杨鹏 文博奚 蔡翔 WEN Yimin;YANG Peng;WEN Boxi;CAI Xiang(Guangxi Key Laboratory of Trusted Software,Guilin University of Electronic Technology,Guilin 541004,China;Guangxi Key Laboratory of Image and Graphic Intelligent Processing,Guilin University of Electronic Technology,Guilin 541004,China;School of Business,Guilin University of Electronic Technology,Guilin 541004,China)
出处 《桂林电子科技大学学报》 2020年第4期338-348,共11页 Journal of Guilin University of Electronic Technology
基金 国家自然科学基金(61866007,71463010) 广西自然科学基金(2018GXNSFDA138006) 广西学位与研究生教育改革课题(JGY2017055) 教育部人文社会科学研究项目(17JDGC022)。
关键词 网络招聘 技能词 序列标注 深度学习 online recruitment text skill words sequence labeling deep learning
  • 相关文献

参考文献11

二级参考文献159

共引文献213

同被引文献56

引证文献4

二级引证文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部