期刊文献+

面向问题意图识别的深度主动学习方法

Deep Active Learning Method for Question Intention Recognition
下载PDF
导出
摘要 深度学习已经在许多自然语言处理任务中达到了最高水平的性能,但是这种方法通常需要大量的标注数据。该文围绕问题意图识别语料标注问题,结合深度学习和主动学习技术,实现了语料标注成本的降低。主动学习需要不断迭代地再训练,计算成本非常高,为加速这个进程,该文提出了一种适合问题意图识别任务的轻量级架构,使用双层CNN结构组成的深度学习模型。同时为更好地评估样本的价值,设计了一种结合样本的信息性、代表性与多样性的多准则主动学习方法。最终在民航客服语料下进行实验,实验结果表明该方法可减少约50%的标注工作量,同时在公开数据集TREC问题分类语料上验证了该方法的通用性。 Deep learning has achieved best performance in many natural language processing tasks on the basis of large amount of annotation data.To reduce the cost of corpus annotation,this paper combines the active learning and deep learning to identify the corpus of question intent.To minimize the iteration of retraining in active learning,a lightweight architecture suitable for question intent recognition task is proposed by using a deep learning model consisting of a two-layer CNN structure.At the same time,in order to better evaluate the value of the sample,a multi-criteria active learning method is designed by considering the information,representativeness and diversity of samples.Finally,experiments on the civil aviation customer service corpus show that the method can reduce the annotation workload by about 50%,which is also validated by the public dataset TREC question classification corpus.
作者 付煜文 马志柔 刘杰 白琳 薄满辉 叶丹 FU Yuwen;MA Zhirou;LIU jie;BAI Lin;BO Manhui;YE Dan(Institute of Software,Chinese Academy of Sciences,Beijing 100190,China;TravelSky Mobile Technology Limited,Beijing 100029,China)
出处 《中文信息学报》 CSCD 北大核心 2021年第4期92-99,109,共9页 Journal of Chinese Information Processing
基金 国家重点研发计划(2017YFB1002303) 国家自然科学基金(61802381,61972386) 民航科技重大专项(MHRD20160109)。
关键词 主动学习 文本标注 意图识别 active learning text annotation intention recognition
  • 相关文献

参考文献1

二级参考文献22

  • 1Hastie T,Tibshirani R, Friedman J.The elements of sta- tistical learning: data mining, inference, and prediction, ser.statistics[M].2nd ed.New York:Springer,2009.
  • 2Boser B E, Guyon I M, Vapnik V N.A training algo- rithm for optimal margin classifiers[C]//Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, 1992 : 144-152.
  • 3Haykin S.Neural networks and learning machines[M]. 3rd ed.Cambridge, MA: Prentice-Hall, 2008.
  • 4Settles B.Active learning literature survey[R].Univ of Wisconsin-Madison, 2011.
  • 5Tuia D, Ratle F, Pacifici F, et al.Active learning meth- ods for remote sensing image classification[J].IEEE Trans on Geosci Remote Sens,2009,47(7):2218-2232.
  • 6Copa L,'Tuia D, Volpi M, et al.Unbiased query-by-bagging active learning for VHR image classification[C]//Proc SPIE Remote Sens Conf,2010.
  • 7Di W, Crawford M.Multi-view adaptive disagreement based active learning for hyperspectral image classification[C]// IEEE International Geoscience and Remote Sensing Symposium, 2010 : 1374-1377.
  • 8Muslea I.Active learning with multiple views[J].Journal of Artificial Intelligence Research,2006,27:203-233.
  • 9Campbell C, Cristianini N, Smola A J.Query leaming with large margin classifiers[C]//Proc Int Conf Mach Leam(ICML), 2000: 111-118.
  • 10Schohn G, Cohn D.Less is more: Active learning with support vector machines[C]//Proc 17th ICML, 2000: 839-846.

共引文献23

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部