期刊文献+

基于查询表达式特征的时态意图识别研究 被引量:10

Temporal Intent Classification with Query Expression Feature
原文传递
导出
摘要 【目的】针对时态意图识别问题,探讨可抽取查询表达式特征的有效性及采用不同类别分类算法的识别准确度,为后续相关研究提供一定的借鉴。【方法】按查询表达式特征与时间的关联性,将其归类为时间无关特征、潜在时间特征、显式时间特征。在此基础上,分别采用有监督分类算法及半监督分类算法,探讨采用不同特征组合的有效性及不同分类算法的识别准确度。【结果】在抽取的三类查询表达式特征中,仅使用显式时间特征的平均分类准确率最高,且"查询是否包含年份"这一特征为强特征;使用不同分类算法的识别准确度相差不大;时态意图识别结果优于已有参与时态意图分类子任务(TQIC)测评的成果,平均分类准确率为81.14%。【局限】限于数据集的获取途径,仅对300条查询的时态意图识别效果进行验证;仅考虑已有的查询表达式特征,未提出用于时态意图识别的新特征。【结论】查询表达式特征中与时间关联性高的特征能提高时态意图识别准确度,而基于统计的特征(如查询词长度)对时态意图识别分类准确度的提升效果不明显。 [Objective]This paper investigates the effectiveness of query-based features and compares the performance of two types of classifiers in a query temporal intent classification task.[Methods]This paper first reviews all query-based features and then classifies those features into three types,according to their temporal relevance,namely,atemporal,implicit temporal and explicit temporal.Then,it tests accuracy of a temporal query intent classification task,using a supervised classifier and a semi-supervised classifier individually,with various combinations of query-based features of different types.[Results]Among all tested query-based features,using explicit temporal features achieves best accuracy,especially for the feature on whether a query contains a year;The performance hardly varies across classifiers;Our best macro average accuracy of 81.14%is higher than that in previous studies with the same experimental setups.[Limitations]Due to accessibility of dataset,our experiments are done on a limited size dataset.Only existing query-based features are studied and no new feature is proposed or tested.[Conclusions]Using highly temporal relevant features can improve accuracy in temporal query intent classification task,whereas using slightly temporal relevant features could hardly improve accuracy.
作者 桂思思 陆伟 张晓娟 Gui Sisi;Lu Wei;Zhang Xiaojuan(School of Information Management,Wuhan University,Wuhan 430072,China;Institute for Information Retrieval and Knowledge Mining,Wuhan University,Wuhan 430072,China;Center for Studies of Information Resources,Wuhan University,Wuhan 430072,China;School of Computer and Information Science,Southwest University,Chongqing 400715,China)
出处 《数据分析与知识发现》 CSSCI CSCD 北大核心 2019年第3期66-75,共10页 Data Analysis and Knowledge Discovery
基金 国家社会科学基金青年项目"融合用户个性化与实时性意图的查询推荐模型研究"(项目编号:15 CT Q019)的研究成果之一
关键词 时态意图 有监督分类 半监督分类 特征抽取 Temporal Intent Supervised Classification Semi-supervised Classification Feature Engineering
  • 相关文献

参考文献1

二级参考文献3

共引文献1

同被引文献94

引证文献10

二级引证文献24

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部