摘要
给定包含主旨概括句的汉语句群,针对该句群的内部结构标注是基于语言学的分析结果,而阅读句群时的眼动轨迹则蕴含着人的心理认知,两者的信息融合和内在关联性分析是该文主要工作。该文使用基于径向基函数支持向量机和递归特征消除的分类模型,根据标点小句片段对应的眼动指标数据预测该片段是否为包含主旨内容的关键信息,达到了0.76的准确率,并通过分析关键片段上眼动数据的分布特点,提取出对句群主旨概括信息区分度较好的眼动指标。
Given a Chinese sentence group that contains a theme sentence,the internal structure label of the sentence group is based on the results of linguistic analysis.The main work of this paper is the information fusion and internal relevance analysis of structure label and eye movement trace of reading sentence group,which contains human psychological cognition.A classification model based support vector machine and recursive feature elimination is used to predict whether the punctuation clause segment is the key information containing the thematic content according to the corresponding eye movement feature data.By analyzing the distribution characteristics of eye movement data on the key segment,eye movement features with good distinction for the thematic information of the sentence group are extracted,and the final accuracy of 0.76 is achieved.
作者
单昊聪
周强
SHAN Haocong;ZHOU Qiang(School of Informatics,The University of Edinburgh,Edinburgh,EH89JU,U.K;Institute for Artificial Intelligence,Department of Computer Science andTechnology,Tsinghua University,Beijing 100084,China)
出处
《中文信息学报》
CSCD
北大核心
2023年第1期169-178,共10页
Journal of Chinese Information Processing
基金
国家自然科学基金(61433018)。
关键词
眼动记录
文本结构标注
支持向量机
eye movement feature
text structure label
support vector machine