摘要
以应用题自动求解为目标,以高考入学考试数学试卷中的分层抽样应用题为研究对象,重点研究了分层抽样应用题的句子语义角色识别方法。根据分层抽样的原理,首先定义了分层抽样题意表征中的五种核心语义角色,分别为总体、样本、总体中的层、样本中的层和实体之间的关系。基于这五种语义角色,应用题题意理解中的核心问题被转换为对应用题文本中的句子进行语义角色判定。提出了一种基于特征词与n-gram模型相结合的句子语义角色判定方法,对分层抽样应用题文本中的句子进行语义角色判定。根据测试集中的实验结果,应用题的整题识别准确率由基于特征词的判定方法的17.95%提高到64.1%。实验结果说明基于特征词与ngram模型相结合的句子语义角色判定方法能够提高题意理解的准确率。
This paper proposed a semantic role annotation method of sentences for the stratification sampling word problem in China’s college entrance examination. According to the basic principles of stratification sampling, this paper defined five core semantic roles to represent the meaning of stratification sampling word problem. These five roles were population, levels in the population, sample, levels in the sample and the relation between entities. With the help of these five semantic roles, it could solve the word problem if the sentences in the word problem could be mapped with these semantic roles. To achieve this goal, this paper presented a hybrid method based on characteristic words and n-gram model to annotate the semantic roles of sentences in stratification sampling word problem. The experimental results show that the accuracy of the right-annotated word problems improves from 17.95% to 64.1%. It proves that the hybrid method can significantly improve the accuracy of stratification sampling word problem understanding.
作者
吴林静
劳传媛
范桂林
黄景修
刘清堂
Wu Linjing;Lao Chuanyuan;Fan Guilin;Huang Jingxiu;Liu Qingtang(School of Educational Information Technology,Central China Normal University,Wuhan 430079,Chin)
出处
《计算机应用研究》
CSCD
北大核心
2018年第8期2299-2303,共5页
Application Research of Computers
基金
国家"863"计划资助项目(2015AA015408)
国家教育部新世纪优秀人才计划项目(NCET-13-0818)
国家"十二五"科技支撑计划资助项目(2015BAK27B02)
中央高校基本科研业务费项目(CCNU15A02020
CCNU16A05023)
关键词
应用题自动求解
题意理解
语义角色
特征词
N-GRAM
word problem automatically solving
understanding of word problem
semantic roles
characteristic words
n-gram