摘要
针对在线教育中试题知识点自动标注问题,本文提出了一种基于二部图的试题知识点标注方法。首先,为了合理划分知识点粒度,本文构建了知识点知识图谱来融合知识点;其次,基于教材等语料抽取知识点与特征词二部图和试题与特征词二部图,并利用TF-IDF公式计算知识点与特征词、试题与特征词之间的边权,以构建试题-知识点二部图模型;再次,提出词频加权的相似性度量方法计算试题和知识点之间的相似度,将相似度最高的知识点作为试题知识点标签。最后,以某在线教育平台提供的高中历史试题为实验数据集进行数值实验,实验结果表明该方法的效果显著优于朴素贝叶斯、K最近邻、随机森林以及支持向量机。
Aiming at the problem of knowledge points labeling for test questions in online education,a bipartite graph-based method is proposed for knowledge points labeling in this paper.Firstly,considering the problem that the granularity of knowledge points is fuzzy,the knowledge graph of knowledge points is constructed to integrate knowledge points.Secondly,based on the corpus of textbooks and test questions,we extract the knowledge points bipartite graph as well as the test questions bipartite graph,calculate the weights of edges by the TF-IDF to construct a bipartite graph model between knowledge points and test questions.Besides,similarity measurement method based on term frequency weighting is used to calculate the similarity between questions and knowledge points,marking the knowledge points with the highest similarity to the test questions.Finally,the high school history test questions on the online education platform are used as the experimental data sets.The experiments and analysis show that the proposed method is obviously superior to classical machine learning methods such as Naive Bayes,K-Nearest Neighbor,Random Forest and Support Vector Machines.
作者
郭崇慧
邢小宇
魏伟
GUO Chong-hui;XING Xiao-yu;WEI Wei(Institute of Systems Engineering, Dalian University of Technology, Dalian 116024, China;Center for Energy, Environment & Economy Research, Zhengzhou University, Zhengzhou 450001, China)
出处
《运筹与管理》
CSSCI
CSCD
北大核心
2021年第11期71-75,112,共6页
Operations Research and Management Science
基金
国家自然科学基金资助项目(71771034,72001191)
中央高校基本科研业务费资助项目(DUT21YG108)。
关键词
教育数据挖掘
知识点标注
二部图
educational data mining
knowledge points labeling
bipartite graph