期刊文献+

Stack Overflow系统的特征融合答案推荐策略

FEATURE INTEGRATION ANSWER RECOMMENDATION STRATEGY FOR STACK OVERFLOW SYSTEM
下载PDF
导出
摘要 针对Stack Overflow系统中用户寻找问题答案效率低的问题,提出一种基于标题相似度、描述相似度、标签相似度、语义相似度的特征融合答案推荐策略(FIARS)。从Stack Overflow网站中抽取“问题与答案”语料集,对答案进行去重处理,建立问题索引和问题对应的答案集索引;采用余弦相似度计算新问题与语料库中问题在标题、标签、问题描述等维度上的相似度,并构建语义模型计算语义相似度;基于这些相似度筛选出最佳的“问题与答案”候选集并把答案推荐给用户。为了验证策略的可行性和有效性,使用Stack Overflow真实数据集进行分析实验,实验结果表明该策略能够较大地提高答案推荐的准确率。 Aiming at the problem of low efficiency of searching for answers in Stack Overflow system,this paper proposed a feature integrated answer recommendation strategy(FIARS) based on title similarity,description similarity,tag similarity and semantic similarity.We extracted the "question and answer" corpus from the Stack Overflow website,eliminated duplication and built the question and the answer set index.The cosine similarity was used to calculate the similarity between the new problem and the corpus in terms of title,tag and problem description,and the semantic model was constructed to calculate the semantic similarity.Based on these similarities,the best candidate sets of "questions and answers" were selected,and the answers were recommended to users.In order to verify the feasibility and effectiveness of the strategy,we used real data set from Stack Overflow to analyze the experiment.The experimental results show that the strategy can greatly improve the accuracy of answer recommendation.
作者 万杰 赵逢禹 刘亚 Wan Jie;Zhao Fengyu;Liu Ya(School of Optoelectronic Information and Computer Engineering,University of Shanghai for Science and Technology,Shanghai 200093,China)
出处 《计算机应用与软件》 北大核心 2019年第8期60-64,129,共6页 Computer Applications and Software
基金 国家自然基金青年基金项目(61402288)
关键词 STACK OVERFLOW 特征融合 余弦相似度 语义模型 答案推荐 Stack Overflow Feature integration Cosine similarity Semantic model Answer recommendation
  • 相关文献

参考文献1

二级参考文献6

  • 1Agichtein E, Castillo C, Donato D, et al. F.inding High-quality Content in Social Media[C]//Proc. of WSDM'08. [S. 1.]: ACM Press, 2008: 183-194.
  • 2Jeon J, Croft W B, Lee J H. Finding Similar Questions in Large Question and Answer Archives[C]//Proc. of the 14th ACMInternational Conference on Information and Knowledge Management. [S. 1.]: ACM Press, 2005: 84-90.
  • 3Jurczyk P, Agichtein E. Discovering Authorities in Question Answer Communities by Using Link Analysis[C]//Proc. of the 16th ACM Conference on Information and Knowledge Management.[S. 1.]: ACM Press, 2007: 919-922.
  • 4Hofmann T. Probabilistic Latent Semantic Indexing[C]//Proc. of the 15th Conference on Uncertainty in Artificial Intelligence. [S. 1.]: ACM Press, 1999: 50-57.
  • 5Qu Mingcheng, Qiu Guang. Probabilistic Question Recommen-dation for Question Answering Communities[C]//Proc. of the18th International Conference on World Wide Web. [S. 1.]: ACM Press, 2009: 1229-1230.
  • 6罗景,涂新辉.基于概率潜在语义分析的中文信息检索[J].计算机工程,2008,34(2):199-201. 被引量:11

共引文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部