Multi-feature Fusion Paragraph Similarity Calculation Incorporating First and Last Paragraphs and First and Last Sentences
Abstract The first and last paragraphs of a document and the first and last sentences of a paragraph contribute substantially to semantics and should be treated as major factors in judging paragraph similarity. This paper incorporates them into the SiteQ algorithm with appropriate weights and proposes Topic-SiteQ, a multi-feature fusion paragraph similarity algorithm that takes first and last paragraphs and first and last sentences into account. The algorithm computes the semantic similarity of the first and last sentences with a multi-feature fusion method and reflects their contribution to paragraph similarity through weights; it also raises the scores of the first and last paragraphs, and recommends and ranks paragraphs by the resulting score. Experiments show that with Topic-SiteQ the MRR of relevant-paragraph ranking increases by 0.032 and the F-measure increases by 1.4% on average, indicating that the improvement is effective.
Authors Jiang Zongli (蒋宗礼), Zhao Jie (赵洁)
Source Computer and Modernization (《计算机与现代化》), 2016, No. 9, pp. 10-14, 20 (6 pages in total)
Funding National Natural Science Foundation of China (61133003)
Keywords automatic question answering system; SiteQ algorithm; semantic similarity; multi-feature fusion
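
The abstract describes the scoring idea only in outline, so the following is a minimal sketch under stated assumptions, not the authors' implementation. It shows one way a base passage score could be fused with weighted first- and last-sentence similarity, how paragraphs at the start or end of a document could be boosted, and how MRR is computed over ranked results. The field names, the weights `w_first` and `w_last`, and the factor `edge_boost` are hypothetical; the record does not give the actual Topic-SiteQ formulas or parameter values.

```python
from dataclasses import dataclass


@dataclass
class Paragraph:
    doc_id: str
    index: int          # position of the paragraph within its document
    total: int          # number of paragraphs in that document
    base_score: float   # stand-in for a SiteQ-style passage score (assumed given)
    first_sim: float    # stand-in similarity of the first sentence to the query
    last_sim: float     # stand-in similarity of the last sentence to the query


def fused_score(p, w_first=0.2, w_last=0.1, edge_boost=1.1):
    """Add weighted first/last sentence similarity to the base score,
    then boost paragraphs that open or close their document.
    All weights here are illustrative assumptions."""
    score = p.base_score + w_first * p.first_sim + w_last * p.last_sim
    if p.index == 0 or p.index == p.total - 1:
        score *= edge_boost
    return score


def rank(paragraphs):
    """Sort paragraphs by fused score, highest first."""
    return sorted(paragraphs, key=fused_score, reverse=True)


def mean_reciprocal_rank(ranked_lists, relevant_sets):
    """MRR: mean over queries of 1 / rank of the first relevant item
    (a query with no relevant item in its list contributes 0)."""
    total = 0.0
    for ranked, relevant in zip(ranked_lists, relevant_sets):
        for position, item in enumerate(ranked, start=1):
            if item in relevant:
                total += 1.0 / position
                break
    return total / len(ranked_lists)
```

In this sketch a first paragraph with a slightly lower base score can outrank a middle paragraph once the sentence-similarity terms and the edge boost are applied, which is the kind of reordering the abstract attributes to Topic-SiteQ.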