期刊文献+

基于属性的文本相似度计算算法改进 被引量:6

Improvement of Text Similarity Computing Algorithm Based on Attribute
下载PDF
导出
摘要 基于属性的重心剖分模型是一种较为新颖的文档相似度计算模型,但容易导致语义信息丢失和效率低下。针对上述问题,提出一种改进的重心剖分模型,通过计算查询线与文档单纯形的交点与文档重心点之间的相似度,使得结果保留属性坐标系中文档向量的特征。实验结果表明,该模型的查全率、查准率和F1值可以提高2%~4%左右。 Documents similarity computing with attribute barycenter coordinate model is a relatively new method, but the semantic information easily loss and is inefficient. For resolving these problems, an improved algorithm based on the attribute barycenter coordinate is presented. The method is inspired from the satisfying degree function in decision-making assessment theory. Matching the points between the intersection of query line and document complex and document barycenter using the new algorithm can keep the character of document vector within the result and improve the precision as well as efficiency. Experimental results show that the recall, precision and value of F1 of the model can increase 2%,-4%.
出处 《计算机工程》 CAS CSCD 北大核心 2009年第17期4-6,共3页 Computer Engineering
基金 国家"863"计划基金资助项目(2007AA12Z221) 重庆市自然科学基金资助项目(CSTS2007BB2446) 南京师范大学科研基金资助重点项目(2006105XGQ0051)
关键词 相似度计算 属性坐标系 属性重心点 similarity computing attribute coordinate attribute barycenter point
  • 相关文献

参考文献3

二级参考文献8

  • 1史忠植,高级人工智能,1997年
  • 2Wong S K M,Proc 8th Annual ACMSIGIR Int Conf Research and Development in Information Retrieval,1985年,18页
  • 3冯嘉礼,董占球.基于属性整合的知觉模式生成与识别模型[J].计算机研究与发展,1997,34(7):481-486. 被引量:30
  • 4Baeza Yates R,Ribirero Neto B.Moden Information Retrieval[M].Addison Wesley:Longman Publishing,1999.
  • 5Feng J L.The research on decision supports system of nuclear accident emergency and its computer realization[D].Beijing:Chinese Atomic Energy Institute,2001,97-118.
  • 6Gerard Salton,Chris Buckley.Improving retrieval performance by relevance feedback[J].J of the American Society for Information Science,1990,41(4):288-297.
  • 7Salton G,Buckley C.Term-weighting approaches in automatic retrieval.Infor-mation[J].Processing and Management,1988,24(5):513-523.
  • 8潘谦红,王炬,史忠植.基于属性论的文本相似度计算[J].计算机学报,1999,22(6):651-655. 被引量:63

共引文献62

同被引文献45

引证文献6

二级引证文献22

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部