期刊文献+

文档相似度综合计算研究 被引量:40

The Study on the Comprehensive Computation of the Documents Similarity
下载PDF
导出
摘要 论文对几种传统的、具有代表性的文档相似度的计算方法进行了综述,并分析了各自的应用局限性。针对结构化描述的科技论文的特点,提出一种能综合文档特征信息、上下文领域知识和引用关系的新相似度计算算法,并通过原型系统讨论其有效性。 The paper reviews several traditional and typical similarity measures between documents.After analyzing their existing limitations,we propose a new similarity algorithm which can synthesize documents characteristic information, context domain knowledge and citation relation according to structural description of science and technology papers. Finally we discuss its availability using prototype system.
出处 《计算机工程与应用》 CSCD 北大核心 2006年第30期160-163,共4页 Computer Engineering and Applications
关键词 对象相似性 引文图 结构上下文相似性 层次域结构 objects similarity,citation graph,structural context similarity,hierarchy domain structure
  • 相关文献

参考文献14

  • 1L Egghe,C Michel.Construction of weak and strong similarity measures for ordered sets of documents using fuzzy set techniques.http://www.elsevier.com/locate/infoproman,2003
  • 2Prasanna Ganesan,Hector Garcia-Molina,Jennifer Widom.Exploiting hierarchical domain structure to compute similarity[J].ACM Transactions on Information Systems,2003; 21 (1) ;64~93
  • 3Elisa Bertino,Giovanna Guerrini,Marco Mesiti.Measuring the Structural Similarity among XML Documents and DTDs.http://citeseer.ist.psu.edu/bertino01measuring.html,2001
  • 4Sergio Flesca,Giuseppe Manco,Elio Masciari et al.Detecting Structural Similarities between XML Documents.http://www.db.ucsd.edu/Webdb2002/papers/19.pdf,2002
  • 5Ana G Maguitman,Filippo Menczer,Heather Roinestad.Algorithmic Detection of Semantic Similarity.http://www.informatics.indiana.edu/fil/Papers/semsim.pdf,2005
  • 6Wangzhong Lu,Jeannette Janssen,Evangelos Milios et al.Node Similarity in Networked Information Spaces.http://citeseer.ist.psu.edu/lu01node.html,2001
  • 7http://www.google.com
  • 8Glen Jeh,Jennifer Widom.SimRank:A Measure of Structural-Context Similarity.http://www-cs-students.stanford.edu/~glenj/simrank.pdf
  • 9Daniel Fogaras,Balazs Racz.Scaling link-based similarity search.http://www.ilab.sztaki.hu /Websearch/Publications/ fogaras04scaling_sim.pdf,2004
  • 10http://www.citeseer.com

二级参考文献12

  • 1Lindvall M, Rus L, Sinha S S. Software system support for knowledge management [J]. Journal of Knowledge Management, 2003, 7(5): 137-150.
  • 2Kwan M M, Balasubramanian P. Knowledgescope: managing knowledge in context [J]. Decision Support Systems, 2003, 35 (4): 467-486.
  • 3Ma J, Hemmje M. Knowledge management support for cooperative research [A]. IFIP 17th World Computer Congress-TC12 Stream on Intelligent Information Processing[C]. Montreal: kluwer Academic Publishers,2002. 281-285.
  • 4Klint P, Verhoef C. Enabling the creation of knowledge about software assets[J]. Data & Knowledge Engineering, 2002, 41(2-3): 141-158.
  • 5Martin P, Eklund P W. Knowledge retrieval and the world wide web[J]. IEEE Intelligent Systems, 2000, 15 (3): 18-25.
  • 6Shibata H, Hori K. A system to support long-term creative thinking in daily life and its evaluation[A]. Proceedings of the Fourth Conference on Creativity & Cognition[C]. Loughborough:ACM Press, 2002. 142-149.
  • 7Feng L, Marfred A J, Hoppenbrouwers J. Towards knowledge-based digital libraries[J]. SIGMOD Record, 2001, 30(1): 41-46.
  • 8Sheth A, Bertram C, Avant D, et al. Managing semantic content for the web[J]. IEEE Internet Computing, 2002, 6(4): 80-87.
  • 9Bohm C, Braunmuller B, Kriegel H P, et al. Efficient similarity search in digital libraries[A]. IEEE Proceedings of Advances in Digital Libraries[C]. Washington DC:IEEE Computer Society, 2000. 193-199.
  • 10He Y, Hui S C, Fong A C M. Citation-based retrieval for scholarly publications[J]. IEEE Intelligent Systems, 2003, 18(2): 58-65.

同被引文献310

引证文献40

二级引证文献225

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部