期刊文献+

基于维基百科结构特征的语义相关度计算方法研究 被引量:2

Research on Semantic Relatedness Computing Method Based on Wikipedia Structure
原文传递
导出
摘要 语义相关度计算是智能信息处理和文本挖掘领域的重要研究内容。本文将维基百科作为语义知识库,利用维基百科层次分类体系结构、解释页面之间的链接结构等结构特征进行词语之间的相关度计算。针对层次分类体系的有向无环图结构,采用多路径语义相关度计算方法进行相关度计算;针对链接的重要性和类型,融合链接权重和链接类型进行相关度计算。实验结果表明,该方法取得了预期的实验效果,证明该方法是可行和有效的。 Semantic relatedness computing is one of the key research content in intelligent informationprocessing and text mining. Based on Wikipedia, this paper computes the relatedness between words withtaxonomic hierarchies and link structure. Adopts multipath semantic relatedness calculation method forthe acyclic directed graph structure of taxonomic hierarchies, combines link weight and link type for theimportance and types of link structure. Experiment results demonstrate that this method achieved a goodanticipative effect, and this method is feasible and effective.
作者 曾光
出处 《情报科学》 CSSCI 北大核心 2015年第9期72-75,120,共5页 Information Science
关键词 语义相关度 结构特征 维基百科 semantic relatedness structure feature wikipedia
  • 相关文献

参考文献14

二级参考文献64

  • 1车万翔,刘挺,李生.实体关系自动抽取[J].中文信息学报,2005,19(2):1-6. 被引量:115
  • 2刘群 李素建.基于《知网》的词汇语义相似度的计算.中文计算语言学,2002,17(2):59-76.
  • 3Leacock C,Chodorow M.Combining Local Context and WordNet Similarity for Word Sense Identification[EB/OL].(1998-05-18).http://www.bibsonomy.org/bibtex/2087c974c471792ddlfa536aa6a 75eobc/asalber.
  • 4Resnik P Using Information Content to Evaluate Semantic Similarity in a Taxonomy[C]//Proc.of the 14th International Joint Conference on Artificial Intelligence.[S.l.]:Springer,1995:448-453.
  • 5Struve M,Ponzetto S P.WikiRelate!Computing Semantic Relatedness Using Wikipedia[C]//Proc.of Association for the Advancement of Artificial Intelligence.Boston,USA:IEEE Press,2006:1419-1424.
  • 6Dat P.TNguyen. Yutaka Matsuo, Mitsuru Ishizuka. Subtree Mining for Relation Extraction from Wikipedia [C]. In proceedings of NAACL HLT 2007, Companion Volume125-128.
  • 7Brin, S., Extracting patterns and relations from the World Wide Web [C]. In Proceedings of the 1st International Workshop on the Web and Databases (WebDB' 98), Valencia, Spain, 1998.
  • 8Agichtein, E., Gravano, L., Snowball: Extracting Relations from Large Plain-text Collections [C]. In Proceedings of the 5th ACM International Conference on Digital Libraries(DL'00), 2000.
  • 9Pantel, P., Pennacchiotti, M., Espresso: Leveraging Generic Patterns for Automatically Harvesting Semantic Relations [C]. In Proceedings of 23rd International Conference on Computational Linguistics(COLING' 06), 2006.
  • 10Schutz, A., Buitelaar, P., RelExt: A Tool for Relation Extraction from Text in Ontology Extension [C]. In Proceedings of the 4th International Semantic Web Conference (ISWC' 05), 2005.

共引文献62

同被引文献100

  • 1王广正,王喜凤.基于知网语义相关度计算的词义消歧方法[J].安徽工业大学学报(自然科学版),2008,25(1):71-75. 被引量:10
  • 2Mohammad S,Hirst G. Distributional Measures of Concept- distance: A Task-oriented Evaluation [ C ]//Proceedings of the Conference on Empirical Methods in NaturalLanguage Processing, 2006:35-43.
  • 3Zesch T, Gurevych I. Wisdom of crowds versus wisdom of linguists-measuring the semantic relatedness of words [J]. Natural Language Engineering, 2010, 16 ( 1 ): 25 -59.
  • 4Zhang Z Q, Gentile A L, Ciranegna F. Recent advances in methods of lexical semantic relatedness-a survey [ J ]. Natural Language Engineering, 2013, 19(4) : 411-479.
  • 5Lesk M E. Automated sense disambiguation using machine readable dictionaries: how to tell a pine cone from all ice cream cone [ C ]//Proceedings of the S1GDOC, New York USA, 1986: 24-26.
  • 6Banerjee S, Pedersen T. Extended gloss overlaps as a measure of semantic relatedness//Proceedings of the IJCIA, Acapulco, Mexico, 2003 : 805-810.
  • 7Gurevych I. Using the structure of a conceptual network in computing semantic relatedness [ C ]//Proceedings of the IJCNLP, Berlin, Germany, 2005 : 767-778.
  • 8Gentleman R. Visualizing and distances using GO [ OL]. http://www, bioconductor, org. April 16, 2015.
  • 9Morris J, Hirst G. Lexical cohesion computed by thesaural relations as an indicator of the structure of text [ J ]. Computational Linguistics, 1991, 17 ( 1 ) :21-48.
  • 10Kozima H, Teiji H. Similarity between words computed by spreading activation on an English dictionary [ C ]// Proceedings of the EACL, Utrecht, Germany, 1993: 232-240.

引证文献2

二级引证文献16

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部