期刊文献+

基于在线百科知识库的文本语义相关度计算

Text Semantic Relativity Calculation Based on Online Encyclopedia
下载PDF
导出
摘要 本文在中文维基百科知识库的基础上,对文本语义相关度计算进行了研究.实验选取了2014年12月15日在中文维基百科网站下载的主题文章,进行处理后作为语义概念知识库.在Words-240测试集上的实验结果表明,该方法比基于Word Net的LSA算法的效果要好. In text semantic understanding , massive quantity of common sense and specialized knowledge is needed;it doesn ’ t suffice to only use human-compiled dictionary and thesaurus .As social networks develop , on-line Wikipedia provides a platform for sharing and improving human knowledge .This paper , on the basis of Chi-nese Wikipedia , studies the calculation of text semantic relativity .A corpus of processed texts is used as knowledge base of concepts;the texts are downloaded from Chinese Wikipedia as of 2014 December 15 .The results of experi-ment on Words-240 test set indicate that the method discussed in this paper is superior to WordNet-based approa-ches and LSA method .
作者 刘海静
出处 《洛阳师范学院学报》 2015年第5期80-83,共4页 Journal of Luoyang Normal University
关键词 语义理解 在线百科知识库 语义相关度 semantic understanding online encyclopedia semantic relativity
  • 相关文献

参考文献11

  • 1Deerwester S, Dumais S, Fumas G, et al.. Indexing by latent semantic analysis[ J]. Journal of the American Socie- ty for Information Science, 1990,41 (6), 391 -407.
  • 2Fellbaum C. WordNet: An Electronic Lexical Database [ M]. MIT Press, Cambridge, 1998.
  • 3Roget P. Roget's Thesaurus of English Words and Phrases [ M]. Longman Group Ltd ,1852.
  • 4Budanitsky A, Hirst G. Evaluating wordnet - based meas- ures of lexical semantic Relatedness [ J ]. Computational Linguistics, 2006, 32 (1), 13 - 47.
  • 5Michael S, Sinone P. WikiRelate Computingsemantic relat- edness using Wikipedia[ A]. In proceedings of.the 21th A- merican Association for Artifiaial Intelligence[C]. Boston, AAAI Press ,2006 : 1419 - 1424.
  • 6Gurevyeh I, Mueller C, Zesch T. What to be? - electron- ic career guidance based on semantic relatedness. In Pro- ceedings of the 45th Annual Meeting of the Association for Computational Linguistics ,2007.
  • 7Chang M, Ratinov L, Roth D, et al. Importance of seman- tic representation: Dataless classification. In Proceedings of the 23rd AAAI Conference on Artificial Intelligence , 2008.
  • 8Evgeniy G, Ahanl M. Wikipedia- based Semantic Inter- pretation for Natural Language Processing[ J]. Journal of Artificial Intelligence Research, 2009 (34) :443 - 498.
  • 9汪祥,贾焰,周斌,丁兆云,梁政.基于中文维基百科链接结构与分类体系的语义相关度计算[J].小型微型计算机系统,2011,32(11):2237-2242. 被引量:18
  • 10Lee M, Pincombe B, Welsh M. A comparison of machine measures of text document similarity with human judg- ments. In 27th Annual Meeting of the Cognitive Science Society, 2005.

二级参考文献13

  • 1Philip Resnik. Using information content to evaluate semantic simi- larity in a taxonomy [A]. In: C. Raymond Perrault, Chris S. Mellish, Renato deMori eds. Proceedings of the 14th International Joint Conference on Artificial InteUigence [ C]. Montreal: AAAI Press, 1995:448-453.
  • 2George A Miller. WordNet: a lexical database for english [ C].Communications of the ACM, 1995:38( 11 ) :39-41.
  • 3Ted Pedersen, Siddharth Patwardhan, Jason Michelizzi. WordNet: similarity: measuring the relatedness of concepts [ C ]. In: David Palmer, Joseph Polifroni, Deb Roy, eds. Proc. of Human Lan- guage Tectmology conference. Montteal: Association for Computa- tional Linguistics, 2004:38-41.
  • 4Li Yun. Mining semantic knowledge from chinese Wikipedia [D]. Beijing University of Posts and Telecommunications,2009.
  • 5Evgeniy Gabrilovich, Shaul Markovitch. Computing semantic relat edness using Wikipedia-based explicit semantic analysis [ A]. InI Manuela Veloso. Proceedings of the 20th International Joint Confe1 ence on Artificial Intelligence [ C ]. Hyderabad: AAAI Press 2007 : 1606-1611.
  • 6David Milne, Ian H Witten. An effective, low-cost measure of se- mantic relatedness obtained from Wikipedia links [ A]. In: Taylor Matthew, Dfiessens Kurt, Fern Alan eds. Proc. of the 23th Associ- ation for the Advancement of Artificial Intelligence [ C ]. Chicago: AAAI Press,2008:25-30.
  • 7Thomas K Landauer, Peter W Foltz, Darrell Laham. An introduc- tion to latent semantic analysis [ J]. Discourse Processes, 1998,25 (2-3) :259-284.
  • 8Liu Qun,Li Su-jian. Word slmHarlty computing based on how-net [ J]. International Journal of Computational Linguistics & Chinese Language Processing,2002,7 (2) :59-76.
  • 9Michael S~rube, Shnone Paolo Ponzetto. WfidRelate computing se- mantic relatedness using Wikipedia [ A]. In: Anthony Colin, Uni-versity of Leeds, eds. Proceedings of the 21th American Associa- tion for Artificial Intelligence [ C ]. Boston: AAAI Press, 2006: 1419-t424.
  • 10Jay J Jiang, David W Conrath. Semantic s'nnilarity based on corpus statistics and lexical taxonomy [ C]. In Proceedings of Internation- al Conference Research on Computational Linguistics, Taiwan, 1997 : 1-15.

共引文献163

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部