期刊文献+

词汇语义知识库的研究现状与发展趋势 被引量:4

State-of-the-Art and Prospect of Lexical Semantic Knowledge Bases
下载PDF
导出
摘要 作为文本内容理解的媒介与载体,词汇语义知识库已被广泛应用于信息检索、信息提取、问答系统、自动文摘等方面,成为自然语言处理不可或缺的基础资源。本文介绍词汇语义知识库研究与开发的现状,重点分析了WordNet、SinicaBOW、HowNet及CCD等具有代表性的词汇语义知识库的具体情况。在此基础上,盘点各种需求和解决方案,提出词汇语义知识库研究面临新的挑战和机遇,即本体化和多语化的大趋势,它们将从不同方面弥补词汇语义知识库在知识共享和知识交流上的不足,使其更好地为自然语言处理服务。本文最后探讨了词汇语义知识库未来发展中可能存在的问题和新的课题。 As the semantic knowledge carrier for text understanding by computers, lexical semantic knowledge bases have been widely used in natural language processing such as machine translation, infonnation retrieval, etc. We give a detailed analysis of the state-of-the-art of and tendencies in them, which are referred to as ontologieahzation and multilinguahzation respectively. These two developments could remedy of the defects of knowledge share and knowledge communication of lexical semantic knowledge bases. The existing problems and future directions are at last put forward.
作者 朱虹 刘扬
出处 《情报学报》 CSSCI 北大核心 2008年第6期870-877,共8页 Journal of the China Society for Scientific and Technical Information
基金 本文相关研究得到国家973计划(2004CB318102)、国家自然科学基金项目(60775031)和全国博士学位论文作者专项资金资助项目(200514)的支持.
关键词 本体 词汇语义知识库 多语 自然语言处理 ontology, lexical semantic knowledge base, multilingualization, natural language processing
  • 相关文献

参考文献33

  • 1俞士汶,朱学锋,段慧明,等.汉语词汇语义研究及词汇知识库建设[C].第七届汉语词汇语义学研讨会论文集.大会特邀报告.台湾,2006.
  • 2Fellbaum C. WordNet-An Electronic Lexical Database[ M]. MIT Press, 1998.
  • 3Pedersen T, Patwardhan S, Michelizzi J. WordNet: Similarity-Measuring the Relatedness of Concepts [ C ]. In Proceedings of the Nineteenth National Conference on Artificial Intelligence (AAAI-04). California: San Jose, 2004: 1024-1025.
  • 4Voorhees E M. Using WordNet to disambiguate word senses for text retrieval [ C ]//Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Pittsburgh, PA. 1993: 171-180.
  • 5Bamden J. Implications of an AI Metaphor Understanding Project [ C]//Proceedings of the Second Global WordNet Conference. Brno: Czech Republic, 2004: 7-7.
  • 6Alessandro A, Magnini B, Strapparava C. Lexical discrimination with the Italian version of WordNe [ C ]// Proceedings of the ACL/EACL Workshop on Automatic Extraction and Building of Lexical Semantic Resources for Natural Language Apphcations. Madrid, 1997: 32-38.
  • 7Eneko A. Clustering of Word Senses[ C]//Proceedings of the Second Global WordNet Conference. Bmo: Czech Republic, 2004: 4-4.
  • 8Hotho A. WordNet improves Text Document Clustering [ C ]//proceeding of the SIGIR2003 Semantic Web Workshop. Canada, 2003.
  • 9Huang Chu-Ren, Chang Ru-Yng, Shiang Bin Lee. Sinica BOW ( Bilingual Ontological Wordnet ): Integration of Bilingual WordNet and SUMO [ C ].//Proceedings of LREC2004. Lisbon, 2004: 1553-1556.
  • 10Dong Z D, Dong Q. Ontology and HowNet[OL]. [2006-04- 23 ]. http://www. keenage. com/html/e_ index. html.

二级参考文献17

  • 1郝秀兰,杨尔弘.基于小规模语料库和机器可读词典的二元分布语义获取[J].中文信息学报,2004,18(6):23-29. 被引量:2
  • 2黄昌宁,李涓子.词义排歧的一种语言模型[J].语言文字应用,2000(3):85-90. 被引量:16
  • 3Dagan Ido,et al.Similarity-based models of word co-occurrence probabilities.Machine Learning,1999,34:43-69.
  • 4Kohonen T.Self organization of a massive document collection.IEEE Transactions on Neural Networks,2000,11(3).
  • 5Kohonen T.Self-organization of very large document collections:State of the art//Proceedings of ICANN'98,London,1998:65-74.
  • 6Kohonen T.Self-organizing Maps.2nd ed.Spring Publisher,1997.
  • 7Ma Q,et al.Self-organization of Chinese semantic maps using TFIDF term weighting//Proceedings of NLPNN'01,Tokyo,2001a.
  • 8Ma Q,et al.Emergence of Chinese semantic maps from self-organization//Proceedings of ICONIP'01,Shanghai,2001b:681-686.
  • 9Zhang M,et al.Optimizing feature encoding for self-organizing Chinese semantic map//Proceedings of NLPNN'01.Tokyo,2001.
  • 10Donald Hindle. Noun Classification from PredicateArgument Structures[A]. In.. Proceedings of the 28th Annual Meeting of the ACL[C]. Pennsylvania: Association for Computational Linguistics, 1990, 268-275.

共引文献6

同被引文献36

  • 1王锦,陈群秀.汉语述语形容词机器词典机器学习词聚类研究[J].中文信息学报,2007,21(3):40-46. 被引量:3
  • 2Navigli R. Meaningful Clustering of Senses Helps Boost Word Sense Disambiguation Performance [C]// Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, COLING-ACL, 2006: 105-112.
  • 3Agirr E. and Soroa A. Evaluating Word Sense Induction and Discrimination Systems [C]//Proceedings of the 4th International Workshop on Semantic Evalua- tions (SemEval-2007), 2007: 7-12.
  • 4Schiitze H. Automatic Word Sense Discrimination[J]. Computational Linguistics, 1998, 24 ( 1 ): 97- 124.
  • 5Purandare A. and Pedersen T. Sense Clusters-Finding Clusters that Represent Word Senses [C]//Proceedings of 19th Conference on Artificial Intelligence (AAAI-04), San Jose, CA. 2004.
  • 6Niu, ZY. Ji, DH. Tan, CL. Learning word senses with feature selection and order identification capabilities [C]//Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, Barcelona, Spain. 2004.
  • 7Pantel P. Lin DK. Discovering Word Senses from Text [C]//Proeeedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining. Edmonton, Canada. 2002: 613-619.
  • 8Fellbaum, C. WordNet - An Electronic Lexical Database [M]. MIT Press, 1998.
  • 9Velldal, E. A Fuzzy clustering approach to word sense discrimination [C]//Proceedings of the 7th International conference on Terminology and Knowledge Engineering, Copenhagen, Denmark. 2005.
  • 10Zhao Y. Karypis G. Hierachical Clustering Algorithms for Document Datasets [J].Data Mining and Knowledge Discovery, 2005, 10: 141-168.

引证文献4

二级引证文献13

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部