
Method of Preposition-verb Pattern Acquisition
Cited by: 1
Abstract: Pattern-based knowledge acquisition is an important research area in knowledge acquisition from text (KAT), and a central question in it is how to harvest textual knowledge patterns. This paper proposes a novel method for acquiring preposition-verb patterns (PV patterns). First, candidate preposition-verb pairs (PV pairs) are generated and then filtered by a combination of a rule-based method and statistical methods. Two criteria are used to judge the quality of a PV pair: the representational power of the pattern words (their coverage of instances of semantic relations), and the relevance between pattern words and concept words and among the concept words themselves. Based on these two criteria, six numeric features are constructed, and three classifiers are trained on them. The precision of the three classifiers, estimated by cross-validation, reaches 0.853, 0.862, and 0.856, respectively. These classifiers provide a basis for automatically selecting PV patterns from PV pairs.
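The first step in the abstract (generating candidate PV pairs and filtering them statistically) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the tag names `"p"` and `"v"`, the rule that a preposition pairs with every later verb in the same sentence, the raw-frequency threshold `min_count`, and the toy corpus.

```python
from collections import Counter

def candidate_pv_pairs(tagged_sentence):
    """Return (preposition, verb) pairs in which the preposition precedes the verb.

    tagged_sentence is a list of (word, tag) tuples; the tags "p" and "v"
    are hypothetical labels for prepositions and verbs.
    """
    pairs = []
    for i, (word_i, tag_i) in enumerate(tagged_sentence):
        if tag_i != "p":
            continue
        # Pair this preposition with every verb that follows it in the sentence.
        for word_j, tag_j in tagged_sentence[i + 1:]:
            if tag_j == "v":
                pairs.append((word_i, word_j))
    return pairs

def filter_by_frequency(corpus, min_count=2):
    """Keep only candidate pairs seen at least min_count times (an assumed filter)."""
    counts = Counter()
    for sentence in corpus:
        counts.update(candidate_pv_pairs(sentence))
    return {pair: n for pair, n in counts.items() if n >= min_count}

# Toy POS-tagged corpus, invented for this example.
toy_corpus = [
    [("会议", "n"), ("在", "p"), ("北京", "n"), ("举行", "v")],
    [("比赛", "n"), ("在", "p"), ("上海", "n"), ("举行", "v")],
    [("委员会", "n"), ("由", "p"), ("专家", "n"), ("组成", "v")],
]

kept_pairs = filter_by_frequency(toy_corpus, min_count=2)
```

With this toy corpus, only the pair that occurs twice survives the threshold. The surviving pairs would then be scored with the six numeric features and passed to the classifiers, neither of which this sketch attempts to reproduce.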
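Source: Computer Science (《计算机科学》), CSCD, Peking University Core Journal, 2008, Issue 11, pp. 139-143 (5 pages)
Funding: Supported by the National Natural Science Foundation of China (60496326, 60573063, 60573064, and 60773059) and the 863 Program (2007AA01Z325)
Keywords: Knowledge acquisition from text, Text pattern acquisition, Pattern classification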











