期刊文献+

基于Web的概念属性获取方法研究

Research on a Method of Conceptual Attribute Acquisition Based on Web
下载PDF
导出
摘要 属性是概念的内涵表达,描述概念的特征或性质,通过属性可以区分不同的概念,发现它们之间的差异。属性具备描述概念和鉴别概念的功能。基于Web的属性获取是指对给定的概念从Web网页中自动获取其属性集合。属性获取是概念知识获取的起点,也是领域本体自动构建的关键。文中从文本知识获取的角度对属性进行分类,并结合属性的元性质,探讨属性名称在Web语料中的基本表达方式(词汇句法模式),并通过词汇句法模式从大规模语料中获取属性名称,并且提出了基于统计和语义的候选属性验证方法。最后利用属性迭代获取模式进行属性迭代获取。通过几组概念的实例进行属性获取,实验结果表明,文中方法获取的属性的准确率较高。 An attribute is the expression of connotation, which is used to explain some property of the conceptual word, and distinguish different concepts, and find their discrepancy. An conceptual word with attribute names are not an isolated vocabulary entry any more. Web-based attribute-acquisition is to acquire a set of attribute names from Web pages automatically for each given concept, enriching the semantics of the concept. Attribute acquisition is also a significant step of general knowledge acquisition from text, and an important task in automatic construction for domain ontologies. It makes a basic classification of attributes according to text knowledge acquisition in this paper and explores basic expressions (lexico-syntactical patterns) for attribute names in multi-linguistic Web corporal. After acquiring attribute names from large-scale corpus by patterns, a method based on statistics and semantic is proposed to validate. At last, attribute it- eration patterns are applied to acquire new attribute names through iteration method. The results show that the precision of attribute acqui- sition is very high through the experiment of several group concepts.
出处 《计算机技术与发展》 2016年第8期12-16,共5页 Computer Technology and Development
基金 国家自然科学基金资助项目(61203284) 国家社科基金重点项目(10AYY003)
关键词 知识获取 概念 属性 属性获取 语义 knowledge acquisition concept attribute attribute acquisition semantic
  • 相关文献

参考文献12

  • 1董振东,董强,郝长伶.知网的理论发现[J].中文信息学报,2007,21(4):3-9. 被引量:99
  • 2中文维基百科.维基媒体基金会[EB/OL].2002.http://zh.wikipedia.org/.
  • 3Reddy R. Three open problems in AI[ J]. Journal of the ACM, 2003,50( 1 ) :83-86.
  • 4Miller G. WordNet :a lexical database for English [ J ]. Commu- nications of the ACM, 1995,38 ( 11 ) :39-41.
  • 5Hearst M. Automatic acquisition of hyponyms from large text corpora[ C]//Proc of 14th international conference on compu- tational linguistics. [s. l. ] :[s. n. ] ,1992:539-545.
  • 6田国刚.受限中文语料的自监督文本知识获取研究[D].北京:中国科学院计算技术研究所,2007.
  • 7Yamada I, Baldwin T. Automatic discovery of telic and agen- tive roles from corpus data[ C ]//Proceedings of the 18th Pa- cific Asia conference on language, information and computa- tion. [s. l. ] :[s.n. ] ,2004.
  • 8Brin S. Extracting patterns and relations from the world wide web [ C ]//Proc of selected papers from the international work- shop on the world wide web and databases. [ s. l. ] : [ s. n. ], 1998 : 172-183.
  • 9Zhao J, Liu H, Lu R. Automatic extending HowNet's attribute lexicon on the web. signal-image technologies and internet- based system [ C ]//Proc of SITIS 97. [ s. l. ] : [ s. n. ] ,2007 : 315-320.
  • 10Cimiano P, Wenderoth J. Automatically learning qualia struc- tures from the web[ C]//Proceedings of the ACL workshop on deep lexical acquisition. [ s. l. ] :[ s. n. ] ,2005:28-37.

二级参考文献4

  • 1Dong.Zhendong.Knowledge description:what,how,and who?[A].Manuscript & Program of International Symposium on Electronic Dictionary[C].Tokyo:1988.18.
  • 2http://afflatus.ucd.ie The Creative Language System Group
  • 3www.is.sinica.edu.tw/pages/kchen/publications-e.html.
  • 4Zhendong Dong,Qiang Dong.HowNet and the Computation of Meaning[M].Singapore:World Scientific Publishing Company,2006.

共引文献102

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部