基于Web的概念属性获取方法研究

Research on a Method of Conceptual Attribute Acquisition Based on Web

下载PDF

导出

摘要属性是概念的内涵表达,描述概念的特征或性质,通过属性可以区分不同的概念,发现它们之间的差异。属性具备描述概念和鉴别概念的功能。基于Web的属性获取是指对给定的概念从Web网页中自动获取其属性集合。属性获取是概念知识获取的起点,也是领域本体自动构建的关键。文中从文本知识获取的角度对属性进行分类,并结合属性的元性质,探讨属性名称在Web语料中的基本表达方式(词汇句法模式),并通过词汇句法模式从大规模语料中获取属性名称,并且提出了基于统计和语义的候选属性验证方法。最后利用属性迭代获取模式进行属性迭代获取。通过几组概念的实例进行属性获取,实验结果表明,文中方法获取的属性的准确率较高。 An attribute is the expression of connotation, which is used to explain some property of the conceptual word, and distinguish different concepts, and find their discrepancy. An conceptual word with attribute names are not an isolated vocabulary entry any more. Web-based attribute-acquisition is to acquire a set of attribute names from Web pages automatically for each given concept, enriching the semantics of the concept. Attribute acquisition is also a significant step of general knowledge acquisition from text, and an important task in automatic construction for domain ontologies. It makes a basic classification of attributes according to text knowledge acquisition in this paper and explores basic expressions （lexico-syntactical patterns） for attribute names in multi-linguistic Web corporal. After acquiring attribute names from large-scale corpus by patterns, a method based on statistics and semantic is proposed to validate. At last, attribute it- eration patterns are applied to acquire new attribute names through iteration method. The results show that the precision of attribute acqui- sition is very high through the experiment of several group concepts.

作者刘亮亮汪平仄

机构地区上海对外经贸大学统计与信息学院江苏科技大学计算机科学与工程学院中国科学院计算技术研究所智能信息处理重点实验室

出处《计算机技术与发展》 2016年第8期12-16,共5页 Computer Technology and Development

基金国家自然科学基金资助项目(61203284) 国家社科基金重点项目(10AYY003)

关键词知识获取概念属性属性获取语义 knowledge acquisition concept attribute attribute acquisition semantic

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献12

1董振东,董强,郝长伶.知网的理论发现[J].中文信息学报,2007,21(4):3-9. 被引量：99
2中文维基百科.维基媒体基金会[EB/OL].2002.http://zh.wikipedia.org/.
3Reddy R. Three open problems in AI[ J]. Journal of the ACM, 2003,50( 1 ) :83-86.
4Miller G. WordNet :a lexical database for English [ J ]. Commu- nications of the ACM, 1995,38 ( 11 ) :39-41.
5Hearst M. Automatic acquisition of hyponyms from large text corpora[ C]//Proc of 14th international conference on compu- tational linguistics. [s. l. ] :[s. n. ] ,1992:539-545.
6田国刚.受限中文语料的自监督文本知识获取研究[D].北京:中国科学院计算技术研究所,2007.
7Yamada I, Baldwin T. Automatic discovery of telic and agen- tive roles from corpus data[ C ]//Proceedings of the 18th Pa- cific Asia conference on language, information and computa- tion. [s. l. ] :[s.n. ] ,2004.
8Brin S. Extracting patterns and relations from the world wide web [ C ]//Proc of selected papers from the international work- shop on the world wide web and databases. [ s. l. ] : [ s. n. ], 1998 : 172-183.
9Zhao J, Liu H, Lu R. Automatic extending HowNet's attribute lexicon on the web. signal-image technologies and internet- based system [ C ]//Proc of SITIS 97. [ s. l. ] : [ s. n. ] ,2007 : 315-320.
10Cimiano P, Wenderoth J. Automatically learning qualia struc- tures from the web[ C]//Proceedings of the ACL workshop on deep lexical acquisition. [ s. l. ] :[ s. n. ] ,2005:28-37.

二级参考文献4

1Dong.Zhendong.Knowledge description:what,how,and who?[A].Manuscript & Program of International Symposium on Electronic Dictionary[C].Tokyo:1988.18.
2http://afflatus.ucd.ie The Creative Language System Group
3www.is.sinica.edu.tw/pages/kchen/publications-e.html.
4Zhendong Dong,Qiang Dong.HowNet and the Computation of Meaning[M].Singapore:World Scientific Publishing Company,2006.

共引文献102

1钱小飞.语言数据资源建设中的关键问题及对策[J].语料库语言学,2021,8(2):94-105. 被引量：2
2张瑞霞,肖汉.基于知网的词图构造[J].华北水利水电学院学报,2008(3):53-56. 被引量：6
3陈锐,张蕾,卢春俊,牟力科.基于概念图的信息检索的查询扩展模型[J].计算机应用,2009,29(2):545-548.
4刘磊,曹存根.基于混合特征的上下位关系验证方法[J].计算机工程,2008,34(14):12-13. 被引量：4
5周波,蔡东风.基于条件随机场的中文组织机构名识别研究[J].沈阳航空工业学院学报,2009,26(1):49-52. 被引量：8
6张瑞霞,朱贵良,杨国增.基于知识图的汉语词汇语义相似度计算[J].中文信息学报,2009,23(3):116-120. 被引量：11
7苏晓路,李景,孟宪学,胡海燕,钱平.OWL Full表示的顶级本体到OWL DL的转换研究[J].现代图书情报技术,2009(2):39-45. 被引量：1
8王石,曹存根.WNCT:一种WordNet概念自动翻译方法[J].中文信息学报,2009,23(4):63-70. 被引量：6
9刘兴林.词汇语义知识库浅述[J].福建电脑,2009,25(9):47-48. 被引量：2
10周蓝海,蔡东风.多策略英汉词对齐方法的研究[J].计算机工程与设计,2009,30(17):4138-4140. 被引量：5

1苑金海,刘弘.基于遗传算法和K-medoids算法的产品设计文本知识获取[J].聊城大学学报（自然科学版）,2010,23(4):100-102. 被引量：1
2徐德智,肖文芳,王怀民.本体映射过程中的概念相似度计算[J].计算机工程与应用,2007,43(9):167-169. 被引量：16
3张阳,程亮.一种基于指针逻辑的代码安全属性分析方法[J].计算机学报,2009,32(6):1119-1125. 被引量：3
4余蕾,曹存根.基于Web语料的概念获取系统的研究与实现[J].计算机科学,2007,34(2):161-165. 被引量：6
5赵晓静,张月.PowerPoint文档的属性获取与自动阅卷的实现[J].科技风,2008(4):128-128.
6温春,石昭祥,杨国正.一种利用度属性获取本体概念层次的方法[J].小型微型计算机系统,2010,31(2):322-326. 被引量：5
7耿小玉,钟桂玲.Asp在个人防火墙上的应用[J].克山师专学报,2003,22(3):50-51.
8张希府,戴云徽,高志强.利用句法模式从术语词典中抽取语义关系[J].南京师范大学学报（工程技术版）,2008,8(4):43-45. 被引量：3
9杨宏宇,孙宇超,姜德全.基于SAML和PMI的授权管理模型[J].吉林大学学报（工学版）,2009,39(5):1321-1325. 被引量：3
10曾璇.基于句法模式的评教信息挖掘[J].电脑编程技巧与维护,2016(16):57-58. 被引量：2

计算机技术与发展

2016年第8期

浏览历史

内容加载中请稍等...

基于Web的概念属性获取方法研究

参考文献12

二级参考文献4

共引文献102

相关作者

相关机构

相关主题

浏览历史