期刊文献+

领域本体概念实例、属性和属性值的抽取及关系预测 被引量:31

Extraction and relation prediction of domain ontology concept instance, attribute and attribute value
下载PDF
导出
摘要 研究了如何使用协作分类器(协作使用条件随机场(CRFs)和支持向量机(SVM))解决领域概念实例、属性及属性值的抽取以及它们三者之间对应关系预测的问题.首先将概念实例、属性及属性值看作三类实体,把概念实例、属性及属性值的抽取问题转化为命名实体识别问题,利用条件随机场建模进行命名实体识别;在此基础上定义实体间对应关系,对概念实例、属性及属性值三者的对应关系做预测,把概念实例、属性与属性值三者之间存在关系的向量标记为1,否则标记为0,利用支持向量机建模进行关系的预测.且以云南旅游景点概念实例、属性及属性值进行六组相关的实验.实验表明,在开放测试中协作分类器精确度达到84.4%、召回率达到82.7%及F值达到为83.6%,相比于词语共现F值提高了20个百分点. This paper studies how to use the Collaboration Classifier (Conditional Random Fields (CRFs) and Support Vector Machine (SVM)) to solve the extraction and relation prediction problem of ontology concept instance, attribute and attribute value. Firstly, taken concept instance, attribute and attribute value as three entities, the problem of extraction these three entities was converted to a named entity recognition problem, CRFs classifier model was adopted to recognize entities; Furthermore, made a definition for the relations between the concept instance, attribute and attribute value and made relations prediction among concept instance, attribute andattribute value after they were identified respectively, if there is a relationship among the concept instance, attribute and attribute value, marked 1, otherwise marked 0, then use SVM classifier model to make predictions on entity corresponding relation. Taking six trials on concept instance, attribute and attribute value on Yunnan tourist attractions for instance, the experiment is done to make that the accuracy rate of Collaborative Classifier achieves 84.4% and recall rate is up to 82.7% and the F score is 83.6% ,compared to Words Co-occurrence model, its F- score increased by 20%.
出处 《南京大学学报(自然科学版)》 CAS CSCD 北大核心 2012年第4期383-389,共7页 Journal of Nanjing University(Natural Science)
基金 国家自然科学基金(60863011) 云南省自然科学基金(2008CC023) 云南省中青年学术技术带头人后备人才项目(2007PY01-11) 云南省教育厅基金(07Z11139)
关键词 领域本体 概念实例抽取 属性抽取 属性值抽取 条件随机场 支持向量机 domain ontology, concept instance extraction, attribute extraction, attribute values extraction,conditional random fields, support vector machine.
  • 相关文献

参考文献16

  • 1Eric T, Wang W M. A cgncept-relationship ac- quisition and inference approach for hierarchical taxonomy construction from tags. Information Processing and Management: An International Journal, 2010, 46(1):44-57.
  • 2Sanchez D. A methodology to learn ontological attributes from the Web. Data and Knowledge Engineering, 2010, 6(69):57-597.
  • 3Poesio M, Almuhareb A. Identifying concept attributes using a classifier. Proceedings of the ACL-SIGLEX Workshop on Deep Lexical Ac- quisition, Ann Arbor, 2005,18-27.
  • 4Yoshinaga N, Torisawa K. Open-domain at- tribute-value acquisition from semi-Structured texts. Proceedings of the OntoLex 2007,Busan, South-Korea, 2007, 55-66.
  • 5Ravi S, Pasca M. Using structured text for large-scale attribute extraction. Proceedings of the 17^th International Conference on Information and Knowledge Management. Napa Valley, California, USA, 2008, 1183-1192.
  • 6康为,穗志方.基于Web弱指导的本体概念实例及属性的同步提取[J].中文信息学报,2010,24(1):54-59. 被引量:4
  • 7叶正,林鸿飞,苏绥,刘菁菁.基于支持向量机的人物属性抽取[J].计算机研究与发展,2007,44(z2):271-275. 被引量:11
  • 8郭剑毅,薛征山,余正涛,张志坤,张宜浩,姚贤明.基于层叠条件随机场的旅游领域命名实体识别[J].中文信息学报,2009,23(5):47-52. 被引量:36
  • 9Darroch J,I.auritzen S,Speed T. Markov fields and log-linear interaction models for contingency tables. Annals of Statistics, 1980, 8 ( 3 ): 522-539.
  • 10Della P S, Della P V, Lafferty J. Inducting fea- tures of random fields. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1997, 19(4):380-393.

二级参考文献74

共引文献90

同被引文献326

引证文献31

二级引证文献1046

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部