一种基于IC参数的知识图谱嵌入方法

Knowledge Graph Embedding Based on IC Parameters

下载PDF

导出

摘要 TransC是一种高效的知识图谱嵌入方法,通过区分概念和实例来建立概念、实例及关系的嵌入。TransC将概念编码为球体,球体半径被随机初始化并在训练中迭代更新。由此导致模型出现两个问题:一是训练得到的部分球体半径与模型训练目标不符;二是忽略了概念本身提供的语义信息。针对上述两个问题,该文提出了TransIC模型,首先,基于IC参数给出新的概念球体半径求解方法,使求得的半径满足TransC目标,并且丰富了概念嵌入向量的语义信息。其次,该模型以TransC为基础,在概念编码阶段引入基于IC参数的概念球体半径。最后,在公开的数据集YAGO39K上完成链接预测和三元组分类两个任务,并将该文方法实验所得性能与TransC及其他模型的性能进行对比。结果表明,TransIC在多数指标上均取得显著提升。 TransC is an efficient method for embedding knowledge graphs.It establishes the embedding of concepts,instances,and relations by distinguishing concepts and instances.TransC encodes the concept as a sphere,and the radius of the sphere is randomly initialized and updated iteratively during training.This leads to two problems in the model.First,part of the sphere radius obtained from training does not match the model training target.Second,the semantic information provided by the concept itself is ignored.This paper proposes a model named TransIC to deal with the two issues above.TransIC adopts a novel concept sphere radius solution method based on IC parameters,so that the obtained radius meets the TransC goal,and enriches the semantic information of the concept embedding vector.Then it is based on TransC and introduces a concept sphere radius based on IC parameters during the concept coding phase.Finally,the two tasks of link prediction and triple classification are completed on the public data set YAGO39 K,and the experimental performance of the method in this paper is compared with the performance of TransC and other models.The results show that TransIC has achieved a significant improvement in most indicators.

作者赵晓函周子力李天宇陈丹华王凯莉 ZHAO Xiaohan;ZHOU Zili;LI Tianyu;CHEN Danhua;WANG Kaili(School of Cyber Science and Security,Qufu Normal University,Qufu,Shandong 273100,China)

机构地区曲阜师范大学网络空间安全学院

出处《中文信息学报》 CSCD 北大核心 2021年第10期48-55,共8页 Journal of Chinese Information Processing

基金国家自然科学基金(61871185) 山东省自然科学基金(ZR2017MD019) 教育部高教司产学合作协同育人项目(201701020098) 赛尔网络下一代互联网技术创新项目(NGII20190516)

关键词知识图谱嵌入 TransC 信息量 knowledge graph embedding TransC information content

分类号 TP391.1 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献4

1刘知远,孙茂松,林衍凯,谢若冰.知识表示学习研究进展[J].计算机研究与发展,2016,53(2):247-261. 被引量：259
2方阳,赵翔,谭真,杨世宇,肖卫东.一种改进的基于翻译的知识图谱表示方法[J].计算机研究与发展,2018,55(1):139-150. 被引量：51
3朱艳丽,杨小平,王良,张志宇.TransRD：一种不对等特征的知识图谱嵌入表示模型[J].中文信息学报,2019,33(11):73-82. 被引量：9
4游彬,严岳松,孙英阁,刘靖.基于HowNet的信息量计算语义相似度算法[J].计算机系统应用,2013,22(1):129-133. 被引量：16

二级参考文献101

1刘群李素建.基于《知网》的词汇语义相似度计算[A]..Computational Linguistics and Language Processing[C].,2002.7.2:59-76.
2Pirro G A Semantic Similarity Metric Combining Features and Intrinsic Information Content, Data & Knowledge Engineering 68, 2009:1289-1308.
3Xiao B, Xue LM, Zhao Y. Scrneme Description Based Estimating for Semantic Orientation of Chinese Vocabulary, in Proceedings Of The 2010 Intcrnational Conference on Computer Application and System Modeling (ICCASM2010), 2010: 671-674.
4Li Y, Bandar A, McLean D. An approach for measuring semantic similarity between words using multiple informa- tion sources, IEEE Trans. on Knowledge and Data Engineering, 2003,15(4):871-882.
5Resnik P. Information content to evaluate semantic similarity in a taxonomy. Proceedings of IJCAI, 1995,448-453.
6Resnik E Semantic Similarity in a Taxonomy: An Informa- tion-Based Measure and Its Application to Problems of Ambiguity in Natural Language. Journal of Artificial Intelligence Research, 1999,11:95-130.
7Seco N, Veale T, Hayes J. An intrinsic information content metric for semantic similarity in WordNet. Proe. of ECAI. 2004. 1089-1090.
8Jiang J, Conrath D. Semantic similarity based on corpus statistics and lexical taxonomy. Proe. of the International Conference on Research in Computational Linguistics. 1998.
9Lin D, An information-theoretic definition of similarity, in Proc. of the 15th International Conf. on Machine Learning. Morgan Kaufrnann, San Francisco, CA, 1998. 296-304.
10HowNet. HowNet's Home Page. http://www.keenage.com.

共引文献305

1余传明,李浩男,王曼怡,黄婷婷,安璐.基于深度学习的知识表示研究:网络视角[J].数据分析与知识发现,2020,4(1):63-75.
2张骁雄,杨琴琴,何浩然,丁鲲.面向俄乌冲突的时序知识图谱推理系统设计与实现[J].网络安全与数据治理,2023,42(S01):157-162.
3詹威威,程序,蔡惠民,刘汪洋,王彬,余正涛.基于综合影响力模型的改进EvolveKG方法及应用研究[J].计算机应用研究,2020,37(S01):159-162.
4阿布都克力木·阿布力孜,张雨宁,阿力木江·亚森,郭文强,哈里旦木·阿布都克里木.预训练语言模型的扩展模型研究综述[J].计算机科学,2022,49(S02):43-54. 被引量：11
5王永康,艾山·吾买尔,顾亚东,何江涛.TransREF:一种改进的基于邻域信息的知识表示模型[J].电子测量技术,2023,46(21):7-15.
6郝卫,魏赟.基于知识图谱表示学习的推荐算法优化[J].智能计算机与应用,2020,10(4):22-26. 被引量：3
7甘惟,吴志强,王元楷,徐浩文,严娟,何珍,赵紫辰.AIGC辅助城市设计的理论模型建构[J].城市规划学刊,2023(2):12-18. 被引量：14
8许升健.年薪制的困惑[J].金山企业管理,2000(1):40-41.
9范弘屹,张仰森.一种基于HowNet的词语语义相似度计算方法[J].北京信息科技大学学报（自然科学版）,2014,29(4):42-45. 被引量：12
10李国佳.基于知网的中文词语相似度计算[J].智能计算机与应用,2015,5(3):49-52. 被引量：2

1蒋建纲.在实验中建构科学概念[J].小学科学,2021(12):52-53.
2黄发杰,孟迎芳,邵丹妮.提取干扰对知觉和概念启动的影响[J].心理科学,2020,43(6):1289-1295. 被引量：2
3胡军,许正康,刘立,钟福金.融合多粒度社区信息的网络嵌入方法[J].计算机应用,2022,42(3):663-670.
4徐炜.聚焦新高考指向高阶思维的“混合式教学”实践探索--以“细胞增殖”为例[J].数理化解题研究,2022(3):143-144. 被引量：1
5区恩海.基于组合关系翻译的知识表示学习模型[J].计算机科学与应用,2022,12(3):654-661.
6朱肖磊,吴训成.车辆姿态感知注意力增强的车辆重识别[J].电子测量技术,2021,44(24):91-97. 被引量：2
7王慧,罗梦,李畅,王红娜.层次分析法和机器学习算法的煤矿区绿化环境治理绩效评价[J].自动化与仪器仪表,2022(1):83-85. 被引量：1
8孙玉文.汉语史学科建设问题:总体趋势与分支走向[J].湖北大学学报（哲学社会科学版）,2022,49(1):79-94. 被引量：4
9伍杰华,高学勤,王涛.融合链接预测相似度矩阵的属性网络嵌入算法[J].计算机应用研究,2022,39(4):1080-1085. 被引量：1
10杨茜雯,朱萌.基于ARIMA模型对扬州市PM_(2.5)的分析和预测[J].黑龙江环境通报,2022,35(1):35-37. 被引量：2

中文信息学报

2021年第10期

浏览历史

内容加载中请稍等...

一种基于IC参数的知识图谱嵌入方法

参考文献4

二级参考文献101

共引文献305

相关作者

相关机构

相关主题

浏览历史