MEIM: A Multi-Source Software Knowledge Entity Extraction Integration Model (cited by: 1)

Abstract: Entity recognition and extraction are the foundations of knowledge graph construction. Entity data in the field of software engineering come from different platforms and communities and have different formats. This paper divides multi-source software knowledge entities into unstructured data, semi-structured data, and code data. For these different types of data, Bi-directional Long Short-Term Memory (Bi-LSTM) with a Conditional Random Field (CRF), template matching, and abstract syntax trees are used and integrated into a multi-source software knowledge entity extraction integration model (MEIM) to extract software entities. The model can be updated continuously based on user feedback to improve accuracy. To deal with the shortage of entity annotation datasets, keyword extraction methods based on Term Frequency–Inverse Document Frequency (TF-IDF), TextRank, and K-Means are applied to annotation tasks. The proposed MEIM model is applied to the Spring Boot framework, where it demonstrates good adaptability. The extracted entities are used to construct a knowledge graph, which is applied to association retrieval and association visualization.
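The abstract mentions TF-IDF keyword extraction as one way to ease the shortage of annotated entity data. The sketch below is only an illustration of that general idea, not the authors' implementation: the function names `tokenize` and `tfidf_keywords`, the simple regex tokenizer, and the three sample sentences are all assumptions made for this example. It scores each term by its frequency within a document multiplied by the log of its inverse document frequency, and returns the highest-scoring terms as candidate entities for a human annotator to review.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple alphanumeric tokens."""
    return re.findall(r"[a-z][a-z0-9_]*", text.lower())


def tfidf_keywords(docs, top_k=3):
    """Return the top_k highest-scoring TF-IDF terms for each document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: the number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    results = []
    for tokens in tokenized:
        tf = Counter(tokens)
        total = len(tokens) or 1
        scores = {
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in tf.items()
        }
        # Sort by descending score, then alphabetically so ties are deterministic.
        ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
        results.append([term for term, _ in ranked[:top_k]])
    return results


# Hypothetical sample sentences, written for this example only.
docs = [
    "Spring Boot autoconfiguration creates beans for the application context",
    "The application context loads beans declared in configuration classes",
    "Use the actuator endpoint to monitor the application",
]
keywords = tfidf_keywords(docs)
for doc, terms in zip(docs, keywords):
    print(terms, "<-", doc)
```

Terms that occur in every document, such as "application" here, receive a score of zero and are never proposed, which is the property that makes TF-IDF useful for surfacing document-specific candidate entities.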

Authors: Wuqian Lv, Zhifang Liao, Shengzong Liu, Yan Zhang

Affiliations: School of Computer Science and Engineering; School of Information Technology and Management; School of Computing

Source: Computers, Materials & Continua (SCIE, EI), 2021, No. 1, pp. 1027-1042 (16 pages)

Funding: Zhifang Liao: Ministry of Science and Technology, Key Research and Development Project (2018YFB003800); Hunan Provincial Key Laboratory of Finance & Economics Big Data Science and Technology (Hunan University of Finance and Economics), 2017TP1025; HNNSF 2018JJ2535. Shengzong Liu: NSF61802120.

Keywords: Entity extraction; software knowledge graph; software data

Classification: TP3 [Automation and Computer Technology: Computer Science and Technology]

