面向材料实验数据的本体生成与实例匹配方法(英文)

Method of Generating Ontologies and Instance Matching for Material Experiment Data

下载PDF

导出

摘要随着信息技术的进步,在线数据共享等应用成为研究焦点.现有材料实验数据存储结构为复杂表,难以直接转换为二维表;数据的结构、存储方式多样;难以数据分享.为解决材料领域异构数据间共享,本文提出以基于规则的本体生成方案,实现由复杂表生成本体的过程.从复杂表生成本体速度比从复杂表解析入库快五倍.为实现数据共享,本文提出利用本体实例匹配寻找相似信息.常用匹配工具对材料实验本体的实例匹配结果不佳.本文分析其原因并针对材料领域数据源当前情况,提出基于TF-IDF算法的两种改进匹配方案,改善了在缺乏领域知识和词典下的匹配结果.为整个材料数据生态环境的建设探索出一条实现路线.其与现有常用实例匹配工具相比在材料实验数据的实验结果更适合. With the development of information technology,applications such as online data sharing have become increasingly popular.The multiform data types in material experimental data sets cause information problems,increasing the challenges of discovering relationships among sources.To solve this data sharing problem,a rulebased automatic algorithm that transforms various complex tables in the materials research field to ontology information is proposed in this paper.Furthermore,an instance-matching method based on TF-IDF algorithm and its two improving schemes are also proposed.The experimental results indicate that the existing ontology matching tools work well with the ontology results,which are generated approximately five times faster than the approach of generating databases from complex tables.But the common tools work not well in instance matching.This paper analyzes the reason and proposes an improved matching scheme based on TF-IDF algorithm to the current situation of the data source in the material field,which lacks of domain knowledges and dictionary.The method explores an implementation route for the construction of the entire material data ecological environment.The experiment result of the method is more feasible than the common tools in this situation.

作者马致远曹旻 MA Zhiyuan;CAO Min;School of Computer Engineering and Science;Shanghai University;

机构地区上海大学计算机工程与科学学院

出处《复旦学报（自然科学版）》 CAS CSCD 北大核心 2018年第5期565-579,共15页 Journal of Fudan University：Natural Science

基金 Project supported by the Shanghai Municipal Science and Technology Commission(15DZ2260300)

关键词本体实例匹配 TF-IDF算法树型结构复杂表结构 ontology instance matching TF-IDF algorithms tree structures table structure

分类号 TP18 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献1

1Chao Shao,Lin-Mei Hu,Juan-Zi Li,Zhi-Chun Wang,Tonglee Chung,Jun-Bo Xia.RiMOM-IM： A Novel Iterative Framework for Instance Matching[J].Journal of Computer Science & Technology,2016,31(1):185-197. 被引量：5

二级参考文献26

1Shvaiko P, Euzenat J. Ontology matching: State of the art and future challenges. IEEE Trans. Knowl. Data Eng., 2013, 25(1): 158-176.
2Ferrara A, Nikolov A, Noessner J et al. Evaluation of instance matching tools: The experience of OAEI. Web Smantics: Science, Services and Agents on the World Wide Web, 2013, 21: 49-60.
3Bellahsene Z, Bonifati A, Rahm E. Schema Matching and Mapping. Springer-Verlag Berlin, Heidelberg, 2011. Huber J, Sztyler T, Noessner Jet al. CODI: Combinato- rial optimization for data integration Results for OAEI 2011.
4In Proc. the 6th International Workshop on Ontology Matching, Oct. 2011, pp.134-141.
5Volz J, Bizer C, Gaedke M, Kobilarov G. Discovering and maintaining links on the web data. In Proc. the 8th Inter- national Semantic Web Conference, Oct. 2009, pp.650-665.
6Suchanek F M, Abiteboul S, Senellart P. PARIS: Probabilis- tic alignment of relations, instances, and schema. PVLDB, 2011, 5(3): 157-168.
7Lacoste-Julien S, Palla K, Davies A, Kasneci G, Graepel T, Ghahramani Z. SIGMa: Simple greedy matching for aligning large knowledge bases. In Proc. the 19th ACM SIGKDD International Conference on Knowledge Discov- ery and Data Mining, Aug. 2013, pp.572-580.
8Li 3, Tang J, Li Y, Luo Q. RiMOM: A dynamic multistrat- egy ontology alignment framework. IEEE Trans. Knowl. Data Eng., 2009, 21(8): 1218-1232.
9B6hm C, de Melo G, Naumann F, Weikum G. LINDA: Distributed web-of-data-scale entity matching. In Proc. the 21st CIKM, Oct.29-Nov.2, 2012, pp.2104-2108.
10Diallo G, Ba M. Effective method for large scale ontology matching. In Proc. the 5th SWAT4LS, Nov. 2012.

共引文献4

1漆桂林,高桓,吴天星.知识图谱研究进展[J].情报工程,2017,3(1):4-25. 被引量：226
2叶霞,许飞翔,曹军博,王馨.基于主成分分析和K-Modes蚁群聚类的本体映射方法[J].计算机应用与软件,2020,37(12):231-237. 被引量：8
3Zeynep Banu Ozger,Nurgul Yuzbasioglu Uslu.An Effective Discrete Artificial Bee Colony Based SPARQL Query Path Optimization by Reordering Triples[J].Journal of Computer Science & Technology,2021,36(2):445-462.
4Anitha Velu,Menakadevi Thangavelu.Ontology Based Ocean Knowledge Representation for Semantic Information Retrieval[J].Computers, Materials & Continua,2022(3):4707-4724. 被引量：1

1孙煜飞,马良荔,解嘉宇.基于遗传规划和主动学习的本体实例匹配[J].计算机应用研究,2018,35(5):1380-1385. 被引量：1
2本刊讯.中国科学院计算技术研究所发布深度文本匹配开源工具MatchZoo[J].数据分析与知识发现,2018,2(2):10-10.
3曾兵彬.用道通MX808IM为奥迪A6车增加钥匙的方法[J].汽车维护与修理,2017,0(9):81-81.
4刘家成,王艺憬,孙燕红.基于TF-IDF算法和K-means聚类的商品评论与价格波动相关性研究——以ThinkPad电脑为例[J].科技创业月刊,2018,31(7):45-49. 被引量：2
5王蕾,魏耕耘.对外汉语阅读教材话题的聚类研究[J].湖北第二师范学院学报,2018,35(9):86-90.
6张晶华,曹建梅,刘晓曦,乔磊,苏丹,杨祥来,陈伟杰,温斌.电力云存储系统技术架构分析与优化方案研究应用[J].国网技术学院学报,2018,21(5):3-6.
7田大芳,张瑞丽,魏瑞斌.基于关键词的期刊发文的相似性测度研究[J].现代情报,2018,38(11):105-108. 被引量：8
8陈希,王娟.智能平台下考虑主体心理行为的医疗服务供需匹配方法[J].运筹与管理,2018,27(10):125-132. 被引量：10
9向霞.试析装饰装修工程造价管理与成本控制[J].名城绘,2018,0(10):0368-0368.
10曾兵彬.用道通MX808IM执行2013款宝马X5钥匙匹配[J].汽车维修与保养,2017,0(10):82-82.

复旦学报（自然科学版）

2018年第5期

浏览历史

内容加载中请稍等...

面向材料实验数据的本体生成与实例匹配方法(英文)

参考文献1

二级参考文献26

共引文献4

相关作者

相关机构

相关主题

浏览历史