期刊文献+

面向材料实验数据的本体生成与实例匹配方法(英文)

Method of Generating Ontologies and Instance Matching for Material Experiment Data
下载PDF
导出
摘要 随着信息技术的进步,在线数据共享等应用成为研究焦点.现有材料实验数据存储结构为复杂表,难以直接转换为二维表;数据的结构、存储方式多样;难以数据分享.为解决材料领域异构数据间共享,本文提出以基于规则的本体生成方案,实现由复杂表生成本体的过程.从复杂表生成本体速度比从复杂表解析入库快五倍.为实现数据共享,本文提出利用本体实例匹配寻找相似信息.常用匹配工具对材料实验本体的实例匹配结果不佳.本文分析其原因并针对材料领域数据源当前情况,提出基于TF-IDF算法的两种改进匹配方案,改善了在缺乏领域知识和词典下的匹配结果.为整个材料数据生态环境的建设探索出一条实现路线.其与现有常用实例匹配工具相比在材料实验数据的实验结果更适合. With the development of information technology,applications such as online data sharing have become increasingly popular.The multiform data types in material experimental data sets cause information problems,increasing the challenges of discovering relationships among sources.To solve this data sharing problem,a rulebased automatic algorithm that transforms various complex tables in the materials research field to ontology information is proposed in this paper.Furthermore,an instance-matching method based on TF-IDF algorithm and its two improving schemes are also proposed.The experimental results indicate that the existing ontology matching tools work well with the ontology results,which are generated approximately five times faster than the approach of generating databases from complex tables.But the common tools work not well in instance matching.This paper analyzes the reason and proposes an improved matching scheme based on TF-IDF algorithm to the current situation of the data source in the material field,which lacks of domain knowledges and dictionary.The method explores an implementation route for the construction of the entire material data ecological environment.The experiment result of the method is more feasible than the common tools in this situation.
作者 马致远 曹旻 MA Zhiyuan;CAO Min;School of Computer Engineering and Science;Shanghai University;
出处 《复旦学报(自然科学版)》 CAS CSCD 北大核心 2018年第5期565-579,共15页 Journal of Fudan University:Natural Science
基金 Project supported by the Shanghai Municipal Science and Technology Commission(15DZ2260300)
关键词 本体 实例匹配 TF-IDF算法 树型结构 复杂表结构 ontology instance matching TF-IDF algorithms tree structures table structure
  • 相关文献

参考文献1

二级参考文献26

  • 1Shvaiko P, Euzenat J. Ontology matching: State of the art and future challenges. IEEE Trans. Knowl. Data Eng., 2013, 25(1): 158-176.
  • 2Ferrara A, Nikolov A, Noessner J et al. Evaluation of instance matching tools: The experience of OAEI. Web Smantics: Science, Services and Agents on the World Wide Web, 2013, 21: 49-60.
  • 3Bellahsene Z, Bonifati A, Rahm E. Schema Matching and Mapping. Springer-Verlag Berlin, Heidelberg, 2011. Huber J, Sztyler T, Noessner Jet al. CODI: Combinato- rial optimization for data integration Results for OAEI 2011.
  • 4In Proc. the 6th International Workshop on Ontology Matching, Oct. 2011, pp.134-141.
  • 5Volz J, Bizer C, Gaedke M, Kobilarov G. Discovering and maintaining links on the web data. In Proc. the 8th Inter- national Semantic Web Conference, Oct. 2009, pp.650-665.
  • 6Suchanek F M, Abiteboul S, Senellart P. PARIS: Probabilis- tic alignment of relations, instances, and schema. PVLDB, 2011, 5(3): 157-168.
  • 7Lacoste-Julien S, Palla K, Davies A, Kasneci G, Graepel T, Ghahramani Z. SIGMa: Simple greedy matching for aligning large knowledge bases. In Proc. the 19th ACM SIGKDD International Conference on Knowledge Discov- ery and Data Mining, Aug. 2013, pp.572-580.
  • 8Li 3, Tang J, Li Y, Luo Q. RiMOM: A dynamic multistrat- egy ontology alignment framework. IEEE Trans. Knowl. Data Eng., 2009, 21(8): 1218-1232.
  • 9B6hm C, de Melo G, Naumann F, Weikum G. LINDA: Distributed web-of-data-scale entity matching. In Proc. the 21st CIKM, Oct.29-Nov.2, 2012, pp.2104-2108.
  • 10Diallo G, Ba M. Effective method for large scale ontology matching. In Proc. the 5th SWAT4LS, Nov. 2012.

共引文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部