摘要
数据溯源技术对保证数据密集型科研的再现、验证和重用具有极为重要的作用。本文结合科学数据管理的特点和需求,在现有溯源技术的基础上,重点对溯源描述模型及规范进行了研究设计。其中对溯源内容构成模型进行完善,提出了W7+R3模型,并基于此细化设计了溯源内容元数据规范;同时对现有溯源表达模型进行了优化,设计了一种实用轻量级的溯源表达模型。文章最后对科研过程中溯源管理的发展提出了建设性的建议和思考。希望本文的研究对于规范科学数据溯源管理具有一定的基础性参考和指导价值。
Data provenance technology plays an important role in ensuring the reappearance, verification and reuseof data-intensive research. Based on the existing data provenance technology, this thesis combined the characteristics and requirements of scientific data management, focuses on the study and design of data provenance description model and specification. Firstly, the content of data provenance model is improved, the W7+R3 model is proposed, and the metadata standard of data provenance is designed and detailed; Secondly, the existing data provenance expression model is optimized, and a practical lightweight data provenance model is designed. Finally, this thesis came up with some constructive suggestions and thoughts in both the scientific management and development. It is hoped that the research of this thesis will have some basic reference and guiding value for standardizing scientific data provenance management.
作者
王逢阳
徐全军
刘峰
周园春
Wang Fengyang Xu Quanjun Liu Feng Zhou Yuanchun(Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China University of Chinese Academy of Sciences, Beijing 100049, China Marine Environment Special Office of the Chinese People's Liberation Army, Beijing 100081, China)
出处
《科研信息化技术与应用》
2017年第1期27-34,共8页
E-science Technology & Application
基金
国家重点研发计划项目(2016YFB0501900
2016YFB1000600)的研究成果之一
关键词
数据管理
数据溯源
溯源模型
data management
data provenance
provenance model