摘要
文章回顾了国际上大型科学数据中心知识集成的现状,认为越来越多的科学数据中心认识到数据中心需要集成更多的知识,以提高科学数据的可用性以及满足长时期数据存储的要求。传统的元数据结构在应对知识集成时面临诸多挑战。中国西部环境与生态科学数据中心(西部数据中心)形成了从数据收集、规范化整理、数据集成挖掘到数据服务的完整体系,集成了一批西部环境和生态研究的关键数据,在数据中心层次上集成多元知识作了有益的开拓性的工作。文章还介绍了西部数据中心知识的表达形式,描述了以元数据为核心,数据文档和科学文献为补充的知识组织形式,并介绍了西部数据中心开展的集成数据检索导航、模型数据集和在线模型服务等知识挖掘手段。
This paper reviews current status of knowledge integration made by the large intemational scientific data centers. With decades of development, scientific data centers have increasingly realized the demands of data-relevant knowledge integrated by data centers to enhance usability of data and meet challenges arising from long-term data stew- ardship. As traditional itemized metadata architecture is hard to include more unstructured knowledge, new approach or extension should be developed. Funded by the National Natural Science Foundation of China, Environmental and Ecolog- ical Science Data Center for West China (WestDC) was established to share valuable scientific data to support studies in West China. Besides its primary role on data sharing, WestDC has also carried out a number of tentative work to integrate more knowledge to extend data usability and applications. This paper describes possible forms of representing knowledge in the data center, followed by an introduction to the approach adopted in the WestDC to complement metadata with data documentation and scientific iterature. Activities on data mining and knowledge discovery in the WestDC including com- prehensive data navigation, model data sets, and online model sharing are also outlined.
出处
《中国科技资源导刊》
CSSCI
2010年第5期15-21,36,共8页
China Science & Technology Resources Review
基金
国家自然科学基金委"中国西部环境与生态科学研究计划"重点项目(90502010)
关键词
数据中心
知识集成
知识挖掘
元数据
中国西部环境与生态科学数据中心
data center, knowledge integration, knowledge discovery, metadata, Environmental and Ecological Science Data Center for West China (WestDC)