摘要
"中美合作百万册数字图书馆计划"(简称CADAL)已建成了含有100万册电子图书、数据信息量达到150Tbytes的数字图书馆,具有很高的科学参考价值。本文围绕数字图书馆的海量信息管理这一课题,逐步探讨了电子图书数据查重、电子图书精确管理和数字图书馆系统容灾快速恢复等急待解决的研究课题,根据项目引用的数字图书馆标准,对CADAL电子书查重、CADAL图书元数据入数据库、CADAL数字图书馆数据库设计、CADAL数字图书馆的备份等系统需求进行了软件实现。研究成果直接应用在CADAL项目中,有力地支持了CADAL生产和发布系统的平稳运行,保障了CADAL图书馆提供内容丰富的知识服务。
Scientists in China cooperated with American partners built a system which is called "China and the United States cooperating million volume digital library plan" (CADAL for short). Till now, it has already contained 1000 thousands volumes, total storage of CADAL is 150 TBytes. This paper presents a system to support massive digital resource management for CADAL. The system includes a series of softwares such as duplication elimination of E-book,metadata to database,database designing and backup of E-book, the R & D results have already been applied to the CADAL project. By means of the system,CADAL not only achieves Accurate management and fast recovery of massive data, but also supports web-publishing system and knowledge-based services effectively.
出处
《计算机工程与科学》
CSCD
北大核心
2010年第4期146-150,共5页
Computer Engineering & Science
基金
科学技术部的国际合作计划<中美百万册数字图书馆支撑软件平台>资助项目(2003AA119010)
"211工程"项目<CADAL北方技术中心支撑环境建设>课题资助项目
关键词
数字图书馆
数字资源管理
数据备份
查重
数据库设计
digital library
digital resource management
backup
duplication elimination
database designing