摘要
针对生物数据源的分布性、异构性和动态性等特性,探讨生物信息技术服务支撑系统整体解决方案,构建基于基因本体的信息集成模式以实现生物语义学上的数据集成。设计一种以半结构化形式规范生物元数据及基于MD5算法的增量更新技术,用以解决通用扩展性和效率问题,实现生物数据仓库中数据的共享并提高管理效率。
For the characters of distribution,heterogeneity and dynamic of biological data,a resolution of the service system for bioinformatics technology is presented,and an approach of biological data integration based on Gene Ontology(GO)is proposed in order to realize biological semantic integration.Semi-structured incremental updating method to standardize biological metadata with MD5 algorithm to improve the updating efficiency is designed,which resolves the data sharing and the efficiency of data management in biological data warehouse.
出处
《计算机工程》
CAS
CSCD
北大核心
2008年第8期38-40,共3页
Computer Engineering
基金
国家自然科学基金资助项目(60573093)
上海市重大科技项目(02DJ14013)
关键词
基因本体
半结构化
增量更新
MD5算法
Gene Ontology(GO)
semi-structured
incremental update
MD5 algorithm