摘要
针对信息系统中海量数据多源异构和难以共享的问题,提出了多源异构数据虚拟集成框架.数据集成系统中的GAV(Global-As-View)模式映射方法面对信息量分布不均匀的数据源时,查询效率较低,在对GAV改进的基础上,提出了基于HGAV(Hierarchical-Global-As-view)的模式映射算法,通过引入中间数据源模式,形成分层的全局视图,大大缩减了映射空间,简化了映射集合,便于查询的重写和优化.利用宁东智慧环保项目中的五大类数据对本文所提出的算法加以验证,实验结果表明该算法相较于GAV模式映射算法提高了数据集成效率,缩短了查询时间.
In order to solve the problem of heterogeneous data sources and data sharing in information systems,a virtual integration framework of multi-source heterogeneous data is proposed.Since GAV(Global-As-View)pattern mapping method in data integration system is less efficient when faced with uneven distribution of information,GAV method is improved,and the pattern mapping method based on HGAV(Hierarchical-Global-As-View)is proposed.By introducing the intermediate data source pattern,a hierarchical global view is formed,which greatly reduces the mapping space.In this way,the mapping set is simplified,and the query is easier to rewrite and optimize.The proposed algorithm is verified by the five main types of data in the Ningdong Intelligent Environment Protection project.The experimental results show that the pattern mapping algorithm based on HGAV improves the efficiency of data integration and shortens the query time,compared to the GAV schema mapping algorithm.
出处
《计算机系统应用》
2018年第3期27-35,共9页
Computer Systems & Applications
基金
国家自然科学基金(U150120175)