摘要
数据仓库环境由数据预备域、数据处理域和数据存储域等3个相互独立的组件组成,其中数据预备域主要负责接收和转换来自源应用系统的数据,其输出的数据质量直接决定着整个数据仓库的质量。首先讨论数据仓库的数据预备域和数据仓库的数据质量维度,然后在此基础上讨论从操作源应用系统来的数据可能存在的质量问题,最后针对这些问题,讨论在数据预备域中如何进行处理以得到高质量的数据。
There are three separate components in the data warehouse,namely,data staging region, data processing region and data storage region. During data staging region,it is mainly responsible for reception and transformation of data from the operational source systems. Data quality of the output from the region will directly affect the quality of the whole data warehouse. Firstly,data quality di-mensions would be discussed. According to it,the problems of the data from operational source sys-tems would be discussed. Finally,in order to get high quality data how to deal with the data would be discussed too.
出处
《重庆理工大学学报(自然科学)》
CAS
2014年第10期60-65,共6页
Journal of Chongqing University of Technology:Natural Science
基金
宜昌市科学技术研究与开发项目(A2012-302-19)
关键词
数据仓库构建
数据预备域
数据质量维度
data warehouse construction
staging region
data quality dimensions