摘要
数据质量问题关系到信息系统建设。首先,概述了数据质量的定义和数据质量问题的分类并总结其来源,介绍了数据质量维度这一数据质量评估指标;然后,说明了不同领域中数据清洗的概念,分析了不同数据质量问题的清洗方法,并归纳了数据清洗有关的框架和工具。最后,对数据清洗相关研究进行了展望。
The data quality issue is essential for the information system construction. Firstly, the definition of the data quality and the taxonomy of the data quality problems are discussed. The o- rigins of the data quality problem are summarized to describe the notion of data quality dimension as a data quality assessment criterion. Then, data cleaning concepts are presented, cleaning methods for various data quality problems are analyzed, and the framework and related tools for data cleaning are summarized. Finally, the prospect of the research of the data cleaning is pointed out.
出处
《指挥信息系统与技术》
2013年第5期63-70,共8页
Command Information System and Technology