摘要
针对在关系代数运算下数据质量传递影响问题,在属性粒度给出了一个数据质量评价模型,定义了正确性评价指标。通过分析量化前后属性错误率对质量评价的不同含义和作用,在数据错误随机分布的假设前提下,证明了两种错误率之间的定量关系;并研究了投影运算对正确性评价指标的质量传递影响,定量地给出了传递关系,分别对用量化前后属性错误率进行了表示。
The paper presented a data quality model at attribute level, and defined the accuracy metric. After analysising the difference of error rate before and after quantification, and with the assumption of random errors occurrence, the quantities relationship between them was presented. The paper also discussed the quality propagation of project operation, and represented the propagation formula with error rate before and after quantization separately.
出处
《计算机应用研究》
CSCD
北大核心
2008年第9期2751-2753,2770,共4页
Application Research of Computers
基金
国家自然科学基金资助项目(60504036)
关键词
数据库
数据质量
质量传递
关系代数
投影运算
正确性
database
data quality
quality propagation
relation algebra
projection
accuracy