摘要
Benford分布律是常用的数据质量评估方法。通常,Benford分布律只适用于完整数据集的数据质量评估。对于完整数据集的有界子集,提出修正Benford分布律评估其数据质量,拓宽了Benford分布律的适用范围。随机模拟结果显示,新方法的统计性质比Benford分布律更好,评估结果更合理。
Special Data Dissemination Standards(SDDS) of IMF has been accepted in China, it means that there is a challenging goal for studying and perfect evaluation methods of data quality. Benford's distribution has been widely applied in the data accuracy assessment. In General,Benford?s distribution can only be applied to complete recorded data quality assessment. This paper enhance Benford's distribution with the corrected Benford?s distribution to handle situations where data record is bounded. The simulation studies show that improved statistical property over traditional Benford's distribution in data quality assessment.
出处
《统计与信息论坛》
CSSCI
北大核心
2017年第9期9-16,共8页
Journal of Statistics and Information
基金
国家社会科学基金青年项目<基于三系统估计量的中国普查年人口总数估计研究>(17CTJ002)
国家自然科学基金项目<劣者淘汰两阶段自适应临床试验的设计和分析>(11471239)
天津财经大学研究生科研基金资助项目<人口普查数据质量评估方法研究>(2014TCB02)