期刊文献+

电力大数据质量评价模型及动态探查技术研究 被引量:22

Research for electric power big data quality evaluation model and dynamic exploration technology
下载PDF
导出
摘要 针对电力数据数量多、种类庞杂、横跨专业多等特点而引起的数据质量不高的现状,该项目以数据中心数据为研究对象,通过分析产生数据质量问题的原因,以元数据、数据质量、数据建模等理论为基础,对数据流动过程中的质量检查、质量控制和质量评估等进行深入的研究。构建电力大数据质量评价指标体系,研究电力大数据质量实时监控、快速动态检测方法与关键技术,提出了适合于电力大数据数据质量控制与评估体系模型,实现数据质量管控平台开发,确保企业数据质量,提升数据应用价值。 For the status quo of bad electric power data quality caused by mass data,complex species,multi-profession in-volvement and so on,quality inspection,quality control and quality assessment in the flow process of the data from the data cen-ter are studied based on metadata,data quality and data modeling theories,and cause analysis of poor data quality. The electric power big data quality evaluation index system was built. The real-time monitoring,rapid motion detection methods and key tech-nologies of the electric power big data quality were researched. The data quality control and assessment system model suitable for the power big data are proposed in thispaper. The development of a data quality control platform was achieved. It ensured the data quality of the enterprise and raised the value of data applications.
作者 陈超
出处 《现代电子技术》 2014年第4期153-155,共3页 Modern Electronics Technique
关键词 电力大数据 元数据 数据质量 实时监控 electric power big data metadata data quality real-time monitoring
  • 相关文献

参考文献6

二级参考文献193

  • 1程国达,苏杭丽.一种检测汉语相似重复记录的有效方法[J].计算机应用,2005,25(6):1362-1365. 被引量:8
  • 2韩京宇,徐立臻,董逸生.一种大数据量的相似记录检测方法[J].计算机研究与发展,2005,42(12):2206-2212. 被引量:32
  • 3朱恒民,王宁生.一种改进的相似重复记录检测方法[J].控制与决策,2006,21(7):805-808. 被引量:12
  • 4张永,迟忠先.位置编码在数据仓库ETL中的应用[J].计算机工程,2007,33(1):50-52. 被引量:12
  • 5HANJia-wei,Micheline Kanber著.数据挖掘概念与技术[M].北京:机械工业出版社,2007
  • 6Ahmed K, Panagiotis G, Vassilos, et al. Duplicate record detection: a survey[J]. IEEE Transactions on Knowledge and Data Engineering, 2007, 19 (1) : 1- 16.
  • 7Anestis Sitas, Sarantos Kapidakis. Duplicate detection algorithms of bibliographic descriptions[J]. Library Hi Tech, 2008, 26(2): 287-301.
  • 8McCallum A, Nigam K, Ungar L H. Efficient clustering of high-dimensional data sets with application to reference matching[C]//Sixth ACM SIGKDD Int'l Conf Knowledge Discovery and Data Mining. New York: ACM Press, 2000: 169-178.
  • 9Chaudhuri S, Ganjam, K, Ganti V, et al. Robust and efficient fuzzy match for online data cleaning [C]// ACM SIGMOD International Conference on Management of Data. New York: ACM, 2003: 313- 324.
  • 10Jaewoo Kang. Toward the scalable integration of internet information sources[D]. Madison: Computer Sciences Department, University of Wisconsin-Madison, 2004.

共引文献2439

同被引文献174

引证文献22

二级引证文献196

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部