期刊文献+

基于最大方差权信息系数的煤气数据填补 被引量:5

Missing data imputation based on maximal variance weight information coefficient for gas flow in steel industry
下载PDF
导出
摘要 在数据的挖掘、建模与优化领域,数据的完整性与准确性是进行此类研究的基础.鉴于冶金能源系统的复杂性和现场数据采集过程易受干扰的特点,其数据在获取过程中极易发生数据缺失的现象,从而造成模型无法建立,隐含信息无法准确挖掘等情况.本文针对钢铁企业副产煤气的发生、消耗流量数据出现的缺失情况,通过分析相似工况下能源流量数据的相关特性,提出一种基于最大方差权信息系数的冶金企业副产煤气系统流量数据填补方法.该方法针对现场经常发生的两类数据缺失情况,即数据点间断缺失和数据长时间连续缺失,以最大方差权信息系数作为样本筛选准则,并采用基于核学习的方法对缺失数据进行填补.为验证本文提出的数据填补方法的有效性,本文对上海宝钢高炉、焦炉和冷热轧用户的实际生产数据的运行试验,结果表明该方法相比其他的方法存填补精度上有很大优势. In data-driven-based modeling and optimization, the completeness and the accuracy of data are the foundations for further research tasks. Since the energy system of steel industry is rather complicate and its data-acquisition pro- cess might be frequently affected by the malfunctions of data transportation, storage and transformation, the data-missing phenomenon usually occurs, which might lead to the failure of model building or accurate information discovery. In this study, by analyzing the correlation of the energy data with respect to corresponding operation conditions in manufacturing, a data imputation method for the missing data in the byproduct gas flow is proposed. In this method, the proposed maximal variance weight information coefficient (MVWIC) is adopted as the sample selection criteria to realize the data imputation by using the kernel-learning-based method. To validate the proposed method, two types of missing modes that frequently occurred in steel industry are considered here, i.e., the intermittent missing and the long-term continuous missing. A series of experiments using the practical energy data indicates that the proposed method exhibits good performance on the imputation accuracy when compared to other methods.
出处 《控制理论与应用》 EI CAS CSCD 北大核心 2015年第5期646-654,共9页 Control Theory & Applications
基金 国家"863"计划项目(2013AA040703) 国家自然科学基金项目(61034003 61304213 61104157 61273037 61473056) 中央高校基本科研业务费专项资金项目(DUT13RC203)资助~~
关键词 冶金能源系统 数据填补 样本筛选 最大方差权信息系数 energy system of steel industry data imputation sample selection maximal variance weight information coefficient
  • 相关文献

参考文献27

  • 1LIU Y, LIU Q, WANG W, et al. Data-driven based model for flow prediction of steam system in steel industry [J]. Information Sciences, 2012, 193: 104- 114.
  • 2ZHAO J, WANG W, LIU Y, et al. A two-stage online prediction method for a blast furnace gas system and its application [J]. IEEE Transactions on Control Systems Technology, 2011, 19(3): 507 - 520.
  • 3ZHAO J, LIU Y, ZHANG X, et al. A MKL based on-line prediction for gasholder level in steel industry [J]. Control Engineering Practice, 2012, 20(6): 629 - 641.
  • 4LIU Y, ZHAO J, LV Z, et al. Study on balance scheduling based on the gasholder level prediction for blast furnace gas system in steel industry [J]. Applied Mechanics and Materials, 2012, 128:1464 - 1467.
  • 5LIU Y, ZHAO J, WANG W. A time series based prediction method for a coke oven gas system in steel industry [J]. ICIC Express Letters, 2010, 4(4): 1373 - 1378.
  • 6ZHANG X, ZHAO J, WANG W. A improved multiple kernel learn- ing based least square support vector regression and its application in the on-line prediction of gasholder level in iron and steel [J]. ICIC Express Letter, 2010, 4(5(B)): 1767 - 1772.
  • 7DONALD B R. Inference and missing data [J]. Biometrika, 1976, 63(3): 581 - 592.
  • 8RODERICK J A. Regression with missing x's: a review [J]. Journal of the American Statistical Association, 1992, 87(420): 1227 - 1237.
  • 9KURODA M, SAKAKIHARA M. Accelerating the convergence of the EM algorithm using the vector algorithm [J]. Comutational Statis- tics and Data Analysis, 2006, 51 (3): 1549 - 1561.
  • 10GELMAN A, HILL J. Data Analysis Using Regression and Mul- tilevel/Hierarchical Models [M]. London: Cambridge University Press, 2007.

同被引文献42

引证文献5

二级引证文献25

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部