Feature extraction plays an important role in constructing artificial intel-ligence(AI)models of industrial control systems(ICSs).Three challenges in this field are learning effective representation from high-dimensio...Feature extraction plays an important role in constructing artificial intel-ligence(AI)models of industrial control systems(ICSs).Three challenges in this field are learning effective representation from high-dimensional features,data heterogeneity,and data noise due to the diversity of data dimensions,formats and noise of sensors,controllers and actuators.Hence,a novel unsupervised learn-ing autoencoder model is proposed for ICS data in this paper.Although traditional methods only capture the linear correlations of ICS features,our deep industrial representation learning model(DIRL)based on a convolutional neural network can mine high-order features,thus solving the problem of high-dimensional and heterogeneous ICS data.In addition,an unsupervised denoising autoencoder is introduced for noisy ICS data in DIRL.Training the denoising autoencoder allows the model to better mitigate the sensor noise problem.In this way,the represen-tative features learned by DIRL could help to evaluate the safety state of ICSs more effectively.We tested our model with absolute and relative accuracy experi-ments on two large-scale ICS datasets.Compared with other popular methods,DIRL showed advantages in four common indicators of AI algorithms:accuracy,precision,recall,and F1-score.This study contributes to the effective analysis of large-scale ICS data,which promotes the stable operation of ICSs.展开更多
基金This study is supported by The National Key Research and Development Program of China:“Key measurement and control equipment with built-in information security functions”(Grant No.2018YFB2004200)Independent Subject of State Key Laboratory of Robotics“Research on security industry network construction technology for 5G communication”(No.2022-Z13).
文摘Feature extraction plays an important role in constructing artificial intel-ligence(AI)models of industrial control systems(ICSs).Three challenges in this field are learning effective representation from high-dimensional features,data heterogeneity,and data noise due to the diversity of data dimensions,formats and noise of sensors,controllers and actuators.Hence,a novel unsupervised learn-ing autoencoder model is proposed for ICS data in this paper.Although traditional methods only capture the linear correlations of ICS features,our deep industrial representation learning model(DIRL)based on a convolutional neural network can mine high-order features,thus solving the problem of high-dimensional and heterogeneous ICS data.In addition,an unsupervised denoising autoencoder is introduced for noisy ICS data in DIRL.Training the denoising autoencoder allows the model to better mitigate the sensor noise problem.In this way,the represen-tative features learned by DIRL could help to evaluate the safety state of ICSs more effectively.We tested our model with absolute and relative accuracy experi-ments on two large-scale ICS datasets.Compared with other popular methods,DIRL showed advantages in four common indicators of AI algorithms:accuracy,precision,recall,and F1-score.This study contributes to the effective analysis of large-scale ICS data,which promotes the stable operation of ICSs.