Abstract
The State Grid Corporation of China (SGCC) has accumulated a large amount of text data in the course of its informatization projects. How to mine the valuable information contained in this text data is becoming a key research focus of big data mining for power enterprises. Taking the current state of data in the electric power industry into account, this paper applies text mining to the scenario of evaluating the effectiveness of investment in electric equipment maintenance. Defect reports from the production information management system are grouped by text clustering to obtain a finer-grained classification of defects. Practice shows that the method yields the defect characteristics of each category, demonstrating that text mining is applicable in the power industry.
Source
Electric Power Information and Communication Technology (《电力信息与通信技术》)
2016, No. 1, pp. 7-10 (4 pages)