
Automatic classification method of railway signal equipment faults based on text mining (cited by: 8)
Abstract: Railway signal equipment accumulates a large volume of maintenance data recorded as free text during operation and maintenance. To classify these records efficiently and accurately, this paper proposes an automatic classification method for railway signal equipment fault text that combines Word2vec, the SMOTE algorithm, and a Convolutional Neural Network (CNN). First, the fault text is preprocessed with natural language processing methods, and Word2vec is used to train word vectors. Second, the SMOTE algorithm automatically generates text vector data for the minority (small) categories, and the vectors are fed to the input layer of the CNN. Third, the convolutional and pooling layers of the CNN extract high-level features of the local context of the fault text. Finally, a softmax classifier assigns each fault text to a category. Experiments on signal equipment fault text recorded by a railway bureau, including comparisons with other methods, show that the proposed method clearly improves every evaluation metric, with classification precision and recall reaching 95.26% and 94.32% respectively, so it can serve as an effective method for automatic classification of railway signal equipment faults.
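The SMOTE step named in the abstract can be illustrated with a minimal sketch. It assumes each fault text has already been reduced to one fixed-length vector (for example by averaging its Word2vec word vectors); how the paper builds these vectors is not stated in the abstract. The function name `smote_oversample` and its parameters are hypothetical, and the code shows only the standard SMOTE interpolation idea, not the authors' implementation.

```python
import numpy as np

def smote_oversample(minority, n_new, k=3, seed=0):
    """Create n_new synthetic minority samples (hypothetical helper).

    Each synthetic sample lies on the line segment between a randomly
    chosen minority sample and one of its k nearest minority neighbours.
    """
    rng = np.random.default_rng(seed)
    minority = np.asarray(minority, dtype=float)
    n = len(minority)
    if n < 2:
        raise ValueError("need at least two minority samples")
    k = min(k, n - 1)

    # Pairwise Euclidean distances between minority samples.
    diff = minority[:, None, :] - minority[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    np.fill_diagonal(dist, np.inf)  # a sample is not its own neighbour

    # Indices of the k nearest neighbours of every sample.
    neighbours = np.argsort(dist, axis=1)[:, :k]

    # Pick a base sample and one of its neighbours for each new point.
    base = rng.integers(0, n, size=n_new)
    picked = neighbours[base, rng.integers(0, k, size=n_new)]

    # Interpolate with a random factor in [0, 1).
    gap = rng.random((n_new, 1))
    return minority[base] + gap * (minority[picked] - minority[base])

# Toy minority class: four 2-dimensional "text vectors" (made-up values).
X_min = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
synthetic = smote_oversample(X_min, n_new=5, k=2)
print(synthetic.shape)
```

Because every synthetic point is a convex combination of two real minority samples, the generated vectors stay inside the region spanned by the minority class, which is what lets the rebalanced set be passed to a classifier without inventing out-of-range features.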
Authors: LIN Hai-xiang (林海香); LU Ren-jie (陆人杰); LU Ran (卢冉); XU Li (许丽) (School of Automation and Electrical Engineering, Lanzhou Jiaotong University, Lanzhou 730070, Gansu, China)
Source: Journal of Yunnan University (Natural Sciences Edition) (《云南大学学报(自然科学版)》); indexed in CAS, CSCD, and the Peking University Core Journals list; 2022, No. 2, pp. 281-289 (9 pages)
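Funding: Innovation Fund for Higher Education Institutions of Gansu Province (2020B-104); Gansu Province Outstanding Graduate Student "Innovation Star" Project (2021CXZX-606).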
Keywords: railway signal equipment; Word2vec; SMOTE algorithm; Convolutional Neural Networks (CNN); fault text data; automatic classification
