期刊文献+

基于文本数据挖掘的轨道电路故障智能分类 被引量:2

Intelligent Classification of Track Circuit Fault Based on Text Data Mining
下载PDF
导出
摘要 针对轨道电路不均衡的故障历史文本数据,提出了一种针对非均衡历史文本数据挖掘的轨道电路智能分类模型。选取TF-IDF和先验LDA无监督机器学习模型对历史故障文本数据分别进行词项级和主题级故障特征提取并向量化,将提取的历史数据特征向量串行融合,得到其特征向量空间。采用SMOTE算法自动生成历史文本数据中的少数类数据,避免在机器学习过程中出现欠拟合现象。鉴于单个分类器在机器学习及智能分类的精度不高,采用投票的方式实现基分类器与集成分类器的集成学习。选择广铁集团电务段2011年的故障文本数据进行试验分析,验证该模型在历史故障数据分类的准确率和召回率等方面的优势。 According to the unbalanced fault history text data of track circuit,an intelligent classification model of track circuit for unbalanced history text data mining is presented. TF-IDF and priori LDA unsupervised machine learning model are selected to extract and quantify the fault features at word level and topic level respectively. The feature vectors of the fault history data are fused serially to obtain the feature vector space of the fault history data. SMOTE algorithm is used to generate a few classes of historical text data automatically,the phenomenon of under-fitting in the process of machine learning is avoided. In view of the low accuracy of single classifier in machine learning and intelligent classification,multi-classifier ensemble learning is implemented by voting. The fault text data of Guangzhou Railway Group Telecom Depot in 2011 are selected for experimental analysis to verify the advantages of the model in the accuracy and recall rate of historical fault data classification.
作者 刘伯鸿 孙浩洋 LIU Bo-hong;SUN Hao-yang(College of Automatic&Elctrical Engineering,Lanzhou Jiaotong University,lanzhou 730070,China)
出处 《测控技术》 2020年第10期32-36,92,共6页 Measurement & Control Technology
基金 国家自然科学基金地区科学基金项目(61661027)。
关键词 轨道电路 数据挖掘 串行融合 SMOTE 分类器 集成学习 track circuit data mining serial fusion SMOTE classifier integrated learning
  • 相关文献

参考文献3

二级参考文献21

  • 1徐薇,黄厚宽,秦勇.时空本体研究及在地理信息系统中的应用[J].铁道学报,2005,27(4):119-124. 被引量:11
  • 2郑丽英,王海涌,刘丽艳.基于粗糙集和模糊聚类理论的文本分类系统的研究与实现[J].铁道学报,2007,29(1):45-49. 被引量:11
  • 3RUBEN S’ ALBERTO G’ CARLOS G. An OntologyDriven Decision Support System for High-performance andCost-optimized Design of Complex Railway Portal Frames[J]. Expert Systems with Applications, 2012, 39 (10):8784-8792.
  • 4European Railway Open Maintenance System[EB/OL]. ht-tp://cordis. europa eu/data/PROJFP5/ACTIONeqDndSES-SIONeqll2422005919ndDC)Ceq902ndTBLeqEN_PROJ. htm.Completed 4/1/2002.
  • 5HOFMANN T. Probabilistic Latent Semantic IndexingProceedings of the 22nd Annual International SIGIR Con-ference[M]. New York: ACM Press, 1999: 50-57.
  • 6COOPER G F,HERSKOVITS E. A Bayesian Method forthe Induction of Probabilistic Networks from Data [J].Machine Learning, 1992,9(4) : 309-347.
  • 7GIUDICI P* ROBERT C. Improving Markov Chain MonteCarlo Model Search for Data Mining[J]. Machine Learn-ing, 2003, 50(1-2); 127-158.
  • 8KEVIN M. Bayes Net Toolbox for Matlab[EB/OL]. ht-tp://www. cs. ubc. ca/.murphyk/Software/BNT/usage,html # file.
  • 9翟婉明.车辆-轨道耦合动力学[M].4版.北京:科学出版社,2015.
  • 10张鹏.基于主成分分析的综合评价研究[D].南京:南京理工大学,2005:24-26.

共引文献55

同被引文献12

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部