Abstract
Coal mining is a high-risk industry, and coal mining enterprises accumulate large and complex records of violations. To retrieve and manage these records efficiently, accurately and intelligently, and to reduce the occurrence of violations, this paper takes a database of 13,935 violation records from one mine over the most recent three years as its sample and divides the violations into 3 categories and 23 subcategories. Based on computer text classification technology, a classifier for violation text data is built through text preprocessing with the Jieba word segmenter, vector space model construction, feature value selection with the TF-IDF model, and similarity calculation. A visual presentation platform is then built in a Python environment and classification statistics are compiled. The results show that illegal operation accounts for the highest proportion of all violations, at 64%, followed by illegal action, with illegal command accounting for the smallest proportion. The violation subcategories are also divided into high-, medium- and low-frequency classes, providing important data support for accident prevention.
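The classification flow named in the abstract (segmentation, vector space model, TF-IDF weighting, similarity calculation) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The sample records are invented English placeholders, the whitespace tokenizer stands in for Jieba segmentation of Chinese text, and the smoothed IDF formula and nearest-record decision rule are assumptions chosen for brevity. All function names are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    # Stand-in for Jieba segmentation (e.g. jieba.lcut on Chinese text);
    # this sketch only lowercases and splits on whitespace.
    return text.lower().split()


def build_idf(token_lists):
    # Smoothed inverse document frequency over the labelled corpus (assumed variant).
    n_docs = len(token_lists)
    doc_freq = Counter(term for tokens in token_lists for term in set(tokens))
    return {t: math.log((1 + n_docs) / (1 + df)) + 1 for t, df in doc_freq.items()}


def tfidf_vector(tokens, idf):
    # Sparse TF-IDF vector in the vector space model; unseen terms are dropped.
    counts = Counter(t for t in tokens if t in idf)
    total = sum(counts.values())
    if not total:
        return {}
    return {t: (c / total) * idf[t] for t, c in counts.items()}


def cosine(u, v):
    # Cosine similarity between two sparse vectors stored as dicts.
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def classify(text, labelled_vectors, idf):
    # Assign the label of the most similar labelled record (assumed decision rule).
    query = tfidf_vector(tokenize(text), idf)
    best_label, best_score = None, -1.0
    for label, vec in labelled_vectors:
        score = cosine(query, vec)
        if score > best_score:
            best_label, best_score = label, score
    return best_label, best_score


# Invented records for illustration only; they are not from the paper's database.
SAMPLES = [
    ("illegal operation", "worker operated the winch without locking the safety device"),
    ("illegal operation", "operator started the conveyor without sending the start signal"),
    ("illegal action", "miner crossed the running belt conveyor"),
    ("illegal action", "miner rode in a mine car not intended for personnel"),
    ("illegal command", "team leader ordered work to continue under an unsupported roof"),
]

TOKEN_LISTS = [tokenize(text) for _, text in SAMPLES]
IDF = build_idf(TOKEN_LISTS)
LABELLED = [
    (label, tfidf_vector(tokens, IDF))
    for (label, _), tokens in zip(SAMPLES, TOKEN_LISTS)
]

label, score = classify("miner crossed the belt conveyor while it was running", LABELLED, IDF)
print(label, round(score, 3))
```

In a real setting the tokenizer would be replaced by a Chinese word segmenter and the labelled set would cover all 23 subcategories. Counting the predicted labels over the full record set would then yield per-category proportions of the kind reported above.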
Authors
Li Jing (栗婧)
Zhang Zhizhen (张志珍)
Du Xuan (杜璇)
Wang Zhen (王真)
Liu Ziwei (刘紫薇)
Xin Yanli (辛艳丽)
Affiliations: School of Emergency Management and Safety Engineering, China University of Mining and Technology-Beijing, Beijing 100083, China; Department of Intelligence and Reconnaissance, Special Police College of CAPF, Beijing 100100, China
Source
Journal of Mining Science and Technology (《矿业科学学报》)
CSCD
2022, No. 3, pp. 344-353 (10 pages)
Funding
Fundamental Research Funds for the Central Universities (2021YJSAQ12).
Keywords
text classification technology
violation behaviors
production safety
coal mining enterprises