Abstract
Most coal mine accident cases are currently stored as unstructured text, which hinders the organization, sharing, and reuse of the key information they contain. To address this problem, this paper draws on a large number of coal mine accident cases published on the Internet and introduces the idea of slicing into text mining to extract key information from the cases. Combined with frame representation, it proposes a hierarchical data storage structure that supports the processing, storage, and mining of multi-level, multi-type accident case data. The paper also analyzes and visualizes data such as the location, time, and causes of accidents.
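As a rough illustration of the two ideas named in the abstract, the sketch below splits a report into sentence-level slices, applies simple rules to each slice, and stores the results in a nested frame whose slots hold either values or child frames. It is not the authors' implementation: the `Frame` class, the `PATTERNS` rules, the slot names, and the sample report are all hypothetical and chosen only to make the example self-contained.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Frame:
    """A frame: a named record whose slots hold values or child frames."""
    name: str
    slots: dict = field(default_factory=dict)

    def to_dict(self):
        # Recursively flatten child frames into nested dictionaries.
        return {
            key: value.to_dict() if isinstance(value, Frame) else value
            for key, value in self.slots.items()
        }


# Hypothetical slot patterns; a real system would need far richer rules.
PATTERNS = {
    "date": re.compile(r"\b(\d{4}-\d{2}-\d{2})\b"),
    "deaths": re.compile(r"\b(\d+) (?:people|miners|workers) (?:died|were killed)"),
    "accident_type": re.compile(r"\b(gas explosion|roof fall|flooding|fire)\b"),
}


def slice_text(text):
    """Split a report into sentence-level slices."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def extract_case(case_id, text):
    """Fill a two-level frame (overview, cause) from the slices of one report."""
    overview = Frame("overview")
    cause = Frame("cause")
    for piece in slice_text(text):
        for slot, pattern in PATTERNS.items():
            match = pattern.search(piece)
            # Keep the first value found for each slot.
            if match and slot not in overview.slots:
                overview.slots[slot] = match.group(1)
        if piece.lower().startswith("the direct cause"):
            cause.slots["direct"] = piece
    return Frame(case_id, {"overview": overview, "cause": cause})


# Invented sample report, for illustration only.
report = (
    "On 2020-05-01 a gas explosion occurred at a hypothetical mine. "
    "3 miners died in the accident. "
    "The direct cause was a spark igniting accumulated gas."
)
case = extract_case("case-001", report)
print(case.to_dict())
```

Working slice by slice keeps each rule local to a short span of text, and the nested frame keeps case-level fields (time, type, casualties) separate from cause descriptions, which is one plausible reading of a hierarchical, multi-type storage structure.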
Authors
赵中昊
冯彬浩
曾成
杨梦
ZHAO Zhong-hao; FENG Bin-hao; ZENG Cheng; YANG Meng (School of Mechanical Electronic and Information Engineering, China University of Mining and Technology-Beijing, Beijing 100083, China)
Source
Computer and Information Technology (《电脑与信息技术》)
2024, No. 3, pp. 63-67 (5 pages)
Funding
National Undergraduate Innovation and Entrepreneurship Training Program project (Project No. C202204056).
Keywords
key information extraction
frame representation
visualization
text mining