摘要
为了对公共资源电子交易平台运行过程中产生的大规模结构化或非结构化的电子数据进行分析和利用,充分挖掘公共资源交易数据的价值,并提升电子交易水平和监管能力,本文设计了一种大数据平台的建设和应用方法,通过建立分层平台架构,集成自动分词、OCR文字识别、音频识别等技术,实现数据自动汇聚的能力;通过建立算法工具箱,实现数据汇聚、数据存储、算法管理、数据驾驶舱等多个子系统,完成公共资源电子交易大数据平台的建设。实践的结果表明,基于本文方法建设的大数据平台在实际的客户业务中部署使用,能够起到良好的应用效果。
The rapid development of public resource electronic trading business has produced large-scale structured or unstructured electronic data,which are stored in different ways and continue to grow rapidly.Due to the requirements of archiving time limit,these data not only occupy a lot of storage resources,but also are difficult to manage.In order to fully tap the value of the trading data,and improve the level of electronic transactions and supervision ability,this paper designs a method for the construction and application of big data platform,establishes a layered platform architecture,integrates technologies such as automatic word segmentation,OCR character recognition and audio recognition,and realizes the ability of automatic data aggregation.This paper standardizes the use of mining and analysis algorithms,and establishes an algorithm toolbox through data aggregation,data storage Algorithm management and other subsystems complete the construction of the big data platform,and establish a lead cockpit subsystem to realize the rapid display of data and analysis results.Finally,this paper expounds the methods and ideas of platform application through typical applications,which are deployed in the actual customer business and play a practical application effect.
作者
黄建新
HUANG Jianxin(Department of Xiamen Hymake Technology Co.,Ltd.Xiamen,China,361008)
出处
《福建电脑》
2022年第9期45-49,共5页
Journal of Fujian Computer
关键词
电子招标投标
数据主题
自动分词
深度学习
Electronic Bidding
Data Subject
Automatic Word Segmentation
Deep Learning