Abstract
To address the storage, management, and efficient use of the massive data generated by new-generation information technology, this paper reviews traditional methods for processing massive data and then proposes a design approach that applies the data lake architecture to big data processing. It gives a preliminary analysis of the data lake's concept, technical characteristics, architecture, processing mechanisms, and application approaches, with a view to offering suggestions and inspiration for the efficient sharing and use of industry big data.
Authors
CHEN Yongnan (陈永南)
XU Guiming (许桂明)
ZHANG Xinjian (张新建)
Affiliations: No. 92403 Troops of PLA, Fuzhou 350007; Nanjing Research Institute of Electronics Engineering, Nanjing 210007
Source
Computer & Digital Engineering (《计算机与数字工程》)
2019, No. 10, pp. 2540-2545 (6 pages)
Keywords
data lake (数据湖)
ecosystem (生态系统)
big data processing (大数据处理)
schema-on-read (读时模式)