A Lossless Incremental Data Synchronization Method Based on Kettle
Cited by: 7
Abstract: The day-to-day operation and management of a school's information systems generates large volumes of valuable data, which matter greatly for major institutional decisions and for reforming the traditional education model. Collecting data of different dimensions into a unified data center has therefore become one of the focuses of big data research. During data acquisition, however, many schools retain neither important historical data nor status markers for deleted records, which severely undermines analyses such as time-slice analysis and historical-state analysis. Taking a school personnel system as an example, this paper proposes a Kettle-based lossless incremental data synchronization method. The method compares full data sets to identify newly added, modified, and deleted records and logs each of them in detail, thereby preserving the complete history and supplying the data that strategies such as time-slice analysis would otherwise lack.
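The full-comparison idea described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's Kettle transformation: the field names (`id`, `name`, `dept`), the change labels (`new`, `changed`, `deleted`), the helper names, and the sample records are all assumptions made for illustration. The sketch compares the previous full snapshot with the current one by primary key and appends every detected change to a history log, so that deleted and overwritten records are not lost.

```python
from datetime import date


def diff_snapshots(previous, current, key="id"):
    """Compare two full snapshots and classify each record that differs.

    Returns a list of (change_type, row) tuples, where change_type is one of
    the illustrative labels "new", "changed", or "deleted".
    """
    prev_by_key = {row[key]: row for row in previous}
    curr_by_key = {row[key]: row for row in current}

    changes = []
    for k, row in curr_by_key.items():
        if k not in prev_by_key:
            changes.append(("new", row))
        elif row != prev_by_key[k]:
            changes.append(("changed", row))
    for k, row in prev_by_key.items():
        if k not in curr_by_key:
            # Keep the last known state of the record that disappeared.
            changes.append(("deleted", row))
    return changes


def append_history(history, changes, sync_date):
    """Append each change to the history log with its type and sync date."""
    for change_type, row in changes:
        history.append({**row, "change_type": change_type, "sync_date": sync_date})
    return history


# Hypothetical personnel records from two consecutive synchronization runs.
previous = [
    {"id": 1, "name": "Li", "dept": "Math"},
    {"id": 2, "name": "Wang", "dept": "Physics"},
    {"id": 3, "name": "Chen", "dept": "History"},
]
current = [
    {"id": 1, "name": "Li", "dept": "Math"},
    {"id": 2, "name": "Wang", "dept": "Chemistry"},
    {"id": 4, "name": "Zhao", "dept": "Library"},
]

changes = diff_snapshots(previous, current)
history = append_history([], changes, date(2019, 10, 1))
for entry in history:
    print(entry["id"], entry["change_type"], entry["sync_date"])
```

Because every change is appended and nothing is overwritten, the history log can later be filtered by `sync_date` to reconstruct the state of the data at a given point in time, which is the kind of time-slice analysis the abstract refers to.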
Author: ZHAO Ya-wei (赵亚伟), Information Office, Beijing Language and Culture University, Beijing 100083, China
Source: Software Guide (《软件导刊》), 2019, No. 10, pp. 55-58 (4 pages)
Funding: Fundamental Research Funds for the Central Universities (18YJ120001)
Keywords: data acquisition; incremental synchronization; Kettle; educational big data