Abstract
With the arrival of the big data era, improving the efficiency of inserting and updating massive data sets has become a focus of current research. Against this background, the author studies methods for efficiently inserting and updating massive data in Greenplum Database, which is widely used in industry. For inserting data, in addition to the insert command, Greenplum Database provides external tables, the copy command, and the gpfdist component, which can raise the efficiency of inserting massive data a hundredfold. For updating data, combining the gpfdist component with external tables can greatly improve update efficiency.
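The abstract does not reproduce the statements used in the study. As a rough illustration only, the sketch below builds the kind of SQL that a gpfdist-backed external table workflow typically involves: one statement to declare a readable external table, one to bulk insert from it, and one to bulk update through a join. All table names, column names, the host, and the port are hypothetical, and the helper functions are illustrative rather than part of any Greenplum client library.

```python
# Minimal sketch, assuming a gpfdist process is already serving CSV files.
# Every identifier passed in below (tables, columns, host, port) is hypothetical.
# Identifiers are interpolated directly, so only use trusted names.


def external_table_ddl(ext_table, target_table, host, port, file_pattern):
    """Build DDL for a readable external table that reads CSV via gpfdist."""
    return (
        f"CREATE READABLE EXTERNAL TABLE {ext_table} (LIKE {target_table})\n"
        f"LOCATION ('gpfdist://{host}:{port}/{file_pattern}')\n"
        "FORMAT 'CSV';"
    )


def bulk_insert_sql(ext_table, target_table):
    """Build a set-based insert that loads every external row into the target."""
    return f"INSERT INTO {target_table} SELECT * FROM {ext_table};"


def bulk_update_sql(ext_table, target_table, key, columns):
    """Build a single UPDATE that joins the target to the external table on key."""
    assignments = ", ".join(f"{col} = e.{col}" for col in columns)
    return (
        f"UPDATE {target_table} AS t SET {assignments}\n"
        f"FROM {ext_table} AS e WHERE t.{key} = e.{key};"
    )


if __name__ == "__main__":
    print(external_table_ddl("ext_orders", "orders", "etl-host", 8081, "orders_*.csv"))
    print(bulk_insert_sql("ext_orders", "orders"))
    print(bulk_update_sql("ext_orders", "orders", "order_id", ["status", "amount"]))
```

The point of this shape is that both the insert and the update run as one set-based statement over the external table, instead of issuing one insert or update per row.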
Author
陆佳琦
LU Jiaqi (Shanghai Futures Information Technology Co., Ltd., Shanghai 200122, China)
Source
《信息与电脑》
2022, Issue 6, pp. 191-193 (3 pages)
Information & Computer