摘要
数据库技术是现代计算机软件技术中不可或缺的一部分,在各个行业中都发挥着十分重要的作用。但是,伴随着时代的发展,互联网和云计算要求数据库能够有效地存储和处理大数据,使传统的关系型数据库软件工具能力难以应对新的应用场景。其中,索引对于提升复杂数据集上检索任务的效率有着非常重要的意义。机器学习近年来取得了长足的发展,为数据库索引推荐技术提供了新的机遇。首先叙述了索引推荐的研究背景与挑战;然后,将索引推荐分为索引生成与索引选择2个阶段,分别对这2个阶段的相关技术进行论述;最后,对基于学习的数据库索引推荐技术做了介绍与展望,其中,重点介绍了索引生成与索引选择。索引生成阶段包含基于学习的范围索引、哈希索引、布隆过滤器等,总结了基于学习的索引生成技术的优缺点;对于索引选择阶段,将传统方法与基于学习的方法进行对比与分析,重点叙述强化学习解决索引选择问题。
Database technology is an indispensable part of modern computer software system and plays a very important role in various industries.However,with the development of the times,the Internet and cloud computing require databases to effectively store and process big data,which makes it difficult for traditional relational database software tools to cope with new application scenarios.Among them,indexing is of great significance to improve the efficiency of retrieval tasks on complex data sets.Machine learning has made great progress in recent years,which provides a new opportunity for database index advisor technology.Firstly,this survey summarizes the research background and challenges of index advisor.Then,index advisor is divided into two stages:index generation and index selection.Finally,it summarizes and prospects the database index advisor technology based on learning.Among them,the key techniques are index generation and index selection.The index generation stage includes learning-based range index,hash index and bloom filter,and summarizes the advantages and disadvantages of learning-based index generation technology.For the index selection stage,this survey compares and summarizes the traditional methods with the learning-based methods,and focuses on reinforcement learning to solve the index selection problem.
作者
杨国平
乔少杰
屈露露
韩楠
魏盛杰
元昌安
YANG Guoping;QIAO Shaojie;QU Lulu;HAN Nan;WEI Shengjie;YUAN Chang’an(School of Software Engineering, Chengdu University of Information Technology, Chengdu 610225, China;School of Management, Chengdu University of Information Technology, Chengdu 610103, China;Digital Media Art, Key Laboratory of Sichuan Province, Sichuan Conservatory of Music, Chengdu 610021, China;Guangxi College of Education, Nanning 530023, China)
出处
《重庆理工大学学报(自然科学)》
CAS
北大核心
2022年第6期189-199,共11页
Journal of Chongqing University of Technology:Natural Science
基金
国家自然科学基金项目(61772091,61802035,61962006,61962038,U1802271,U2001212,62072311)
四川省科技计划资助(2021JDJQ0021,2022YFG0186,2020YJ0481,2020YJ0430)
四川音乐学院数字媒体艺术四川省重点实验室资助项目(21DMAKL02)
CCF-华为数据库创新研究计划(CCF-HuaweiDBIR2020004A)
成都市技术创新研发项目(2021-YF05-00491-SN)
成都市重大科技创新项目(2021-YF08-00156-GX)
成都市软科学研究项目(2021-RK00-0065-ZF,2021-RK00-00066-ZF)
广东省基础与应用基础研究基金项目(2020B1515120028)。
关键词
机器学习
数据库
索引推荐
索引生成
索引选择
强化学习
machine learning
database
index advisor
index generation
index selection
reinforcement learning