网格环境下基于流水线的多重相似查询优化被引量：1

Pipeline-Based Multi-Query Optimization for Similarity Queries in Grid Environment

下载PDF

导出

摘要提出一种网格环境下基于流水线技术的分布式多重相似查询的优化算法(pipeline-based distributed similarity query processing,简称pGMSQ).首先,当用户提交若干个查询请求时,采用基于代价的动态层次聚类策略(dynamic query clustering,简称DQC)对其进行合并.然后在数据结点层,采用索引支持的向量集缩减方法快速过滤无关向量.最后,在执行结点层对候选向量执行求精操作返回结果向量.由于本查询采用了流水线技术,实验结果表明,该方法在提高查询性能的同时也提高了系统的吞吐量. This paper proposes a multi-query optimization algorithm for pipeline-based distributed similarity query processing （pGMSQ） in grid environment. First, when a number of query requests are simultaneously submitted by users, a cost-based dynamic query clustering （DQC） is invoked to quickly and effectively identify the correlation among the query spheres （requests）. Then, index-support vector set reduction is performed at data node level in parallel. Finally, refinement of the candidate vectors is conducted to get the answer set at the execution node level. By adopting pipeline-based technique, this algorithm is experimentally proved to be efficient and effective in minimizing the response time by decreasing network transfer cost and increasing the throughput.

作者胡华庄毅胡海洋赵格华

机构地区杭州电子科技大学计算机学院浙江工商大学计算机与信息工程学院香港中文大学计算机科学与工程系

出处《软件学报》 EI CSCD 北大核心 2010年第1期55-67,共13页 Journal of Software

基金国家自然科学基金Nos.60873022 60903053 浙江省自然科学基金Nos.Y1080148 Y1090165 浙江省科技厅重大科技项目No.2008C13082 浙江工商大学青年人才基金重点资助项目No.Q09-7 南京大学计算机软件新技术国家重点实验室开放基金~~

关键词网格多重查询优化高维索引数据分片 grid multi-query optimization high-dimensional indexing data partition

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献1

1杨东华,李建中,张文平.基于数据网格环境的连接操作算法[J].计算机研究与发展,2004,41(10):1848-1855. 被引量：8

二级参考文献9

1I Foster, C Kcsselrnan. The Grid: Blueprint for a New Computing Infrastructure. San Francisco, CA: Morgan Kaufmann, 1998
2A Chervenak, I Foster, C Kesselman, et al. The data grid:Towards an architecture for the distributed management and analysis of large scientific datasets. Journal of Network and Computer Applications, 2001, 23:187～200
3Wolfgang Hoschek, Javier Jaen Martinez, Asad Samar, et al.Data management in an international data grid project. In: Proc of the 1st IEEE/ACM Int'l Workshop on Grid Computing. Berlin:Springer-Verlag, 2000. 17～20
4B Segal. Grid Computing: The European data grid project. The 2000 IEEE Nuclear Science Symposium and Medical Imaging Conference, Lyon, France, 2000
5Heinz Stockinger. Distributed database management systems and the data grid. The 18th IEEE Symp on Mass Storage Systems and the 9th NASA Goddard Conference on Mass Storage Systems and Technologies, San Diego, CA, 2001
6J Smith, A Gounaris, P Watson, et al. Distributed query processing on the grid. In: Proc of the 3rd Int'l Workshop on Grid Computing. Berlin: Springer-Verlag, 2002. 279～290
7M Nedim Alpdemir, Arijit Mukherjee, Norman W Paton, et al.Service-based distributed querying on the grid. UK e-Science Programme All Hands Conference, Nottinghan, UK, 2003
8Z Ives, D Florescu, M Friedman, et al. An adaptive query execution system for data integration. In: Proc of the 1999 ACM SIGMOD Int'l Conf on Management of Data. New York: ACM Press, 1999. 299～310
9Nick Roussopoulos, Hyunchul Kang. A pipeline n-way join algorithm based on the 2-way semijoin program. IEEE Trans on Knowledge and Data Engineering, 1991, 3(4): 486～495

共引文献7

1石柯,林海华,徐彬.AnyQuery:网格环境下基于服务的分布式查询处理系统[J].小型微型计算机系统,2006,27(8):1432-1438. 被引量：6
2庄毅,庄越挺,吴飞.基于数据网格的书法字k近邻查询[J].软件学报,2006,17(11):2289-2301. 被引量：3
3申德荣,于戈,聂铁铮,寇月.支持多领域动态数据集成的数据库网格系统[J].软件学报,2006,17(11):2302-2313. 被引量：10
4庄毅,庄越挺,吴飞.基于数据网格环境的k近邻查询[J].计算机研究与发展,2006,43(11):1876-1885.
5蔡红云,张建勋,田俊峰,何欣枫.校园网格环境下异构数据库的集成与分布式查询[J].广西师范大学学报（自然科学版）,2007,25(4):298-301. 被引量：7
6印桂生,于翔,宁慧.一种基于网格的增量聚类算法[J].计算机应用研究,2009,26(6):2038-2040. 被引量：4
7谭云松.网格环境中异构数据访问和集成研究[J].重庆文理学院学报（自然科学版）,2010,29(5):33-36. 被引量：1

同被引文献2

1帅训波,马书南,周相广,龚安.基于遗传算法的分布式数据库查询优化研究[J].小型微型计算机系统,2009,30(8):1600-1604. 被引量：23
2宋怀明,安明远,王洋,袁春阳,孙凝晖.大规模数据密集型系统中的去重查询优化[J].计算机研究与发展,2010,47(4):581-588. 被引量：6

引证文献1

1张亮,陆余良,袁桓,张旻.Deep Web查询优化算法研究[J].小型微型计算机系统,2012,33(3):552-557.

1杨晓宇,岳丽华,柳建平.多重查询优化技术在移动数据库中的应用[J].小型微型计算机系统,2004,25(8):1538-1541. 被引量：2
28个增强Windows系统效率的免费软件[J].计算机与网络,2008,34(17):19-19.
3NSA:内部搜索系统曝光可查询全球通讯数据[J].移动通信,2014,38(16):81-81.
4葛星,沈耀,徐常亮.基于云计算的多重查询优化系统[J].计算机工程,2014,40(9):46-50. 被引量：3
5李伟红,张镇,龚卫国.基于面部组件码表的人脸照片库缩减方法[J].仪器仪表学报,2015,36(11):2563-2569.
6徐雅斌,李卓,董源.基于社会计算和机器学习的垃圾邮件快速过滤[J].系统工程理论与实践,2014,34(S1):179-186. 被引量：1
7梅炳夫,林少丹,刘攀.基于主动网络层次的多播体系结构[J].微电子学与计算机,2009,26(8):230-232.
8王国庆.数据预处理的数据缩减方法的研究[J].计算技术与自动化,2008,27(2):134-137. 被引量：2
9孙见青,汪荣贵,李守毅.一种改进的基于特征和基于图像相结合的人脸检测[J].工程图学学报,2007,28(5):62-67. 被引量：2
10蔡友林,谢银祥.动态层次组播路由[J].四川轻化工学院学报,2004,17(2):54-58.

软件学报

2010年第1期

浏览历史

内容加载中请稍等...

网格环境下基于流水线的多重相似查询优化被引量：1

参考文献1

二级参考文献9

共引文献7

同被引文献2

引证文献1

相关作者

相关机构

相关主题

浏览历史

网格环境下基于流水线的多重相似查询优化 被引量：1

参考文献1

二级参考文献9

共引文献7

同被引文献2

引证文献1

相关作者

相关机构

相关主题

浏览历史

网格环境下基于流水线的多重相似查询优化被引量：1