期刊文献+

基于PMVS算法的大规模数据细粒度并行优化方法 被引量:4

Fine-Grained Parallel Optimization of Large-Scale Data for PMVS Algorithm
原文传递
导出
摘要 三维多视角立体视觉算法(patch-based multi-view stereo, PMVS)以其良好的三维重建效果广泛应用于数字城市等领域,但用于大规模计算时算法的执行效率低下。针对此,提出了一种细粒度并行优化方法,从任务划分和负载均衡、主系统存储和GPU存储、通信开销等3方面加以优化;同时,设计了基于面片的PMVS算法特征提取的GPU和多线程并行改造方法,实现了CPUs_GPUs多粒度协同并行。实验结果表明,基于CPU多线程策略能实现4倍加速比,基于统一计算设备架构(compute unified device architecture, CUDA)并行策略能实现最高34倍加速比,而提出的策略在CUDA并行策略的基础上实现了30%的性能提升,可以用于其他领域大数据处理中快速调度计算资源。 We address the problem of fine-grained parallel optimization of large-scale data. Patch-based multi-view stereo(PMVS) algorithm has been widely applied to digital city and other fields because of its good three-dimensional reconstruction effect, however, its large-scale computing algorithm has a low execution efficiency. Therefore, to address the limitation, this paper proposes a fine-grained parallel optimization method, including task allocation and load-balancing;strategies of main system memory and GPU memory;the optimization of communication. We perform CPU multi-threading operation using the pthreads function library to take full advantage of the computing power of multi-core CPUs. And for GPUs, we utilize the CUDA framework while optimizing thread organization and memory access. Besides that, we propose the idea of adapting memory pool model and pipelining model to improve bandwidth availability ratio. The memory pool model reduces the impact of data resources transferring on the bus for CPUs_GPUs while waiting for resources;the pipelining model hides communication time for CPU to read data from memory. At the same time, this paper utilizes the Harris-DOG feature extraction of PMVS algorithm of sequences of images as the example to verify our optimization strategies. The experiments demonstrate that the multi-threading CPU-based strategy can achieve 4 times speed-up ratio, the highest ratio that parallel CUDA-based strategy can achieve is 34 times, and our strategy can improve the performance 30% on the basis of the parallel CUDA-based strategy. In the future, our optimization strategy can be applied to quick computing resource scheduling in big data processing of other domains.
作者 刘金硕 李扬眉 江庄毅 邓娟 眭海刚 Pan Jeff LlUJinshuo;LI Yangmei;JIANG Zhuangyi;DENG Juan;SUI Haigang;PAN Jeff(School of Cyber Science and Engineering. Wuhan University, Wuhan 430072 , China;School of Computer Science, Technical Univers让y of Munich, Munich 85748, Germany;School of Computer Science, Wuhan University, Wuhan 430072, China;State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing,Wuhan University, Wuhan 430079, China;Department of Computing Science, University of Aberdeen, Aberdeen AB24 3FX, UK)
出处 《武汉大学学报(信息科学版)》 EI CSCD 北大核心 2019年第4期608-616,共9页 Geomatics and Information Science of Wuhan University
基金 国家自然科学基金(61672393 U1536204)~~
关键词 CPUs_GPUs多粒度并行 GPU并行优化 CUDA 负载均衡 存储与通信优化 图像处理 CPUs_GPUs multi-granularity parallel GPU parallel optimization CUDA load balancing storage and communication optimization image processing
  • 相关文献

参考文献6

二级参考文献68

  • 1张学民,刁桂苓,赵英萍,王卫民,束沛镒.华北地区深部地幔剪切波速度结构研究[J].地球物理学报,2006,49(6):1709-1719. 被引量:18
  • 2Kitchen L, Rosenfeld A. Gray-level Corner Detec tion[J]. Pattern Recognition Letters, 1982, 1(2): 95-102.
  • 3Mokhtarian F, Suomela R. Robust Image Corner Detection Through Curvature Scale Space[J]. IEEE Transaction on Pattern Analysis and Machine Intel- ligence, 1998, 20(12):1 376-1 381.
  • 4Freeman H, Davis L S. A Corner Finding Algo- rithm for Chain-coded Curves[J]. IEEE Transaction on Computers, 1977, C-26(3) :297-303.
  • 5Wang H, Brady M. Real-time Corner Detection Al- gorithm for Motion Estimation[J]. Image and Vi- sion Computing, 1995, 13(9):695-703.
  • 6Harris C G, Stephens M J. A Combined Corner and Edge Detector[C]. Proceedings of the Fourth Alvey Vision Conference, Manchester, UK, 1988.
  • 7Moravec H P. Towards Automatic Visual Obstacle Avoidance[C]. Proceedings of International Joint Conference on Artificial Intelligence, Cambridge, MA, USA, 1977.
  • 8Trajkovic M, Hedley M. Fast Corner Detection[J]. Image and Version Computing, 1998, 16(2):75-87.
  • 9Smith S M, Brady M. SUSAN a New Ap- proach to Low Level Image Processing[J]. Interna- tional Journal of Computer Version, 1997, 23(1): 45-78.
  • 10Boyer M, Skadron K, Weimer W. Automated Dy- namic Analysis of CUDA Programs[EB/OL]. ht- tp://web, mit. edu/rabbah/www/conferences/08/ stmcs/papers/boyer-stmcs08, pdf, 2008.

共引文献36

同被引文献40

引证文献4

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部