PELLR: A Permutated ELLPACK-R Format for SpMV on GPUs

PELLR: A Permutated ELLPACK-R Format for SpMV on GPUs

下载PDF

导出

摘要 The sparse matrix vector multiplication (SpMV) is inevitable in almost all kinds of scientific computation, such as iterative methods for solving linear systems and eigenvalue problems. With the emergence and development of Graphics Processing Units (GPUs), high efficient formats for SpMV should be constructed. The performance of SpMV is mainly determinted by the storage format for sparse matrix. Based on the idea of JAD format, this paper improved the ELLPACK-R format, reduced the waiting time between different threads in a warp, and the speed up achieved about 1.5 in our experimental results. Compared with other formats, such as CSR, ELL, BiELL and so on, our format performance of SpMV is optimal over 70 percent of the test matrix. We proposed a method based on parameters to analyze the performance impact on different formats. In addition, a formula was constructed to count the computation and the number of iterations. The sparse matrix vector multiplication (SpMV) is inevitable in almost all kinds of scientific computation, such as iterative methods for solving linear systems and eigenvalue problems. With the emergence and development of Graphics Processing Units (GPUs), high efficient formats for SpMV should be constructed. The performance of SpMV is mainly determinted by the storage format for sparse matrix. Based on the idea of JAD format, this paper improved the ELLPACK-R format, reduced the waiting time between different threads in a warp, and the speed up achieved about 1.5 in our experimental results. Compared with other formats, such as CSR, ELL, BiELL and so on, our format performance of SpMV is optimal over 70 percent of the test matrix. We proposed a method based on parameters to analyze the performance impact on different formats. In addition, a formula was constructed to count the computation and the number of iterations.

作者 Zhiqi Wang Tongxiang Gu

机构地区 School of Mathematics and Statistics

出处《Journal of Computer and Communications》 2020年第4期44-58,共15页 电脑和通信（英文）

关键词 SpMV GPU STORAGE FORMAT HIGH PERFORMANCE SpMV GPU Storage Format High Performance

分类号 TP3 [自动化与计算机技术—计算机科学与技术]

引文网络
相关文献

1Shohei Ikawa,Naoki Takada,Hiromitsu Araki,Hiroaki Niwase,Hiromi Sannomiya,Hirotaka Nakayama,Minoru Oikawa,Yuichiro Mori,Takashi Kakue,Tomoyoshi Shimobaba,Tomoyoshi Ito.Real-time color holographic video reconstruction using multiple-graphics processing unit cluster acceleration and three spatial light modulators[J].Chinese Optics Letters,2020,18(1):18-22. 被引量：5
2Muhammad Saqib,Muhammad Iqbal,Shahid Ali,Tariq Ismaeel.New Fourth and Fifth-Order Iterative Methods for Solving Nonlinear Equations[J].Applied Mathematics,2015,6(8):1220-1227. 被引量：2
3Ruixing Wang,Tongxiang Gu,Ming Li.Performance Prediction Based on Statistics of Sparse Matrix-Vector Multiplication on GPUs[J].Journal of Computer and Communications,2017,5(6):65-83. 被引量：1
4Huanzhou Zhu,Zhuoer Gu,Haiming Zhao,Keyang Chen,Chang-Tsun Li,Ligang He.Developing a Pattern Discovery Method in Time Series Data and Its GPU Acceleration[J].Big Data Mining and Analytics,2018,1(4):266-283.
5Imaddin A. Al-Omari,Ralph Skomski,David J. Sellmyer.Magnetic Properties of Y3-2xCa2xFe5-xVxO12Garnets[J].Advances in Materials Physics and Chemistry,2012,2(3):116-120.
6Anastasia-Dimitra Lipitakis.Explicit Iterative Methods of Second Order and Approximate Inverse Preconditioners for Solving Complex Computational Problems[J].Applied Mathematics,2020,11(4):307-327.
7Bing Zhang.The delay time of gravitational wave-gamma-ray burst associations[J].Frontiers of physics,2019,14(6):133-143. 被引量：1
8Nashila Hirji,Sophie Jones,Graham Thompson.The Causes of Distress in Paediatric Outpatients Receiving Dilating Drops[J].Open Journal of Ophthalmology,2012,2(2):21-25.
9Wenbo Li,Qingfeng Hu,Yunwei Liu,Mengmeng Zhang,Jiajun Wang,Xiaopeng Han,Cheng Zhong,Wenbin Hu,Yida Deng.Powder metallurgy synthesis of porous Ni-Fe alloy for oxygen evolution reaction and overall water splitting[J].Journal of Materials Science & Technology,2020(2):154-160.
10Isaac Fried.Newton’s Method and an Exact Opposite That Average into Halley’s Method[J].Applied Mathematics,2017,8(10):1427-1436.

Journal of Computer and Communications

2020年第4期

浏览历史

内容加载中请稍等...

PELLR: A Permutated ELLPACK-R Format for SpMV on GPUs

相关作者

相关机构

相关主题

浏览历史