PhiBench 2.0: characterizing data analytics workloads on Intel Knights Landing
Authors: Xie Biwei, Zhan Jianfeng, Wang Lei, Zhang Lixin. High Technology Letters (EI, CAS), 2019, No. 2, pp. 121-128 (8 pages)
With its high computational capacity, e.g. many cores and wide floating-point SIMD units, Intel Xeon Phi shows promise for accelerating high-performance computing (HPC) applications. However, whether Intel Xeon Phi suits data analytics workloads in the data center is still an open question. PhiBench 2.0 is built for the latest generation of Intel Xeon Phi (KNL, Knights Landing) and is based on the prior work PhiBench (also named BigDataBench-Phi), which was designed for the former generation of Intel Xeon Phi (KNC, Knights Corner). The workloads of PhiBench 2.0 are carefully chosen from BigDataBench 4.0 and PhiBench 1.0. In addition, these workloads are optimized for KNL and run on real-world datasets to evaluate their performance and scalability. Further, microarchitecture-level characteristics, including CPI, cache behavior, vectorization intensity, and branch prediction efficiency, are analyzed, and the impact of affinity and scheduling policy on performance is investigated. It is believed that the observations will help other researchers working on Intel Xeon Phi and data analytics workloads.
Keywords: Intel Xeon Phi; data analytics; workloads characterization; Knights Landing (KNL); many-core; x86 processors
Analysis of Intel KNL Characteristics Based on Matrix Transposition Optimization (cited by: 2)
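Authors: Wang Qi (王琦), Han Lin (韩林), Gao Yuchen (高雨辰), Li Yingying (李颖颖), Wang Xi (王曦). Computer Engineering and Design (《计算机工程与设计》; PKU Core), 2018, No. 5, pp. 1358-1364, 1371 (8 pages)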
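Matrix transposition is memory-access intensive and lends itself to parallel optimization, so a matrix transposition program is optimized here in order to analyze the new features of the Knights Landing platform. The characteristics of the matrix transposition program are analyzed, and matrices are divided into three classes according to the number of elements in a row. The AVX-512 extended instruction set provided by the KNL platform is used for vectorization and data prefetching, and OpenMP is used to implement parallel optimization at two different granularities. Using the matrix transposition program and a comparison of experimental data, the characteristics of optimized programs on the KNL platform and the differing behavior of its modes are analyzed.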
Keywords: matrix transposition; second-generation Intel Xeon Phi processor; parallel optimization; high-bandwidth memory; cluster mode