期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
RUMINATE METHOD-SOFTWARE PIPELINING ON NESTED LOOPS
1
作者 LEI WANG ZHIZHONGTANG and CHIHONG ZHANG(Dept. of Computer Science, Tsinghua Lirnivcrsitg Beijing 100084,P. R. China)(Final: wl,t ang ,zch@est4. dcs. tsinghua.edu. cn) 《Wuhan University Journal of Natural Sciences》 CAS 1996年第Z1期430-436,共7页
This paper offers a new method to solve the problem of software pipelininsr on nested loops. We first introduce our new software pipelininog method. Ruminate Method, which can optimize program with nested loops. We al... This paper offers a new method to solve the problem of software pipelininsr on nested loops. We first introduce our new software pipelininog method. Ruminate Method, which can optimize program with nested loops. We also outline an algorithm to realize it and introduce the hardware support we designed. The performance of Ruminate Method is analyzed at the end of this paper with the aid of our preliminary experimental result. 展开更多
关键词 instruction-level Parallelism Software Pipeline Ruminate Method Nested Loop.
下载PDF
Trace Software Pipelining
2
作者 王剑 AndreasKrall 《Journal of Computer Science & Technology》 SCIE EI CSCD 1995年第6期481-490,共10页
Global software pipelining is a complex but efficient compilation technique to exploit instruction-level parallelism for loops with branches. This paper presents a novel global software pipelining technique, called Th... Global software pipelining is a complex but efficient compilation technique to exploit instruction-level parallelism for loops with branches. This paper presents a novel global software pipelining technique, called Thace Software Pipelining,targeted to the instruction-level parallel processors such as Very Long Instruc-tion Word (VLIW) and superscalar machines. Thace software pipelining applies a global code scheduling technique to compact the original loop body. The re-sulting loop is called a trace software pipelined (TSP) code. The trace softwrae pipelined code can be directly executed with special architectural support or call be transformed into a globally software pipelined loop for the current VLIW and superscalar processors. Thus, exploiting parallelism across all iterations of a loop can be completed through compacting the original loop body with any global code scheduling technique. This makes our new technique very promis-ing in practical compilers. Finally, we also present the preliminary experimental results to support our new approach. 展开更多
关键词 instruction-level parallelism fine-grain parallelism software pipelining loop scheduling Very Long Instruction Word (VLIW) superscalar processor
原文传递
DYNAMEM-A Microarchitecture for Improving Memory Disambiguation at Run-Time
3
作者 王显著 廖恒 李三立 《Journal of Computer Science & Technology》 SCIE EI CSCD 1996年第6期589-600,共12页
This paper presents a new microarchitecture technique named DYNAMEM,in which memory reference instructions are dynamically scheduled and can be executed out-of-order. Load instructions can bypass store instructions sp... This paper presents a new microarchitecture technique named DYNAMEM,in which memory reference instructions are dynamically scheduled and can be executed out-of-order. Load instructions can bypass store instructions specula-tively, even if the store instructions'addresses are unknown. DYNAMEM can greatly alleviate the restraints of ambiguous memory dependencies. Simulation results show that the frequency of false load is low. Mechanism has been pro-vided to repair false loads with low penalty, and to achieve precise interrupts.Discussions and experimental results show that DYNAMEM could dramatically raise instruction-level parallelism in programs without recompilation. 展开更多
关键词 instruction-level parallelism dynamic scheduling memory dependency
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部