期刊文献+
共找到524篇文章
< 1 2 27 >
每页显示 20 50 100
Fast Parallel Algorithm for Slicing STL Based on Pipeline 被引量:4
1
作者 MA Xulong LIN Feng YAO Bo 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2016年第3期549-555,共7页
In Additive Manufacturing field, the current researches of data processing mainly focus on a slicing process of large STL files or complicated CAD models. To improve the efficiency and reduce the slicing time, a paral... In Additive Manufacturing field, the current researches of data processing mainly focus on a slicing process of large STL files or complicated CAD models. To improve the efficiency and reduce the slicing time, a parallel algorithm has great advantages. However, traditional algorithms can't make full use of multi-core CPU hardware resources. In the paper, a fast parallel algorithm is presented to speed up data processing. A pipeline mode is adopted to design the parallel algorithm. And the complexity of the pipeline algorithm is analyzed theoretically. To evaluate the performance of the new algorithm, effects of threads number and layers number are investigated by a serial of experiments. The experimental results show that the threads number and layers number are two remarkable factors to the speedup ratio. The tendency of speedup versus threads number reveals a positive relationship which greatly agrees with the Amdahl's law, and the tendency of speedup versus layers number also keeps a positive relationship agreeing with Gustafson's law. The new algorithm uses topological information to compute contours with a parallel method of speedup. Another parallel algorithm based on data parallel is used in experiments to show that pipeline parallel mode is more efficient. A case study at last shows a suspending performance of the new parallel algorithm. Compared with the serial slicing algorithm, the new pipeline parallel algorithm can make full use of the multi-core CPU hardware, accelerate the slicing process, and compared with the data parallel slicing algorithm, the new slicing algorithm in this paper adopts a pipeline parallel model, and a much higher speedup ratio and efficiency is achieved. 展开更多
关键词 additive manufacturing STL model slicing algorithm data parallel pipeline parallel
下载PDF
A parallel pipeline connected-component labeling method for on-orbit space target monitoring
2
作者 LI Zongling ZHANG Qingjun +1 位作者 LONG Teng ZHAO Baojun 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2022年第5期1095-1107,共13页
The paper designs a peripheral maximum gray differ-ence(PMGD)image segmentation method,a connected-compo-nent labeling(CCL)algorithm based on dynamic run length(DRL),and a real-time implementation streaming processor ... The paper designs a peripheral maximum gray differ-ence(PMGD)image segmentation method,a connected-compo-nent labeling(CCL)algorithm based on dynamic run length(DRL),and a real-time implementation streaming processor for DRL-CCL.And it verifies the function and performance in space target monitoring scene by the carrying experiment of Tianzhou-3 cargo spacecraft(TZ-3).The PMGD image segmentation method can segment the image into highly discrete and simple point tar-gets quickly,which reduces the generation of equivalences greatly and improves the real-time performance for DRL-CCL.Through parallel pipeline design,the storage of the streaming processor is optimized by 55%with no need for external me-mory,the logic is optimized by 60%,and the energy efficiency ratio is 12 times than that of the graphics processing unit,62 times than that of the digital signal proccessing,and 147 times than that of personal computers.Analyzing the results of 8756 images completed on-orbit,the speed is up to 5.88 FPS and the target detection rate is 100%.Our algorithm and implementation method meet the requirements of lightweight,high real-time,strong robustness,full-time,and stable operation in space irradia-tion environment. 展开更多
关键词 Tianzhou-3 cargo spacecraft(TZ-3) connected-component labeling(CCL)algorithms parallel pipeline processing on-orbit space target detection streaming processor
下载PDF
Parallel Pipelines for DNA Sequence Alignment on a Cluster of Multicores: A Comparison of Communication Models
3
作者 Enzo Rucci Franco Chichizola Marcelo Naiouf Laura De Giusti Armando De Giusti 《通讯和计算机(中英文版)》 2012年第12期1364-1371,共8页
关键词 DNA序列比对 通信模型 并行编程 SMITH-WATERMAN算法 多核 流水线 群集 体系结构
下载PDF
用于高速CIS的12-bit紧凑型多列共享并行pipeline-SAR ADC(英文) 被引量:1
4
作者 郭志强 刘力源 吴南健 《红外与激光工程》 EI CSCD 北大核心 2018年第5期187-196,共10页
设计了一款用于高速CMOS图像传感器的多列共享列并行流水线逐次逼近模数转换器。八列像素共享一路pipeline-SAR ADC,从而使得ADC的版图不再局限于二列像素的宽度,可以在16列像素宽度内实现。该模数转换器采用了异步控制逻辑电路来提高... 设计了一款用于高速CMOS图像传感器的多列共享列并行流水线逐次逼近模数转换器。八列像素共享一路pipeline-SAR ADC,从而使得ADC的版图不再局限于二列像素的宽度,可以在16列像素宽度内实现。该模数转换器采用了异步控制逻辑电路来提高转换速度。半增益数模混合单元电路被用于对第一级子ADC的余差信号放大,同时被用于降低对增益数模混合单元电路中运放性能的要求。相关电平位移技术也被用于对余差信号进行更精确的放大。整个pipeline-SAR ADC第一级子ADC精度为6-bit,第二级子ADC为7-bit,两级之间存在1-bit冗余校准,最终实现12-bit精度。输入信号满幅电压为1 V。该8列共享并行处理的pipeline-SAR ADC在0.18μm 1P4M工艺下制造实现,芯片面积为0.204 mm^2。仿真结果显示,在采样频率为8.33 Msps,输入信号频率为229.7 kHz时,该ADC的信噪失真比为72.6 d B;在采样频率为8.33 Msps,输入信号频率为4.16 MHz时,该ADC的信噪失真比为71.7 dB。该pipelineSAR ADC的电源电压为1.8 V,功耗为4.95 mW,功耗品质因子(FoM)为172.5 fJ/conversion-step。由于像素尺寸只有7.5μm,工艺只有四层金属,因此这款12-bit多列共享列并行流水线逐次逼近模数转换器非常适用于高速CMOS图像传感器系统。 展开更多
关键词 高速CMOS图像传感器 多列共享列并行 pipeline-SAR AD
下载PDF
基于Pipeline的一类动态规划并行算法 被引量:1
5
作者 何奇 《计算机学报》 EI CSCD 北大核心 1994年第7期527-535,共9页
动态规划是解决组合优化问题的有效方法之一.本文基于Pipline结构,提出并分析了三个相似的动态规划并行算法(求简单最短路径.求最长公共子串和解背包问题).获得了较理想的加速比、并行效率等指标.进而提出并讨论了这一类... 动态规划是解决组合优化问题的有效方法之一.本文基于Pipline结构,提出并分析了三个相似的动态规划并行算法(求简单最短路径.求最长公共子串和解背包问题).获得了较理想的加速比、并行效率等指标.进而提出并讨论了这一类问题之动态规划并行处理的一般化思想及方法. 展开更多
关键词 pipeline结构 动态规划 并行算法
下载PDF
A Low Power and High Speed Viterbi Decoder Based on Deep Pipelined, Clock Blocking and Hazards Filtering
6
作者 C. ARUN V. RAJAMANI 《International Journal of Communications, Network and System Sciences》 2009年第6期575-582,共8页
A high speed and low power Viterbi decoder architecture design based on deep pipelined, clock gating and toggle filtering has been presented in this paper. The Add-Compare-Select (ACS) and Trace Back (TB) units and it... A high speed and low power Viterbi decoder architecture design based on deep pipelined, clock gating and toggle filtering has been presented in this paper. The Add-Compare-Select (ACS) and Trace Back (TB) units and its sub circuits of the decoder have been operated in deep pipelined manner to achieve high transmission rate. The Power dissipation analysis is also investigated and compared with the existing results. The techniques that have been employed in our low-power design are clock-gating and toggle filtering. The synthesized circuits are placed and routed in the standard cell design environment and implemented on a Xilinx XC2VP2fg256-6 FPGA device. Power estimation obtained through gate level simulations indicated that the proposed design reduces the power dissipation of an original Viterbi decoder design by 68.82% and a speed of 145 MHz is achieved. 展开更多
关键词 VITERBI DECODER Convolutional Codes High-Speed Low Power Consumption parallel Processing DEEP pipelinING
下载PDF
A Practical Parallel Algorithm for All-Pair Shortest Path Based on Pipelining
7
作者 Hua Wang Ling Tian Chun-Hua Jiang 《Journal of Electronic Science and Technology of China》 2008年第3期329-333,共5页
On the basis of Floyd algorithm with the extended path matrix, a parallel algorithm which resolves all-pair shortest path (APSP) problem on cluster environment is analyzed and designed. Meanwhile, the parallel APSP ... On the basis of Floyd algorithm with the extended path matrix, a parallel algorithm which resolves all-pair shortest path (APSP) problem on cluster environment is analyzed and designed. Meanwhile, the parallel APSP pipelining algorithm makes full use of overlapping technique between computation and communication. Compared with broadcast operation, the parallel algorithm reduces communication cost. This algorithm has been implemented on MPI on PC-cluster. The theoretical analysis and experimental results show that the parallel algorithm is an efficient and scalable algorithm. 展开更多
关键词 All-pair shortest path Floyd algorithm pipelinING parallel algorithm
下载PDF
A parallel-pipelining software process model
8
作者 赵鹏 龚鹏 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 2007年第5期646-651,共6页
Software process is a framework for effective and timely delivery of software system. The framework plays a crucial role for software success. However, the development of large-scale software still faces the crisis of... Software process is a framework for effective and timely delivery of software system. The framework plays a crucial role for software success. However, the development of large-scale software still faces the crisis of high risks, low quality, high costs and long cycle time. This paper proposed a three-phase parallel-pipelining software process model for improving speed and productivity, and reducing software costs and risks without sacrificing software quality. In this model, two strategies were presented. One strategy, based on subsystem-cost priority, was used to prevent software development cost wasting and to reduce software complexity as well; the other strategy, used for balancing subsystem complexity, was designed to reduce the software complexity in the later development stages. Moreover, the proposed function-detailed and workload-simplified subsystem pipelining software process model presents much higher parallelity than the concurrent incremental model. Finally, the component-based product line technology not only ensures software quality and further reduces cycle time, software costs, and software risks but also sufficiently and rationally utilizes previous software product resources and enhances the competition ability of software development organizations. 展开更多
关键词 软件 设计方案 编程语言 计算机技术
下载PDF
面向微控制器的卷积神经网络加速器设计
9
作者 乔建华 吴言 +1 位作者 栗亚宁 雷光政 《电子器件》 CAS 2024年第1期48-54,共7页
针对目前嵌入式微控制器的性能难以满足实时图像识别任务的问题,提出一种适用于微控制器的卷积神经网络加速器。该加速器在卷积层设计了无阻塞的行并行乘法-加法树结构,获得了更高的硬件利用率;为了满足行并行的数据吞吐量,设计了卷积专... 针对目前嵌入式微控制器的性能难以满足实时图像识别任务的问题,提出一种适用于微控制器的卷积神经网络加速器。该加速器在卷积层设计了无阻塞的行并行乘法-加法树结构,获得了更高的硬件利用率;为了满足行并行的数据吞吐量,设计了卷积专用SRAM存储器。加速器将池化和激活单元融入数据通路,有效减少数据重复存取带来的时间开销。FPGA原型验证表明加速器的性能达到92.2 GOPS@100 MHz;基于TSMC 130 nm工艺节点进行逻辑综合,加速器的动态功耗为33 mW,面积为90 764.2μm^(2),能效比高达2 793 GOPS/W,比FPGA加速器方案提高了约100倍。该加速器低功耗、低成本的特性,有利于实现嵌入式系统在目标检测、人脸识别等机器视觉领域的广泛应用。 展开更多
关键词 卷积神经网络 并行计算 流水线 硬件加速器 专用集成电路
下载PDF
西南山区并行油气管道阴极保护系统之间的干扰因素与规律
10
作者 王爱玲 《腐蚀与防护》 CAS CSCD 北大核心 2024年第2期97-102,共6页
以西南山区并行油气管道阴极保护系统之间干扰为研究对象,采用BEASY软件模拟计算了多种因素对管道阴极保护系统之间干扰的影响,并通过现场跨接缓解管道干扰。结果表明:并行管道阴极保护系统间的干扰随管道并行间距的增大呈逐渐减小的趋... 以西南山区并行油气管道阴极保护系统之间干扰为研究对象,采用BEASY软件模拟计算了多种因素对管道阴极保护系统之间干扰的影响,并通过现场跨接缓解管道干扰。结果表明:并行管道阴极保护系统间的干扰随管道并行间距的增大呈逐渐减小的趋势;当受干扰管道的涂层破损率较大时,管道电位负向偏移明显;并行管道阴极保护系统间的干扰程度随土壤电阻率的增大而增大,且电位偏移呈边界递减趋势;跨接能有效提升阴极保护水平较差管线的保护效果,降低并行管道之间的干扰程度。 展开更多
关键词 并行管道 数值模拟 干扰程度 跨接
下载PDF
新疆油田并行管道阴保系统相互干扰因素及规律
11
作者 时彦杰 邓丽媛 +6 位作者 栾翔 廖臻 罗泰星 陈帅 王晨 刘艳明 吕祥鸿 《西安石油大学学报(自然科学版)》 CAS 北大核心 2024年第2期103-111,119,共10页
针对新疆油田多条管道并行敷设产生的阴保系统相互干扰问题,采用数值模拟技术,研究土壤电阻率等8个因素对并行管道干扰程度的影响规律。结果表明:随着土壤电阻率或干扰管道涂层破损率增加,被干扰管道近阳极端电位负移,远阳极端电位正移... 针对新疆油田多条管道并行敷设产生的阴保系统相互干扰问题,采用数值模拟技术,研究土壤电阻率等8个因素对并行管道干扰程度的影响规律。结果表明:随着土壤电阻率或干扰管道涂层破损率增加,被干扰管道近阳极端电位负移,远阳极端电位正移,干扰显著增强;随着管道间距增加,干扰程度略有增加后逐渐降低,且最强干扰管道间距随土壤电阻率和涂层破损率增加而增大;随着干扰管道直径增大,或当干扰管道为高温管道时,并行管道受干扰略微增强;随着干扰管道辅助阳极距离增加,干扰减弱,但当辅助阳极距离小于200 m时干扰影响仍较为显著;当辅助阳极位置分别在两条并行管道一侧或两侧分布时,并行管道受干扰程度均较小;随着干扰管道数量增多,干扰电流存在明显叠加效应,干扰程度显著增强。综上,土壤电阻率、涂层破损率及干扰管道数量是阴保系统相互干扰的主控因素,管道间距、辅助阳极距离影响次之,辅助阳极相对位置、管道类型、管道直径影响较小。 展开更多
关键词 并行管道 阴极保护系统 直流干扰 干扰因素 干扰规律
下载PDF
某双管同沟敷设管线阴极保护系统干扰防护数值模拟研究
12
作者 陈帅 李洪福 +7 位作者 罗小武 肖会会 时彦杰 罗泰星 刘艳明 王晨 吕祥鸿 崔光磊 《材料保护》 CAS CSCD 2024年第6期175-185,198,共12页
为明确同沟敷设管道阴极保护系统相互干扰规律及干扰防护措施,以新疆油田某双管同沟敷设管道(D219和D457)为模型,设置D219管道的涂层破损率为0.1%,D457管道的涂层破损率为0.1%、0.5%、1.0%,采用数值模拟方法对不同阴极保护设计及阴极保... 为明确同沟敷设管道阴极保护系统相互干扰规律及干扰防护措施,以新疆油田某双管同沟敷设管道(D219和D457)为模型,设置D219管道的涂层破损率为0.1%,D457管道的涂层破损率为0.1%、0.5%、1.0%,采用数值模拟方法对不同阴极保护设计及阴极保护干扰防护措施的有效性及适用性进行评估。结果表明:采用浅埋阳极单独阴极保护的同沟敷设管道存在阴极保护系统相互干扰,随着干扰管道D457防腐层破损率的增加,其对并行管道D219的干扰程度增强,施加均压线措施可有效降低干扰;采用浅埋阳极联合阴极保护+均压线措施可消除同沟敷设管道阴极保护系统的相互干扰,当双管同沟管道防腐层破损率相差较大(D2190.1%,D4571.0%)时,改善效果较为显著;对于采用深井阳极联合阴极保护的同沟敷设管道,当D457防腐层破损率较大(1.0%)、土壤电阻率较高(50Ω·m)时,其阴极保护效果较差,采用均压线+牺牲阳极措施对阴极保护效果的改善作用不明显,而增大深井阳极输出电流联合均压线措施可显著改善同沟敷设管道的阴极保护效果。 展开更多
关键词 双管线 平行敷设 直流干扰 阴极保护 排流保护 数值模拟
下载PDF
相控阵快速波束控制算法设计与FPGA实现
13
作者 虞伶俐 赵志强 黎剑 《电子信息对抗技术》 2024年第2期79-84,共6页
相控阵天线的电扫描特性使其具有扫描灵活、指向精确、可靠性高和抗干扰能力强等特点,其实现波束快速扫描、波束形状快速变化、同时多波束快速形成能力的关键技术之一是波束控制技术。针对多波束多阵元波束控制快速响应的需求,对常规单... 相控阵天线的电扫描特性使其具有扫描灵活、指向精确、可靠性高和抗干扰能力强等特点,其实现波束快速扫描、波束形状快速变化、同时多波束快速形成能力的关键技术之一是波束控制技术。针对多波束多阵元波束控制快速响应的需求,对常规单波束控制算法进行分解和优化,提出单波束内基于截断归一化的流水线设计算法、多波束并行计算方法,来进行移相控制码计算。该算法完成单波束内N个阵元的移相码计算仅需(N+12)T(其中T为系统工作时钟的周期,典型值为8 ns),可快速计算各天线阵元所需移相控制码。所提方法较传统单波束移相控制码计算方法时间缩短一个数量级以上,并已在FPGA上得到实际验证与应用。 展开更多
关键词 波束控制 并行计算 流水线 FPGA
下载PDF
可扩展架构的超大点数FFT处理器设计
14
作者 王江 吴佳 《现代雷达》 CSCD 北大核心 2024年第5期54-59,共6页
面向合成孔径雷达、遥感、电子对抗等领域研究了一款高性能的超大点数快速傅里叶变换(FFT)处理器。文中提出了一种可扩展架构,即针对不同的应用场景可以动态实时调整FFT算法的基数以及处理点数;存储器划分为16个存储模块,可以通过产生... 面向合成孔径雷达、遥感、电子对抗等领域研究了一款高性能的超大点数快速傅里叶变换(FFT)处理器。文中提出了一种可扩展架构,即针对不同的应用场景可以动态实时调整FFT算法的基数以及处理点数;存储器划分为16个存储模块,可以通过产生无冲突地址进行访问,输出与输入数据帧可以共享同一存储器,具备高效存储器特征。FFT运算采用并行流水线排布,当采用高基算法时,可高并行度访问存储器,实现并行计算,从而获得明显的实时性优势。FFT各级运算采用循环移位寄存器产生地址,以保证蝶算单元输入数据的抽取间隔,并在最后一级输出时进行循环移位寄存器反转操作产生地址,实现输入输出共享存储器。所提设计方法规整、高效、适用范围广泛,便于现场可编程逻辑器件以及集成电路实施,并且也将持续受益于存储器工艺的提升。 展开更多
关键词 快速傅里叶变换 并行计算 可扩展架构 高效存储器 流水线
下载PDF
矩形顶管隧道邻近刚性管线施工安全距离研究
15
作者 丁启峰 李文江 +1 位作者 孙源 李乾 《粉煤灰综合利用》 CAS 2024年第2期64-69,共6页
针对顶管隧道邻近刚性管线施工安全距离问题,借助有限差分程序和梁—弹簧模型,采用两阶段法,对净空2.6 m×2.4 m矩形顶管电缆隧道在软黏土地层施工时土体变位特征进行研究,分析顶管平行下穿地下刚性管线时管—隧净距、管材和管径对... 针对顶管隧道邻近刚性管线施工安全距离问题,借助有限差分程序和梁—弹簧模型,采用两阶段法,对净空2.6 m×2.4 m矩形顶管电缆隧道在软黏土地层施工时土体变位特征进行研究,分析顶管平行下穿地下刚性管线时管—隧净距、管材和管径对刚性管线最大拉应力的影响,并根据服务年限对管线强度控制标准进行了修正,确定顶管下穿管线的安全施工距离。结果表明:管线最大变形曲率与最大拉应力随着顶管与管线间距的增大呈指数性衰减;同种材质下,管线管径越大,施工扰动导致的最大拉应力越大;管—隧净距及管线管径一定时,钢管的最大拉应力最大,球磨铸铁管其次,灰口铸铁管最小。 展开更多
关键词 矩形顶管 刚性管线 平行下穿 两阶段法 变形特征 安全距离
下载PDF
大型并联天然气管网系统运行优化技术研究
16
作者 刘恩斌 彭勇 +1 位作者 杨毅 李长俊 《油气田地面工程》 2024年第1期1-7,共7页
某大型长距离并行管道系统结构复杂,其压气机站合建在一起,且进站口和出站口设置有联通管线和截断阀门,由于两条管线内流量能够自由分配,使得沿线不同站场可能出现双线联合运行、双线独立运行、单线压力越站、双线压力越站等多种复杂工... 某大型长距离并行管道系统结构复杂,其压气机站合建在一起,且进站口和出站口设置有联通管线和截断阀门,由于两条管线内流量能够自由分配,使得沿线不同站场可能出现双线联合运行、双线独立运行、单线压力越站、双线压力越站等多种复杂工况。为了减少大型并联管道系统能耗,构建了并联管道的优化运行模型,该模型以能耗最低为目标函数,考虑了压缩机自耗气、站场流量分配以及压力分配等多种约束,能够确定不同站场的运行方式,计算不同管线间的最佳流量分配。由于模型决策变量较多,采用一种改进的粒子群算法(PPSO)对模型进行优化计算,通过实践该算法获取的优化方案最好。研究结果表明:优化后,管道系统可降低能耗25.31%,对现场操作具有一定指导意义。 展开更多
关键词 天然气 并联管网 粒子群算法(PPSO) 优化模型
下载PDF
安哥拉长大玻璃钢双管供水管道穿越河滩沼泽地段施工技术
17
作者 杨宗文 《路基工程》 2024年第1期150-154,共5页
安哥拉SOYOⅠ联合循环电站取水玻璃钢管道穿越刚果河滩沼泽地段,玻璃钢管双管布置,管径D=600 mm,管节长6 m,管节间采用双“O”型整体式承插连接,管道与设备、阀门采用法兰连接。对特殊、复杂地质条件下的玻璃钢管道安装方法进行改进,通... 安哥拉SOYOⅠ联合循环电站取水玻璃钢管道穿越刚果河滩沼泽地段,玻璃钢管双管布置,管径D=600 mm,管节长6 m,管节间采用双“O”型整体式承插连接,管道与设备、阀门采用法兰连接。对特殊、复杂地质条件下的玻璃钢管道安装方法进行改进,通过合理选取取水管材,确定最优管道基础处理方案,严格水压试验,确保玻璃钢管安装质量等措施,工程质量得到有效保证。 展开更多
关键词 长大管道 玻璃钢管 并排穿越 河滩沼泽 水压试验 中粗砂换填 毛石挤淤
下载PDF
Half Vector Message Pipelining Optimization of Barrier Synchronization Problems
18
作者 韩天舒 胡铭曾 +1 位作者 李晓明 丁雪梅 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 1999年第1期50-53,共4页
Communication optimization is very important for imporoving performance of parallel programs A communication optimization method called HVMP(Half Vector Message Ripelining) is presented. In comparison with the widelyu... Communication optimization is very important for imporoving performance of parallel programs A communication optimization method called HVMP(Half Vector Message Ripelining) is presented. In comparison with the widelyused vector message pipelining, HVMP can get better tradeoff between reducing and hiding communication overhead,and eliminate the communication barrier of barrier synchronization problems[1]. For parallel Systems with low bandwidth such as cluster of workstations and barrier synchronization problems with large amount of communication, HVMPmethod can get good performance. 展开更多
关键词 Data parallel BARRIER synchronization communication OPTIMIZATION method HALF VECTOR MESSAGE pipelinING
下载PDF
RUMINATE METHOD-SOFTWARE PIPELINING ON NESTED LOOPS
19
作者 LEI WANG ZHIZHONGTANG and CHIHONG ZHANG(Dept. of Computer Science, Tsinghua Lirnivcrsitg Beijing 100084,P. R. China)(Final: wl,t ang ,zch@est4. dcs. tsinghua.edu. cn) 《Wuhan University Journal of Natural Sciences》 CAS 1996年第Z1期430-436,共7页
This paper offers a new method to solve the problem of software pipelininsr on nested loops. We first introduce our new software pipelininog method. Ruminate Method, which can optimize program with nested loops. We al... This paper offers a new method to solve the problem of software pipelininsr on nested loops. We first introduce our new software pipelininog method. Ruminate Method, which can optimize program with nested loops. We also outline an algorithm to realize it and introduce the hardware support we designed. The performance of Ruminate Method is analyzed at the end of this paper with the aid of our preliminary experimental result. 展开更多
关键词 Instruction-level parallelism Software pipeline Ruminate Method Nested Loop.
下载PDF
Parallel spatial-temporal mode
20
作者 ZHU Ding-ju 《通讯和计算机(中英文版)》 2009年第4期42-46,共5页
关键词 空间时间模型 平行 数据流水线 计算机技术
下载PDF
上一页 1 2 27 下一页 到第
使用帮助 返回顶部