期刊文献+
共找到417篇文章
< 1 2 21 >
每页显示 20 50 100
Architecture, challenges and applications of dynamic reconfigurable computing 被引量:4
1
作者 Yanan Lu Leibo Liu +2 位作者 Jianfeng Zhu Shouyi Yin Shaojun Wei 《Journal of Semiconductors》 EI CAS CSCD 2020年第2期4-13,共10页
As a computing paradigm that combines temporal and spatial computations,dynamic reconfigurable computing provides superiorities of flexibility,energy efficiency and area efficiency,attracting interest from both academ... As a computing paradigm that combines temporal and spatial computations,dynamic reconfigurable computing provides superiorities of flexibility,energy efficiency and area efficiency,attracting interest from both academia and industry.However,dynamic reconfigurable computing is not yet mature because of several unsolved problems.This work introduces the concept,architecture,and compilation techniques of dynamic reconfigurable computing.It also discusses the existing major challenges and points out its potential applications. 展开更多
关键词 reconfigurable computing ARCHITECTURE CHALLENGE APPLICATION
下载PDF
THE RESEARCH AND DESIGN OF RECONFIGURABLE COMPUTING FOR BLOCK CIPHER 被引量:1
2
作者 Yang Xiaohui Dai Zibin Zhang Yongfu Yu Xuerong 《Journal of Electronics(China)》 2008年第4期503-510,共8页
This paper describes a new specialized Reconfigurable Cryptographic for Block ciphersArchitecture(RCBA).Application-specific computation pipelines can be configured according to thecharacteristics of the block cipher ... This paper describes a new specialized Reconfigurable Cryptographic for Block ciphersArchitecture(RCBA).Application-specific computation pipelines can be configured according to thecharacteristics of the block cipher processing in RCBA,which delivers high performance for crypto-graphic applications.RCBA adopts a coarse-grained reconfigurable architecture that mixes the ap-propriate amount of static configurations with dynamic configurations.RCBA has been implementedbased on Altera’s FPGA,and representative algorithms of block cipher such as DES,Rijndael and RC6have been mapped on RCBA architecture successfully.System performance has been analyzed,andfrom the analysis it is demonstrated that the RCBA architecture can achieve more flexibility and ef-ficiency when compared with other implementations. 展开更多
关键词 reconfigurable computing Block cipher reconfigurable Cryptographic for Block ciphers Architecture (RCBA)
下载PDF
A Reconfigurable Network-on-Chip Datapath for Application Specific Computing
3
作者 Joshua Weber Erdal Oruklu 《Circuits and Systems》 2013年第2期181-192,共12页
This paper introduces a new datapath architecture for reconfigurable processors. The proposed datapath is based on Network-on-Chip approach and facilitates tight coupling of all functional units. Reconfigurable functi... This paper introduces a new datapath architecture for reconfigurable processors. The proposed datapath is based on Network-on-Chip approach and facilitates tight coupling of all functional units. Reconfigurable functional elements can be dynamically allocated for application specific optimizations, enabling polymorphic computing. Using a modified network simulator, performance of several NoC topologies and parameters are investigated with standard benchmark programs, including fine grain and coarse grain computations. Simulation results highlight the flexibility and scalability of the proposed polymorphic NoC processor for a wide range of application domains. 展开更多
关键词 reconfigurable computing NETWORK-ON-CHIP NETWORK Simulators POLYMORPHIC computing
下载PDF
Parallel scheduling strategy of web-based spatial computing tasks in multi-core environment
4
作者 郭明强 Huang Ying Xie Zhong 《High Technology Letters》 EI CAS 2014年第4期395-400,共6页
In order to improve the concurrent access performance of the web-based spatial computing system in cluster,a parallel scheduling strategy based on the multi-core environment is proposed,which includes two levels of pa... In order to improve the concurrent access performance of the web-based spatial computing system in cluster,a parallel scheduling strategy based on the multi-core environment is proposed,which includes two levels of parallel processing mechanisms.One is that it can evenly allocate tasks to each server node in the cluster and the other is that it can implement the load balancing inside a server node.Based on the strategy,a new web-based spatial computing model is designed in this paper,in which,a task response ratio calculation method,a request queue buffer mechanism and a thread scheduling strategy are focused on.Experimental results show that the new model can fully use the multi-core computing advantage of each server node in the concurrent access environment and improve the average hits per second,average I/O Hits,CPU utilization and throughput.Using speed-up ratio to analyze the traditional model and the new one,the result shows that the new model has the best performance.The performance of the multi-core server nodes in the cluster is optimized;the resource utilization and the parallel processing capabilities are enhanced.The more CPU cores you have,the higher parallel processing capabilities will be obtained. 展开更多
关键词 parallel scheduling strategy the web-based spatial computing model multi-core environment load balancing
下载PDF
Design of graph computing accelerator based on reconfigurable PE array
5
作者 Deng Junyong Jia Yanting +2 位作者 Zhang Baoxiang Kang Yuchun Lu Songtao 《The Journal of China Universities of Posts and Telecommunications》 EI 2024年第5期49-63,70,共16页
Due to the diversity of graph computing applications, the power-law distribution of graph data, and the high compute-to-memory ratio, traditional architectures face significant challenges regarding poor flexibility, i... Due to the diversity of graph computing applications, the power-law distribution of graph data, and the high compute-to-memory ratio, traditional architectures face significant challenges regarding poor flexibility, imbalanced workload distribution, and inefficient memory access when executing graph computing tasks. Graph computing accelerator, GraphApp, based on a reconfigurable processing element(PE) array was proposed to address the challenges above. GraphApp utilizes 16 reconfigurable PEs for parallel computation and employs tiled data. By reasonably dividing the data into tiles, load balancing is achieved and the overall efficiency of parallel computation is enhanced. Additionally, it preprocesses graph data using the compressed sparse columns independently(CSCI) data compression format to alleviate the issue of low memory access efficiency caused by the high memory access-to-computation ratio. Lastly, GraphApp is evaluated using triangle counting(TC) and depth-first search(DFS) algorithms. Performance analysis is conducted by measuring the execution time of these algorithms in GraphApp against existing typical graph frameworks, Ligra, and GraphBIG, using six datasets from the Stanford Network Analysis Project(SNAP) database. The results show that GraphApp achieves a maximum performance improvement of 30.86% compared to Ligra and 20.43% compared to GraphBIG when processing the same datasets. 展开更多
关键词 graph computing reconfigurable accelerator parallel computing triangle counting(TC)algorithm depth-first search(DFS)algorithm
原文传递
Image processing algorithm acceleration using reconfigurable macro processor model 被引量:2
6
作者 SunGuanKfu ChenHuaming LuHuanzhang 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2004年第2期110-114,共5页
The concept and advantage of reconfigurable technology is introduced. A kind of processor architecture of re configurable macro processor (RMP) model based on FPGA array and DSP is put forward and has been implemented... The concept and advantage of reconfigurable technology is introduced. A kind of processor architecture of re configurable macro processor (RMP) model based on FPGA array and DSP is put forward and has been implemented. Two image algorithms are developed: template-based automatic target recognition and zone labeling. One is estimating for motion direction in the infrared image background, another is line picking-up algorithm based on image zone labeling and phase grouping technique. It is a kind of 'hardware' function that can be called by the DSP in high-level algorithm. It is also a kind of hardware algorithm of the DSP. The results of experiments show the reconfigurable computing technology based on RMP is an ideal accelerating means to deal with the high-speed image processing tasks. High real time performance is obtained in our two applications on RMP. 展开更多
关键词 real-time image processing reconfigurable computing technology reconfigurable macro processor model template matching image zone labeling.
下载PDF
Polymorphic Computing: Definition, Trends, and a New Agent-Based Architecture 被引量:3
7
作者 David Hentrich Erdal Oruklu Jafar Saniie 《Circuits and Systems》 2011年第4期358-364,共7页
Polymorphic computing is widely seen as next evolutionary step in designing advanced computing architectures. This paper presents a brief history of reconfigurable and polymorphic computing, and highlights the recent ... Polymorphic computing is widely seen as next evolutionary step in designing advanced computing architectures. This paper presents a brief history of reconfigurable and polymorphic computing, and highlights the recent trends and challenges. A novel polymorphic architecture featuring programmable memory event triggers and a new concept of control agents is proposed. This architecture can provide dynamic load balancing, distributed control, separated memory and processing fabrics, configurable memory blocks, and task-optimized computation. 展开更多
关键词 POLYMORPHIC computing reconfigurable computing Agents Processing FABRIC
下载PDF
Low Power Computing Paradigms Based on Emerging Non-Volatile Nanodevices 被引量:1
8
作者 G.-F.Wang W.Kang +4 位作者 Y.-Q.Cheng J.Nan J.-O.Klein Y.-G.Zhang W.-S.Zhao 《Journal of Electronic Science and Technology》 CAS 2014年第2期163-172,共10页
Traditional digital processing approaches are based on semiconductor transistors, which suffer from high power consumption, aggravating with technology node scaling. To solve definitively this problem, a number of eme... Traditional digital processing approaches are based on semiconductor transistors, which suffer from high power consumption, aggravating with technology node scaling. To solve definitively this problem, a number of emerging non-volatile nanodevices are under intense investigations. Meanwhile, novel computing circuits are invented to dig the full potential of the nanodevices. The combination of non-volatile nanodevices with suitable computing paradigms have many merits compared with the complementary metal-oxide-semiconductor transistor (CMOS) technology based structures, such as zero standby power, ultra-high density, non-volatility, and acceptable access speed. In this paper, we overview and compare the computing paradigms based on the emerging nanodevices towards ultra-low dissipation. 展开更多
关键词 Emerging nanodevices logic in memory low-power computing paradigms MEMRISTOR neuromorphic NORMALLY-OFF reconfigurable logic
下载PDF
Power Grid Islands Service Restoration Based on Cloud Computing 被引量:14
9
作者 ZHANG Hao HE Jinghan +2 位作者 YIN Hang BO Zhiqian B Kirby 《中国电机工程学报》 EI CSCD 北大核心 2011年第34期I0007-I0007,9,共1页
提出了一种基于云计算思想的电网恢复重构方法,利用网络中大量的分布式计算资源加速求取孤岛间网络恢复重构的最优解。利用电力系统分层分布式体系结构将含有多个微网的复杂配电网络视为不同的子云计算区,每个子云计算区根据自身情况将... 提出了一种基于云计算思想的电网恢复重构方法,利用网络中大量的分布式计算资源加速求取孤岛间网络恢复重构的最优解。利用电力系统分层分布式体系结构将含有多个微网的复杂配电网络视为不同的子云计算区,每个子云计算区根据自身情况将重构的服务请求分解为多个可独立处理的计算块并提交分布的计算节点进行并行处理和结果取优。相关联子云计算区之间可通信以协调不同区域间的负荷转供,最终实现故障恢复重构的目标。算例分析结果验证了该方法的有效性。 展开更多
关键词 英文摘要 内容介绍 编辑工作 期刊
下载PDF
Design and implementation of near-memory computing array architecture based on shared buffer 被引量:1
10
作者 SHAN Rui GAO Xu +3 位作者 FENG Yani HUI Chao CUI Xinyue CHAI Miaomiao 《High Technology Letters》 EI CAS 2022年第4期345-353,共9页
Deep learning algorithms have been widely used in computer vision,natural language processing and other fields.However,due to the ever-increasing scale of the deep learning model,the requirements for storage and compu... Deep learning algorithms have been widely used in computer vision,natural language processing and other fields.However,due to the ever-increasing scale of the deep learning model,the requirements for storage and computing performance are getting higher and higher,and the processors based on the von Neumann architecture have gradually exposed significant shortcomings such as consumption and long latency.In order to alleviate this problem,large-scale processing systems are shifting from a traditional computing-centric model to a data-centric model.A near-memory computing array architecture based on the shared buffer is proposed in this paper to improve system performance,which supports instructions with the characteristics of store-calculation integration,reducing the data movement between the processor and main memory.Through data reuse,the processing speed of the algorithm is further improved.The proposed architecture is verified and tested through the parallel realization of the convolutional neural network(CNN)algorithm.The experimental results show that at the frequency of 110 MHz,the calculation speed of a single convolution operation is increased by 66.64%on average compared with the CNN architecture that performs parallel calculations on field programmable gate array(FPGA).The processing speed of the whole convolution layer is improved by 8.81%compared with the reconfigurable array processor that does not support near-memory computing. 展开更多
关键词 near-memory computing shared buffer reconfigurable array processor convolutional neural network(CNN)
下载PDF
Principle and architecture of parallel reconfiguration circuit for ternary optical computer 被引量:3
11
作者 欧阳山 金翊 +1 位作者 周裕 王宏健 《Journal of Shanghai University(English Edition)》 CAS 2011年第5期397-404,共8页
Reconfiguration is the key to produce an applicable ternary optical computer (TOC). The method to implement the reconfiguration function determines whether a TOC can step into applied fields or not. In this work, a ... Reconfiguration is the key to produce an applicable ternary optical computer (TOC). The method to implement the reconfiguration function determines whether a TOC can step into applied fields or not. In this work, a design of the reconfiguration circuit based on field programmable gates array (FPGA) is proposed, and the structure of the entire hardware system is discussed. 展开更多
关键词 reconfiguration circuit ternary optical computer (TOC) field programmable gates array (FPGA)
下载PDF
Reconfigurable Mott electronics for homogeneous neuromorphic platform
12
作者 杨振 路英明 杨玉超 《Chinese Physics B》 SCIE EI CAS CSCD 2023年第12期67-72,共6页
To simplify the fabrication process and increase the versatility of neuromorphic systems,the reconfiguration concept has attracted much attention.Here,we developed a novel electrochemical VO_(2)(EC-VO_(2))device,which... To simplify the fabrication process and increase the versatility of neuromorphic systems,the reconfiguration concept has attracted much attention.Here,we developed a novel electrochemical VO_(2)(EC-VO_(2))device,which can be reconfigured as synapses or LIF neurons.The ionic dynamic doping contributed to the resistance changes of VO_(2),which enables the reversible modulation of device states.The analog resistance switching and tunable LIF functions were both measured based on the same device to demonstrate the capacity of reconfiguration.Based on the reconfigurable EC-VO_(2),the simulated spiking neural network model exhibited excellent performances by using low-precision weights and tunable output neurons,whose final accuracy reached 91.92%. 展开更多
关键词 Mott electronics reconfigurable neuromorphic computing VO_(2)
下载PDF
Large-Eddy Simulation of Airflow over a Steep, Three-Dimensional Isolated Hill with Multi-GPUs Computing
13
作者 Takanori Uchida 《Open Journal of Fluid Dynamics》 2018年第4期416-434,共19页
The present research attempted a Large-Eddy Simulation (LES) of airflow over a steep, three-dimensional isolated hill by using the latest multi-cores multi-CPUs systems. As a result, it was found that 1) turbulence si... The present research attempted a Large-Eddy Simulation (LES) of airflow over a steep, three-dimensional isolated hill by using the latest multi-cores multi-CPUs systems. As a result, it was found that 1) turbulence simulations using approximately 50 million grid points are feasible and 2) the use of this system resulted in the achievement of a high computation speed, which exceeded the speed of parallel computation attained by a single CPU on one of the latest supercomputers. Furthermore, LES was conducted by using the multi-GPUs systems. The results of these simulations revealed the following findings: 1) the multi-GPUs environment which used the NVDIA? Tesla M2090 or the M2075 could simulate turbulence in a model with as many as approximately 50 million grid points. 2) The computation speed achieved by the multi-GPUs environments exceeded that by parallel computation which used four to six CPUs of one of the latest supercomputers. 展开更多
关键词 LES ISOLATED HILL multi-cores Multi-CPUs computing Multi-GPUs computing
下载PDF
Reconfigurable IP Core Architecture of IEEE802.3 for Xilinx Spartan 3AN FPGA
14
作者 Wael M EI-Medany 《通讯和计算机(中英文版)》 2013年第2期264-269,共6页
关键词 IEEE802 3 FPGA 可重构 超高速集成电路硬件描述语言 IP核心 现场可编程门阵列 无线收发系统 IP核设计
下载PDF
Configuration Reusing in On-Line Task Scheduling for Reconfigurable Computing Systems 被引量:1
15
作者 Maisam Mansub Bassiri Hadi Shahriar Shahhoseini 《Journal of Computer Science & Technology》 SCIE EI CSCD 2011年第3期463-473,共11页
Reconfigurable computing systems can be reconfigured at runtime and support partial reconfigurability which makes us able to execute tasks in a true multitasking manner. To manage such systems at runtime, a reconfigur... Reconfigurable computing systems can be reconfigured at runtime and support partial reconfigurability which makes us able to execute tasks in a true multitasking manner. To manage such systems at runtime, a reconfigurable operating system is needed. The main part of this operating system is resource management unit which performs on-line scheduling and placement of hardware tasks at runtime. Reconfiguration overhead is an important obstacle that limits the performance of on-line scheduling algorithms in reconfigurable computing systems and increases the overall execution time. Configuration reusing (task reusing) can decrease reconfiguration overhead considerably, particularly in periodic applications or the applications in which the probability of tasks recurrence is high. In this paper, we present a technique called reusing-based scheduling (RBS), for on-line scheduling and placement in which configuration reusing is considered as a main characteristic in order to reduce reconfiguration overhead and decrease total execution time of the tasks. Several experiments have been conducted on the proposed algorithm. Obtained results show considerable improvement in overall execution time of the tasks. 展开更多
关键词 reconfigurable computing on-line scheduling configuration reusing RPU partitioning replacement manage-ment probability of recurrence
原文传递
Reconfigurable memristor based on SrTiO_(3) thin-film for neuromorphic computing 被引量:3
16
作者 Xiaobing Yan Xu Han +12 位作者 Ziliang Fang Zhen Zhao Zixuan Zhang Jiameng Sun Yiduo Shao Yinxing Zhang Lulu Wang Shiqing Sun Zhenqiang Guo Xiaotong Jia Yupeng Zhang Zhiyuan Guan Tuo Shi 《Frontiers of physics》 SCIE CSCD 2023年第6期211-220,共10页
Neuromorphic computing aims to achieve artificial intelligence by mimicking the mechanisms of biological neurons and synapses that make up the human brain.However,the possibility of using one reconfigurable memristor ... Neuromorphic computing aims to achieve artificial intelligence by mimicking the mechanisms of biological neurons and synapses that make up the human brain.However,the possibility of using one reconfigurable memristor as both artificial neuron and synapse still requires intensive research in detail.In this work,Ag/SrTiO_(3)(STO)/Pt memristor with low operating voltage is manufactured and reconfigurable as both neuron and synapse for neuromorphic computing chip.By modulating the compliance current,two types of resistance switching,volatile and nonvolatile,can be obtained in amorphous STO thin film.This is attributed to the manipulation of the Ag conductive filament.Furthermore,through regulating electrical pulses and designing bionic circuits,the neuronal functions of leaky integrate and fire,as well as synaptic biomimicry with spike-timing-dependent plasticity and paired-pulse facilitation neural regulation,are successfully realized.This study shows that the reconfigurable devices based on STO thin film are promising for the application of neuromorphic computing systems. 展开更多
关键词 Ag/STO/Pt reconfigurable memristor volatile and nonvolatile coexistence neuron circuit synaptic biomimicry neuromorphic computing
原文传递
Parallel computing of discrete element method on multi-core processors 被引量:6
17
作者 Yusuke Shigeto Mikio Sakai 《Particuology》 SCIE EI CAS CSCD 2011年第4期398-405,共8页
This paper describes parallel simulation techniques for the discrete element method (DEM) on multi-core processors. Recently, multi-core CPU and GPU processors have attracted much attention in accelerating computer ... This paper describes parallel simulation techniques for the discrete element method (DEM) on multi-core processors. Recently, multi-core CPU and GPU processors have attracted much attention in accelerating computer simulations in various fields. We propose a new algorithm for multi-thread parallel computation of DEM, which makes effective use of the available memory and accelerates the computation. This study shows that memory usage is drastically reduced by using this algorithm. To show the practical use of DEM in industry, a large-scale powder system is simulated with a complicated drive unit. We compared the performance of the simulation between the latest GPU and CPU processors with optimized programs for each processor. The results show that the difference in performance is not substantial when using either GPUs or CPUs with a multi-thread parallel algorithm. In addition, DEM algorithm is shown to have high scalabilitv in a multi-thread parallel computation on a CPU. 展开更多
关键词 Discrete element method Parallel computing multi-core processor GPGPU
原文传递
A Partitioning Methodology That Optimizes the Communication Cost for Reconfigurable Computing Systems
18
作者 Ramzi Ayadi Bouraoui Ouni Abdellatif Mtibaa 《International Journal of Automation and computing》 EI 2012年第3期280-287,共8页
This paper focuses on the design process for reconfigurable architecture. Our contribution focuses on introducing a new temporal partitioning algorithm. Our algorithm is based on typical mathematic flow to solve the t... This paper focuses on the design process for reconfigurable architecture. Our contribution focuses on introducing a new temporal partitioning algorithm. Our algorithm is based on typical mathematic flow to solve the temporal partitioning problem. This algorithm optimizes the transfer of data required between design partitions and the reconfiguration overhead. Results show that our algorithm considerably decreases the communication cost and the latency compared with other well known algorithms. 展开更多
关键词 Temporal partitioning data flow graph communication cost reconfigurable computing systems field-programmable gate array (FPGA)
原文传递
A coarse-grained reconfigurable computing architecture with loop self-pipelining
19
作者 DOU Yong WU GuiMing XU dinHui ZHOU XingMing 《Science in China(Series F)》 2009年第4期575-587,共13页
Reconfigurable computing tries to achieve the balance between high efficiency of custom computing and flexibility of general-purpose computing. This paper presents the implementation techniques in LEAP, a coarse-grain... Reconfigurable computing tries to achieve the balance between high efficiency of custom computing and flexibility of general-purpose computing. This paper presents the implementation techniques in LEAP, a coarse-grained reconfigurable array, and proposes a speculative execution mechanism for dynamic loop scheduling with the goal of one iteration per cycle and implementation techniques to support decoupling synchronization between the token generator and the collector. This paper also in- troduces the techniques of exploiting both data dependences of intra- and inter-iteration, with the help of two instructions for special data reuses in the loop-carried dependences. The experimental results show that the number of memory accesses reaches on average 3% of an RISC processor simulator with no memory optimization. In a practical image matching application, LEAP architecture achieves about 34 times of speedup in execution cycles, compared with general-purpose processors. 展开更多
关键词 reconfigurable computing loop pipelining data driven register promotion
原文传递
基于可重构硅光滤波器的计算重建片上光谱仪
20
作者 张赞 黄北举 陈弘达 《物理学报》 SCIE EI CAS CSCD 北大核心 2024年第14期34-42,共9页
相比于笨重的台式光谱仪,集成化的芯片级光谱仪可以应用于便携式的健康监测、环境检测等场景.我们设计了一个基于硅光子平台的片上光谱仪.该器件由一个透射光谱可重构的硅光滤波器构成.通过改变滤波器的透射光谱,可以实现对输入光谱的... 相比于笨重的台式光谱仪,集成化的芯片级光谱仪可以应用于便携式的健康监测、环境检测等场景.我们设计了一个基于硅光子平台的片上光谱仪.该器件由一个透射光谱可重构的硅光滤波器构成.通过改变滤波器的透射光谱,可以实现对输入光谱的多次且不同的采样.再结合人工神经网络算法,从采样后的信号中重建出入射光谱.可重构的硅光滤波器由互相耦合的马赫曾德干涉仪和微环谐振腔组成.采用集成的热光相移器引入相位变化,能够对滤波器的透射光谱进行重构.通过这种方式,基于单个可重构滤波器可得到包含宽、窄光谱多样特征的响应函数.不需要滤波器阵列,就可以实现对入射光谱的多样化采样,能够显著地减小光谱检测器件的面积.仿真结果表明,所设计的器件在1500—1600 nm波长范围内可以实现连续光谱和稀疏光谱的重建,分辨率约为0.2 nm.该器件在可穿戴光学传感、便携式光谱仪等场景中具有巨大的应用潜力. 展开更多
关键词 计算光谱重建 可重构硅光滤波器 深度学习 光子集成回路
下载PDF
上一页 1 2 21 下一页 到第
使用帮助 返回顶部