The reconfigurable chip,which integrates the advantages of high performance,high flexibility,high parallelism,low power consumption,and low cost,has achieved rapid development and wide application.Generally,the contro...The reconfigurable chip,which integrates the advantages of high performance,high flexibility,high parallelism,low power consumption,and low cost,has achieved rapid development and wide application.Generally,the control part and the computing part of algorithm is accelerated based on different reconfigurable architectures,but it is difficult to obtain overall performance improvement.For improving efficiency of reconfigurable structure both for the control part and the computing part,a hybrid of instruction-driven and data-driven self-reconfigurable cell array is proposed.On instruction-driven mode,processing element(PE)works like a reduced instruction set computer(RSIC)machine,which is mainly for the control part of algorithm.On data-driven mode,data is calculated by flowing between the preconfigured PEs,which is mainly for the computing of algorithm.For verifying the efficiency of architecture,some high-efficiency video coding(HEVC)video compression algorithms are implemented on the proposed architecture.The proposed architecture has been implemented on Xilinx FPGA Virtex UltraScale VU440 develop board.The same circuitry is able to run at75 MHz.Compared with the architecture that only supports instruction-driven,the proposed architecture has better calculation efficiency.展开更多
CdSe/CdS semiconductor quantum dots co-sensitized TiO2 nanorod array was fabricated on the transparent conductive fluorine-doped tin oxide (FTO) substrate using the hydrothermal and successive ionic layer adsorption...CdSe/CdS semiconductor quantum dots co-sensitized TiO2 nanorod array was fabricated on the transparent conductive fluorine-doped tin oxide (FTO) substrate using the hydrothermal and successive ionic layer adsorption and reaction (SILAR) process. The structural and morphological properties of the samples were characterized by X-ray diffraction (XRD), field-emission scanning electron microscopy (FESEM), and transmission electron microscopy (TEM). The results indicate that CdSe/CdS QDs are uniformly coated on the surface of the TiO2 nanorods. The shift of light absorption edge was monitored by taking UV-visible absorption spectra. Compared with the absorption spectra of the TiO2 nanorod array, deposition of CdSe/CdS QDs shifts the absorption edge to the higher wavelength. The enhanced light absorption in the visible-light region of CdSe/CdS/TiO2 nanorod array indicates that CdSe/CdS layers can act as co-sensitizers in quantum dots sensitized solar cells (QDSSCs). By optimizing the CdSe layer deposition cycles, a photocurrent of 5.78 mA/cm2, an open circuit photovoltage of 0.469 V and a conversion efficiency of 1.34 % were obtained under an illumination of 100 mw/cm2.展开更多
The utilization of computation resources and reconfiguration time has a large impact on reconfiguration system performance. In order to promote the performance, a dynamical self-reconfigurable mechanism for data-drive...The utilization of computation resources and reconfiguration time has a large impact on reconfiguration system performance. In order to promote the performance, a dynamical self-reconfigurable mechanism for data-driven cell array is proposed. Cells can be fired only when the needed data arrives, and cell array can be worked on two modes: fixed execution and reconfiguration. On reconfiguration mode, cell function and data flow direction are changed automatically at run time according to contexts. Simultaneously using an H-tree interconnection network, through pre-storing multiple application mapping contexts in reconfiguration buffer, multiple applications can execute concurrently and context switching time is the minimal. For verifying system performance, some algorithms are selected for mapping onto the proposed structure, and the amount of configuration contexts and execution time are recorded for statistical analysis. The results show that the proposed self-reconfigurable mechanism can reduce the number of contexts efficiently, and has a low computing time.展开更多
In this paper, an improvement has been made on the algorithm of solving the kernels of new functions which are generated after a common divisor appearing in the original functions is replaced by a new intermediate var...In this paper, an improvement has been made on the algorithm of solving the kernels of new functions which are generated after a common divisor appearing in the original functions is replaced by a new intermediate variable. And an efficient method based on kernel heritage is presented. This method has been successfully used in synthesis of LCA (Logic Cell Array).展开更多
Different modalities in biomedical images, like CT, MRI and PET scanners, provide detailed cross-sectional views of human anatomy. This paper introduces three-dimensional brain reconstruction based on CT slices. It co...Different modalities in biomedical images, like CT, MRI and PET scanners, provide detailed cross-sectional views of human anatomy. This paper introduces three-dimensional brain reconstruction based on CT slices. It contains filtering, fuzzy segmentation, matching method of contours, cell array structure and image animation. Experimental results have shown its validity. The innovation is matching method of contours and fuzzy segmentation algorithm of CT slices.展开更多
Row Parallel Coarse-Grained Reconfigurable Architecture(RPCGRA)has the advantages of maximum parallelism and programmable flexibility.Designing an efficient algorithm to map the diverse applications onto RPCGRA is dif...Row Parallel Coarse-Grained Reconfigurable Architecture(RPCGRA)has the advantages of maximum parallelism and programmable flexibility.Designing an efficient algorithm to map the diverse applications onto RPCGRA is difficult due to a number of RPCGRA hardware constraints.To solve this problem,the nodes of the data flow graph must be partitioned and scheduled onto the RPCGRA.In this paper,we present a Depth-First Greedy Mapping(DFGM)algorithm that simultaneously considers the communication costs and the use times of the Reconfigurable Cell Array(RCA).Compared with level breadth mapping,the performance of DFGM is better.The percentage of maximum improvement in the use times of RCA is 33%and the percentage of maximum improvement in non-original input and output times is 64.4%(Given Discrete Cosine Transfor 8(DCT8),and the area of reconfigurable processing unit is 56).Compared with level-based depth mapping,DFGM also obtains the lowest averages of use times of RCA,non-original input and output times,and the reconfigurable time.展开更多
基金Supported by the National Natural Science Foundation of China(No.61802304,61834005,61772417,61634004)the Shaanxi Province Key R&D Plan(No.2021GY-029).
文摘The reconfigurable chip,which integrates the advantages of high performance,high flexibility,high parallelism,low power consumption,and low cost,has achieved rapid development and wide application.Generally,the control part and the computing part of algorithm is accelerated based on different reconfigurable architectures,but it is difficult to obtain overall performance improvement.For improving efficiency of reconfigurable structure both for the control part and the computing part,a hybrid of instruction-driven and data-driven self-reconfigurable cell array is proposed.On instruction-driven mode,processing element(PE)works like a reduced instruction set computer(RSIC)machine,which is mainly for the control part of algorithm.On data-driven mode,data is calculated by flowing between the preconfigured PEs,which is mainly for the computing of algorithm.For verifying the efficiency of architecture,some high-efficiency video coding(HEVC)video compression algorithms are implemented on the proposed architecture.The proposed architecture has been implemented on Xilinx FPGA Virtex UltraScale VU440 develop board.The same circuitry is able to run at75 MHz.Compared with the architecture that only supports instruction-driven,the proposed architecture has better calculation efficiency.
基金National Natural Science Foundation of China(No.11174071)the International Cooperation Project of Wuhan City and Hubei Province(Nos.201070934339 and 2010BFA010)
文摘CdSe/CdS semiconductor quantum dots co-sensitized TiO2 nanorod array was fabricated on the transparent conductive fluorine-doped tin oxide (FTO) substrate using the hydrothermal and successive ionic layer adsorption and reaction (SILAR) process. The structural and morphological properties of the samples were characterized by X-ray diffraction (XRD), field-emission scanning electron microscopy (FESEM), and transmission electron microscopy (TEM). The results indicate that CdSe/CdS QDs are uniformly coated on the surface of the TiO2 nanorods. The shift of light absorption edge was monitored by taking UV-visible absorption spectra. Compared with the absorption spectra of the TiO2 nanorod array, deposition of CdSe/CdS QDs shifts the absorption edge to the higher wavelength. The enhanced light absorption in the visible-light region of CdSe/CdS/TiO2 nanorod array indicates that CdSe/CdS layers can act as co-sensitizers in quantum dots sensitized solar cells (QDSSCs). By optimizing the CdSe layer deposition cycles, a photocurrent of 5.78 mA/cm2, an open circuit photovoltage of 0.469 V and a conversion efficiency of 1.34 % were obtained under an illumination of 100 mw/cm2.
基金the National Natural Science Foundation of China (Nos. 61802304, 61834005, 61772417, 61634004, and 61602377)the Shaanxi Provincial Co-ordination Innovation Project of Science and Technology (No. 2016KTZDGY02-04-02)。
文摘The utilization of computation resources and reconfiguration time has a large impact on reconfiguration system performance. In order to promote the performance, a dynamical self-reconfigurable mechanism for data-driven cell array is proposed. Cells can be fired only when the needed data arrives, and cell array can be worked on two modes: fixed execution and reconfiguration. On reconfiguration mode, cell function and data flow direction are changed automatically at run time according to contexts. Simultaneously using an H-tree interconnection network, through pre-storing multiple application mapping contexts in reconfiguration buffer, multiple applications can execute concurrently and context switching time is the minimal. For verifying system performance, some algorithms are selected for mapping onto the proposed structure, and the amount of configuration contexts and execution time are recorded for statistical analysis. The results show that the proposed self-reconfigurable mechanism can reduce the number of contexts efficiently, and has a low computing time.
文摘In this paper, an improvement has been made on the algorithm of solving the kernels of new functions which are generated after a common divisor appearing in the original functions is replaced by a new intermediate variable. And an efficient method based on kernel heritage is presented. This method has been successfully used in synthesis of LCA (Logic Cell Array).
基金This project was supported by the National Natural Science Foundation of China (69931010).
文摘Different modalities in biomedical images, like CT, MRI and PET scanners, provide detailed cross-sectional views of human anatomy. This paper introduces three-dimensional brain reconstruction based on CT slices. It contains filtering, fuzzy segmentation, matching method of contours, cell array structure and image animation. Experimental results have shown its validity. The innovation is matching method of contours and fuzzy segmentation algorithm of CT slices.
基金the National Natural Science Foundation of China(Nos.61136002,61272120,61634004 and 61602377)the Shaanxi Provincial Co-ordination Innovation Project of Science and Technology(No.2016KTZDGY02-04-02)+1 种基金the Shaanxi Provincial Science and Technology Research Fund(Nos.2013KTZB01-07,2014ZS-08 and S2015TQGY0166)the Fund of Shaanxi Education Bureau(No.2050205)
基金supported by the Natural Science Foundation of Anhui Province(No.1808085MF203)the National Natural Science Foundation of China(No.61432017)。
文摘Row Parallel Coarse-Grained Reconfigurable Architecture(RPCGRA)has the advantages of maximum parallelism and programmable flexibility.Designing an efficient algorithm to map the diverse applications onto RPCGRA is difficult due to a number of RPCGRA hardware constraints.To solve this problem,the nodes of the data flow graph must be partitioned and scheduled onto the RPCGRA.In this paper,we present a Depth-First Greedy Mapping(DFGM)algorithm that simultaneously considers the communication costs and the use times of the Reconfigurable Cell Array(RCA).Compared with level breadth mapping,the performance of DFGM is better.The percentage of maximum improvement in the use times of RCA is 33%and the percentage of maximum improvement in non-original input and output times is 64.4%(Given Discrete Cosine Transfor 8(DCT8),and the area of reconfigurable processing unit is 56).Compared with level-based depth mapping,DFGM also obtains the lowest averages of use times of RCA,non-original input and output times,and the reconfigurable time.