模型深度的不断增加和处理序列长度的不一致对循环神经网络在不同处理器上的性能优化提出巨大挑战。针对自主研制的长向量处理器FT-M7032,实现了一个高效的循环神经网络加速引擎。该引擎采用行优先矩阵向量乘算法和数据感知的多核并行方...模型深度的不断增加和处理序列长度的不一致对循环神经网络在不同处理器上的性能优化提出巨大挑战。针对自主研制的长向量处理器FT-M7032,实现了一个高效的循环神经网络加速引擎。该引擎采用行优先矩阵向量乘算法和数据感知的多核并行方式,提高矩阵向量乘的计算效率;采用两级内核融合优化方法降低临时数据传输的开销;采用手写汇编优化多种算子,进一步挖掘长向量处理器的性能潜力。实验表明,长向量处理器循环神经网络推理引擎可获得较高性能,相较于多核ARM CPU以及Intel Golden CPU,类循环神经网络模型长短记忆网络可获得最高62.68倍和3.12倍的性能加速。展开更多
A seven-core photonic crystal fiber (PCF) is fabricated and shown to possess a Gaussian-like far-field-intensity distribution. The seven-core PCF, designed with double-cladding structure and zero dispersion waveleng...A seven-core photonic crystal fiber (PCF) is fabricated and shown to possess a Gaussian-like far-field-intensity distribution. The seven-core PCF, designed with double-cladding structure and zero dispersion wavelength at 927nm, is utilized to build up a 104 W all-fiber-integrated supercontinuum (SC) source with total conversion efficiency up to 74.3%. The average output power of SC can be further sealed based on this multi-core PCF.展开更多
Cylindrical vector beams(CVBs),with non-uniform state of polarizations,have become an indispensable tool in many areas of science and technology.However,little research has explored high power CVBs at the femtosecond ...Cylindrical vector beams(CVBs),with non-uniform state of polarizations,have become an indispensable tool in many areas of science and technology.However,little research has explored high power CVBs at the femtosecond regime.In this paper,we report on tne generation of high quality CVBs with high peak power and femtosecond pulse duration in a fiber chirped-pulse amplification laser system.The radially(azimuthally)polarized vector beam has been obtained with a pulse duration of 4A0 fs[430 fs]and a maximum average output power of 20.36 W[20.12 W].The maximum output pulse energy is〜20μJ at a repetition rate of 1 MHz,corresponding to a high peak power of-46 MW.The comparison between simulated intensity profiles and measured experimental results suggests that the generated CVBs have a remarkable intensity distribution.The proposed configuration of our laser system provides a promising solution for high quality CVBs generation with the characteristics of high peak power,ultrashort pulse duration,and high mode purity.展开更多
文摘模型深度的不断增加和处理序列长度的不一致对循环神经网络在不同处理器上的性能优化提出巨大挑战。针对自主研制的长向量处理器FT-M7032,实现了一个高效的循环神经网络加速引擎。该引擎采用行优先矩阵向量乘算法和数据感知的多核并行方式,提高矩阵向量乘的计算效率;采用两级内核融合优化方法降低临时数据传输的开销;采用手写汇编优化多种算子,进一步挖掘长向量处理器的性能潜力。实验表明,长向量处理器循环神经网络推理引擎可获得较高性能,相较于多核ARM CPU以及Intel Golden CPU,类循环神经网络模型长短记忆网络可获得最高62.68倍和3.12倍的性能加速。
基金Supported by the National High-Technology Research and Development Program of China under Grant No 2013AA031501the National Natural Science Foundation of China under Grant Nos 61575075,61535009 and 61275144
文摘A seven-core photonic crystal fiber (PCF) is fabricated and shown to possess a Gaussian-like far-field-intensity distribution. The seven-core PCF, designed with double-cladding structure and zero dispersion wavelength at 927nm, is utilized to build up a 104 W all-fiber-integrated supercontinuum (SC) source with total conversion efficiency up to 74.3%. The average output power of SC can be further sealed based on this multi-core PCF.
基金This work was supported by the National Natural Science Foundation of China(NSFC)(Nos.61905148,61805278,and 61775146)Equipment Pre-research Field Foundation(No.61404140304)+1 种基金China Postdoctoral Science Foundation(No.2018M633704)State Key Laboratory of Advanced Optical Communication Systems and Networks.
文摘Cylindrical vector beams(CVBs),with non-uniform state of polarizations,have become an indispensable tool in many areas of science and technology.However,little research has explored high power CVBs at the femtosecond regime.In this paper,we report on tne generation of high quality CVBs with high peak power and femtosecond pulse duration in a fiber chirped-pulse amplification laser system.The radially(azimuthally)polarized vector beam has been obtained with a pulse duration of 4A0 fs[430 fs]and a maximum average output power of 20.36 W[20.12 W].The maximum output pulse energy is〜20μJ at a repetition rate of 1 MHz,corresponding to a high peak power of-46 MW.The comparison between simulated intensity profiles and measured experimental results suggests that the generated CVBs have a remarkable intensity distribution.The proposed configuration of our laser system provides a promising solution for high quality CVBs generation with the characteristics of high peak power,ultrashort pulse duration,and high mode purity.