The Moon provides a unique environment for investigating nearby astrophysical events such as supernovae.Lunar samples retain valuable information from these events,via detectable long-lived“fingerprint”radionuclides...The Moon provides a unique environment for investigating nearby astrophysical events such as supernovae.Lunar samples retain valuable information from these events,via detectable long-lived“fingerprint”radionuclides such as^(60)Fe.In this work,we stepped up the development of an accelerator mass spectrometry(AMS)method for detecting^(60)Fe using the HI-13tandem accelerator at the China Institute of Atomic Energy(CIAE).Since interferences could not be sufficiently removed solely with the existing magnetic systems of the tandem accelerator and the following Q3D magnetic spectrograph,a Wien filter with a maximum voltage of±60 kV and a maximum magnetic field of 0.3 T was installed after the accelerator magnetic systems to lower the detection background for the low abundance nuclide^(60)Fe.A 1μm thick Si_(3)N_(4) foil was installed in front of the Q3D as an energy degrader.For particle detection,a multi-anode gas ionization chamber was mounted at the center of the focal plane of the spectrograph.Finally,an^(60)Fe sample with an abundance of 1.125×10^(-10)was used to test the new AMS system.These results indicate that^(60)Fe can be clearly distinguished from the isobar^(60)Ni.The sensitivity was assessed to be better than 4.3×10^(-14)based on blank sample measurements lasting 5.8 h,and the sensitivity could,in principle,be expected to be approximately 2.5×10^(-15)when the data were accumulated for 100 h,which is feasible for future lunar sample measurements because the main contaminants were sufficiently separated.展开更多
An intense laser pulse focused onto a plasma can excite nonlinear plasma waves.Under appropriate conditions,electrons from the background plasma are trapped in the plasma wave and accelerated to ultra-relativistic vel...An intense laser pulse focused onto a plasma can excite nonlinear plasma waves.Under appropriate conditions,electrons from the background plasma are trapped in the plasma wave and accelerated to ultra-relativistic velocities.This scheme is called a laser wakefield accelerator.In this work,we present results from a laser wakefield acceleration experiment using a petawatt-class laser to excite the wakefields as well as nanoparticles to assist the injection of electrons into the accelerating phase of the wakefields.We find that a 10-cm-long,nanoparticle-assisted laser wakefield accelerator can generate 340 pC,10±1.86 GeV electron bunches with a 3.4 GeV rms convolved energy spread and a 0.9 mrad rms divergence.It can also produce bunches with lower energies in the 4–6 GeV range.展开更多
We present a first on-chip positron accelerator based on dielectric laser acceleration.This innovative approach significantly reduces the physical dimensions of the positron acceleration apparatus,enhancing its feasib...We present a first on-chip positron accelerator based on dielectric laser acceleration.This innovative approach significantly reduces the physical dimensions of the positron acceleration apparatus,enhancing its feasibility for diverse applications.By utilizing a stacked acceleration structure and far-infrared laser technology,we are able to achieve a seven-stage acceleration structure that surpasses the distance and energy gain of using the previous dielectric laser acceleration methods.Additionally,we are able to compress the positron beam to an ultrafast sub-femtosecond scale during the acceleration process,compared with the traditional methods,the positron beam is compressed to a greater extent.We also demonstrate the robustness of the stacked acceleration structure through the successful acceleration of the positron beam.展开更多
Prompt radiation emitted during accelerator operation poses a significant health risk,necessitating a thorough search and securing of hazardous areas prior to initiation.Currently,manual sweep methods are employed.How...Prompt radiation emitted during accelerator operation poses a significant health risk,necessitating a thorough search and securing of hazardous areas prior to initiation.Currently,manual sweep methods are employed.However,the limitations of manual sweeps have become increasingly evident with the implementation of large-scale accelerators.By leveraging advancements in machine vision technology,the automatic identification of stranded personnel in controlled areas through camera imagery presents a viable solution for efficient search and security.Given the criticality of personal safety for stranded individuals,search and security processes must be sufficiently reliable.To ensure comprehensive coverage,180°camera groups were strategically positioned on both sides of the accelerator tunnel to eliminate blind spots within the monitoring range.The YOLOV8 network model was modified to enable the detection of small targets,such as hands and feet,as well as larger targets formed by individuals near the cameras.Furthermore,the system incorporates a pedestrian recognition model that detects human body parts,and an information fusion strategy is used to integrate the detected head,hands,and feet with the identified pedestrians as a cohesive unit.This strategy enhanced the capability of the model to identify pedestrians obstructed by equipment,resulting in a notable improvement in the recall rate.Specifically,recall rates of 0.915 and 0.82were obtained for Datasets 1 and 2,respectively.Although there was a slight decrease in accuracy,it aligned with the intended purpose of the search-and-secure software design.Experimental tests conducted within an accelerator tunnel demonstrated the effectiveness of this approach in achieving reliable recognition outcomes.展开更多
In recent years,heavy ion accelerator technology has been rapidly developing worldwide and widely applied in the fields of space radiation simulation and particle therapy.Usually,a very high uniformity in the irradiat...In recent years,heavy ion accelerator technology has been rapidly developing worldwide and widely applied in the fields of space radiation simulation and particle therapy.Usually,a very high uniformity in the irradiation area is required for the extracted ion beams,which is crucial because it directly affects the experimental precision and therapeutic effect.Specifically,ultra-large-area and high-uniformity scanning are crucial requirements for spacecraft radiation effects assessment and serve as core specification for beamline terminal design.In the 300 MeV proton and heavy ion accelerator complex at the Space Environment Simulation and Research Infrastructure(SESRI),proton and heavy ion beams will be accelerated and ultimately delivered to three irradiation terminals.In order to achieve the required large irradiation area of 320 mm×320 mm,horizontal and vertical scanning magnets are used in the extraction beam line.However,considering the various requirements for beam species and energies,the tracking accuracy of power supplies(PSs),the eddy current effect of scanning magnets,and the fluctuation of ion bunch structure will reduce the irradiation uniformity.To mitigate these effects,a beam uniformity optimization method based on the measured beam distribution was proposed and applied in the accelerator complex at SESRI.In the experiment,the uniformity is successfully optimized from 75%to over 90%after five iterations of adjustment to the PS waveforms.In this paper,the method and experimental results were introduced.展开更多
Molecular Dynamics(MD)simulation for computing Interatomic Potential(IAP)is a very important High-Performance Computing(HPC)application.MD simulation on particles of experimental relevance takes huge computation time,...Molecular Dynamics(MD)simulation for computing Interatomic Potential(IAP)is a very important High-Performance Computing(HPC)application.MD simulation on particles of experimental relevance takes huge computation time,despite using an expensive high-end server.Heterogeneous computing,a combination of the Field Programmable Gate Array(FPGA)and a computer,is proposed as a solution to compute MD simulation efficiently.In such heterogeneous computation,communication between FPGA and Computer is necessary.One such MD simulation,explained in the paper,is the(Artificial Neural Network)ANN-based IAP computation of gold(Au_(147)&Au_(309))nanoparticles.MD simulation calculates the forces between atoms and the total energy of the chemical system.This work proposes the novel design and implementation of an ANN IAP-based MD simulation for Au_(147)&Au_(309) using communication protocols,such as Universal Asynchronous Receiver-Transmitter(UART)and Ethernet,for communication between the FPGA and the host computer.To improve the latency of MD simulation through heterogeneous computing,Universal Asynchronous Receiver-Transmitter(UART)and Ethernet communication protocols were explored to conduct MD simulation of 50,000 cycles.In this study,computation times of 17.54 and 18.70 h were achieved with UART and Ethernet,respectively,compared to the conventional server time of 29 h for Au_(147) nanoparticles.The results pave the way for the development of a Lab-on-a-chip application.展开更多
Massive computational complexity and memory requirement of artificial intelligence models impede their deploy-ability on edge computing devices of the Internet of Things(IoT).While Power-of-Two(PoT)quantization is pro...Massive computational complexity and memory requirement of artificial intelligence models impede their deploy-ability on edge computing devices of the Internet of Things(IoT).While Power-of-Two(PoT)quantization is pro-posed to improve the efficiency for edge inference of Deep Neural Networks(DNNs),existing PoT schemes require a huge amount of bit-wise manipulation and have large memory overhead,and their efficiency is bounded by the bottleneck of computation latency and memory footprint.To tackle this challenge,we present an efficient inference approach on the basis of PoT quantization and model compression.An integer-only scalar PoT quantization(IOS-PoT)is designed jointly with a distribution loss regularizer,wherein the regularizer minimizes quantization errors and training disturbances.Additionally,two-stage model compression is developed to effectively reduce memory requirement,and alleviate bandwidth usage in communications of networked heterogenous learning systems.The product look-up table(P-LUT)inference scheme is leveraged to replace bit-shifting with only indexing and addition operations for achieving low-latency computation and implementing efficient edge accelerators.Finally,comprehensive experiments on Residual Networks(ResNets)and efficient architectures with Canadian Institute for Advanced Research(CIFAR),ImageNet,and Real-world Affective Faces Database(RAF-DB)datasets,indicate that our approach achieves 2×∼10×improvement in the reduction of both weight size and computation cost in comparison to state-of-the-art methods.A P-LUT accelerator prototype is implemented on the Xilinx KV260 Field Programmable Gate Array(FPGA)platform for accelerating convolution operations,with performance results showing that P-LUT reduces memory footprint by 1.45×,achieves more than 3×power efficiency and 2×resource efficiency,compared to the conventional bit-shifting scheme.展开更多
A compact 10 MeV S-band irradiation electron linear accelerator(linac)was developed to simulate electronic radiation in outer space and perform electron irradiation effect tests on spacecraft materials and devices.Acc...A compact 10 MeV S-band irradiation electron linear accelerator(linac)was developed to simulate electronic radiation in outer space and perform electron irradiation effect tests on spacecraft materials and devices.According to the requirements of space environment simulation,the electron beam energy can be adjusted in the range from 3.5 to 10 MeV,and the average current can be adjusted in the range from 0.1 to 1 mA.The linac should be capable of providing beam irradiation over a large area of 1 m^(2) with a uniformity greater than 90% and a scanning rate of 100 Hz.A novel method was applied to achieve such a high beam scanning rate by combining a kicker and a scanning magnet.Based on this requirement,a design for the10 MeV linac is proposed with an RF power pulse repetition rate of 500 Hz;it includes a thermal cathode electron gun,a bunching-accelerating section,and a scanning transport line.The detailed physical design and dynamic simulation results of the proposed 10 MeV electron linac are presented in this paper.展开更多
Quantized training has been proven to be a prominent method to achieve deep neural network training under limited computational resources.It uses low bit-width arithmetics with a proper scaling factor to achieve negli...Quantized training has been proven to be a prominent method to achieve deep neural network training under limited computational resources.It uses low bit-width arithmetics with a proper scaling factor to achieve negligible accuracy loss.Cambricon-Q is the ASIC design proposed to efficiently support quantized training,and achieves significant performance improvement.However,there are still two caveats in the design.First,Cambricon-Q with different hardware specifications may lead to different numerical errors,resulting in non-reproducible behaviors which may become a major concern in critical applications.Second,Cambricon-Q cannot leverage data sparsity,where considerable cycles could still be squeezed out.To address the caveats,the acceleration core of Cambricon-Q is redesigned to support fine-grained irregular data processing.The new design not only enables acceleration on sparse data,but also enables performing local dynamic quantization by contiguous value ranges(which is hardware independent),instead of contiguous addresses(which is dependent on hardware factors).Experimental results show that the accuracy loss of the method still keeps negligible,and the accelerator achieves 1.61×performance improvement over Cambricon-Q,with about 10%energy increase.展开更多
With the rapid development of deep learning algorithms,the computational complexity and functional diversity are increasing rapidly.However,the gap between high computational density and insufficient memory bandwidth ...With the rapid development of deep learning algorithms,the computational complexity and functional diversity are increasing rapidly.However,the gap between high computational density and insufficient memory bandwidth under the traditional von Neumann architecture is getting worse.Analyzing the algorithmic characteristics of convolutional neural network(CNN),it is found that the access characteristics of convolution(CONV)and fully connected(FC)operations are very different.Based on this feature,a dual-mode reronfigurable distributed memory architecture for CNN accelerator is designed.It can be configured in Bank mode or first input first output(FIFO)mode to accommodate the access needs of different operations.At the same time,a programmable memory control unit is designed,which can effectively control the dual-mode configurable distributed memory architecture by using customized special accessing instructions and reduce the data accessing delay.The proposed architecture is verified and tested by parallel implementation of some CNN algorithms.The experimental results show that the peak bandwidth can reach 13.44 GB·s^(-1)at an operating frequency of 120 MHz.This work can achieve 1.40,1.12,2.80 and 4.70 times the peak bandwidth compared with the existing work.展开更多
The flexibility in radiotherapy can be improved if patients can be moved between any one of the department’s medical linear accelerators (LINACs) without the need to change anything in the patient’s treatment plan. ...The flexibility in radiotherapy can be improved if patients can be moved between any one of the department’s medical linear accelerators (LINACs) without the need to change anything in the patient’s treatment plan. For this to be possible, the dosimetric characteristics of the various accelerators must be the same, or nearly the same. The purpose of this work is to describe further and compare measurements and parameters after the initial vendor-recommended beam matching of the five LINACs. Deviations related to dose calculations and to beam matched accelerators may compromise treatment accuracy. The safest and most practical way to ensure that all accelerators are within clinical acceptable accuracy is to include TPS calculations in the LINACs matching evaluation. Treatment planning system (TPS) was used to create three photons plans with different field sizes 3 × 3 cm, 10 × 10 cm and 25 × 25 cm at a depth of 4.5 cm in Perspex. Calculated TPS plans were sent to Mosaiq to be delivered by five LINACs. TPS plans were compared with five LINACs measurements data using Gamma analyses of 2% and 2 mm. The results suggest that for four out of the five LINACs, there was generally good agreement, less than a 2% deviation between the planned dose distribution and the measured dose distribution. However, one specific LINAC named “Asterix” exhibited a deviation of 2.121% from the planned dose. The results show that all of the LINACs’ performance were within the acceptable deviation and delivering radiation dose consistently and accurately.展开更多
Deep neural networks(DNN)are widely used in image recognition,image classification,and other fields.However,as the model size increases,the DNN hardware accelerators face the challenge of higher area overhead and ener...Deep neural networks(DNN)are widely used in image recognition,image classification,and other fields.However,as the model size increases,the DNN hardware accelerators face the challenge of higher area overhead and energy consumption.In recent years,stochastic computing(SC)has been considered a way to realize deep neural networks and reduce hardware consumption.A probabilistic compensation algorithm is proposed to solve the accuracy problem of stochastic calculation,and a fully parallel neural network accelerator based on a deterministic method is designed.The software simulation results show that the accuracy of the probability compensation algorithm on the CIFAR-10 data set is 95.32%,which is 14.98%higher than that of the traditional SC algorithm.The accuracy of the deterministic algorithm on the CIFAR-10 dataset is 95.06%,which is 14.72%higher than that of the traditional SC algorithm.The results of Very Large Scale Integration Circuit(VLSI)hardware tests show that the normalized energy efficiency of the fully parallel neural network accelerator based on the deterministic method is improved by 31%compared with the circuit based on binary computing.展开更多
In this paper,an improved discharging circuit was proposed to quicken the decay of the current in the drive coil in a reluctance accelerator when the armature reaches the center of the coil.The aim of this is to preve...In this paper,an improved discharging circuit was proposed to quicken the decay of the current in the drive coil in a reluctance accelerator when the armature reaches the center of the coil.The aim of this is to prevent the suck-back effect caused by the residual current in drive coil.The method is adding a reverse charging branch with a small capacitor in the traditional pulsed discharging circuit.The results under the traditional circuit and the improved circuit were compared in a simulation.The experiment then verified the simulations and they had good agreement.Simulation and experiment both demonstrated the improved circuit can effectively prevent the suck-back effect and increase the efficiency.At the voltage of 800 V,an efficiency increase of 36.34% was obtained.展开更多
Developing sulfur cathodes with high catalytic activity on accelerating the sluggish redox kinetics of lithium polysulfides(Li PSs) and unveiling their mechanisms are pivotal for advanced lithium–sulfur(Li–S)batteri...Developing sulfur cathodes with high catalytic activity on accelerating the sluggish redox kinetics of lithium polysulfides(Li PSs) and unveiling their mechanisms are pivotal for advanced lithium–sulfur(Li–S)batteries. Herein, MoS2 is verified to reduce the Gibbs free energy for rate-limiting step of sulfur reduction and the dissociation energy of lithium sulfide(Li2 S) for the first time employing theoretical calculations. The Mo S2 nanosheets coated on mesoporous hollow carbon spheres(MHCS) are then reasonably designed as a sulfur host for high-capacity and long-life Li–S battery, in which MHCS can guarantee the high sulfur loading and fast electron/ion transfer. It is revealed that the shuttle effect is efficiently inhibited because of the boosted conversion of Li PSs. As a result, the coin cell based on the MHCS@Mo S2-S cathode exhibits stable cycling performance maintaining 735.7 mAh g^(-1) after 500 cycles at 1.0 C. More importantly, the pouch cell employing the MHCS@Mo S2-S cathodes achieves high specific capacity of1353.2 m Ah g^(-1) and prominent cycle stability that remaining 960.0 m Ah g^(-1) with extraordinary capacity retention of 79.8% at 0.1 C after 170 cycles. Therefore, this work paves a new avenue for developing practical high specific energy and long-life pouch-type Li–S batteries.展开更多
Recently,due to the availability of big data and the rapid growth of computing power,artificial intelligence(AI)has regained tremendous attention and investment.Machine learning(ML)approaches have been successfully ap...Recently,due to the availability of big data and the rapid growth of computing power,artificial intelligence(AI)has regained tremendous attention and investment.Machine learning(ML)approaches have been successfully applied to solve many problems in academia and in industry.Although the explosion of big data applications is driving the development of ML,it also imposes severe challenges of data processing speed and scalability on conventional computer systems.Computing platforms that are dedicatedly designed for AI applications have been considered,ranging from a complement to von Neumann platforms to a“must-have”and stand-alone technical solution.These platforms,which belong to a larger category named“domain-specific computing,”focus on specific customization for AI.In this article,we focus on summarizing the recent advances in accelerator designs for deep neural networks(DNNs)-that is,DNN accelerators.We discuss various architectures that support DNN executions in terms of computing units,dataflow optimization,targeted network topologies,architectures on emerging technologies,and accelerators for emerging applications.We also provide our visions on the future trend of AI chip designs.展开更多
Heavy-ion collisions are powerful tools for studying hypernuclear physics.We develop a dynamical coalescence model coupled with an ART model(version1.0) to study the production rates of light nuclear clusters and hype...Heavy-ion collisions are powerful tools for studying hypernuclear physics.We develop a dynamical coalescence model coupled with an ART model(version1.0) to study the production rates of light nuclear clusters and hypernuclei in heavy-ion reactions,for instance,the deuteron(d),triton(t),helium(~3He),and hypertriton(_A^3H)in minimum bias(0-80%centrality)~6Li+^(12)C reactions at beam energy of 3.5A GeV.The penalty factor for light clusters is extracted from the yields,and the distributions of 0 angle of particles,which provide direct suggesetions about the location of particle detectors in the near future facility-High Intensity heavy-ion Accelerator Facility(HIAF) are investigated.Our calculation demonstrates that HIAF is suitable for studying hypernuclear physics.展开更多
A novel crystal nucleus-based cement-hardening accelerator was evaluated using various mortar and segment concrete experiments.The mechanism of hardening acceleration was investigated via hydration temperature variati...A novel crystal nucleus-based cement-hardening accelerator was evaluated using various mortar and segment concrete experiments.The mechanism of hardening acceleration was investigated via hydration temperature variation analysis,hydration degree analysis,X-ray diffraction(XRD)and scanning electron microscopy(SEM).In the presence of accelerator,the fluidity loss of mortar was increased after 30 minuites,and a coagulation was also observed.Moreover,based on the image of SEM,the formation of C-S-H gels was enhanced in the early hydration.As a result,the hardening accelerator could significantly boost the early strength of concrete,especially within one day of pouring,and shorten steam curing time to meet the demolding strength.展开更多
How to operate^(82)Sr/^(82)Rb and ^(68)Ge/^(68)Ga generators used in the positron emission tomography scan process is explained, and the importance of ^(82)Sr and ^(68)Ge radionuclides for these generators is revealed...How to operate^(82)Sr/^(82)Rb and ^(68)Ge/^(68)Ga generators used in the positron emission tomography scan process is explained, and the importance of ^(82)Sr and ^(68)Ge radionuclides for these generators is revealed. To produce medical ^(82)Sr and ^(68)Ge by means of a proton accelerator in an irradiation time of 24 h, a proton beam current of250 l A, and an energy range E_(proton)= 100 →5 MeV, the cross sections and the neutron emission spectrum curves of(p,xn) reaction processes on Rb-85, Ga-69 and Ga-71 targets were calculated, and the activities and yields of the product were simulated for the reaction processes. Additionally, the integral yields of the reaction processes were determined via the calculated cross-sectional curves and the mass stopping power obtained from the X-PMSP program. Furthermore, based on the obtained results, the appropriate reaction processes for the production of ^(82)Sr and ^(68)Ge isotopes on Rb-85, Ga-69, and Ga-71 targets are discussed.展开更多
We have developed a conceptual design of a 15-TW pulsed-power accelerator based on the linear-transformer-driver(LTD)architecture described by Stygar[W.A.Stygar et al.,Phys.Rev.ST Accel.Beams 18,110401(2015)].The driv...We have developed a conceptual design of a 15-TW pulsed-power accelerator based on the linear-transformer-driver(LTD)architecture described by Stygar[W.A.Stygar et al.,Phys.Rev.ST Accel.Beams 18,110401(2015)].The driver will allow multiple,high-energy-density experiments per day in a university environment and,at the same time,will enable both fundamental and integrated experiments that are scalable to larger facilities.In this design,many individual energy storage units(bricks),each composed of two capacitors and one switch,directly drive the target load without additional pulse compression.Ten LTD modules in parallel drive the load.Each module consists of 16 LTD cavities connected in series,where each cavity is powered by 22 bricks connected in parallel.This design stores up to 2.75 MJ and delivers up to 15 TW in 100 ns to the constant-impedance,water-insulated radial transmission lines.The transmission lines in turn deliver a peak current as high as 12.5 MA to the physics load.To maximize its experimental value and flexibility,the accelerator is coupled to a modern,multibeam laser facility(four beams with up to 5 kJ in 10 ns and one beam with up to 2.6 kJ in 100 ps or less)that can provide auxiliary heating of the physics load.The lasers also enable advanced diagnostic techniques such as X-ray Thomson scattering and multiframe and three-dimensional radiography.The coupled accelerator-laser facility will be the first of its kind and be capable of conducting unprecedented high-energy-densityephysics experiments.展开更多
The armature is an important part affecting the energy conversion efficiency of a reluctance accelerator.In this paper,six kinds of soft magnetic materials are chosen and four structures are designed for the armature....The armature is an important part affecting the energy conversion efficiency of a reluctance accelerator.In this paper,six kinds of soft magnetic materials are chosen and four structures are designed for the armature.At first,the circuit and magnetic force are theoretically analyzed.Then the armatures with different materials and structures are used in the simulation,and the performances are compared and analyzed.At last,the experiment verifies the theory analysis and simulation design.It is concluded that the saturation flux density and conductivity of the material are the key factors affecting the armature force,and the optimization of armature structure can effectively restrain the eddy current,reduce negative force and improve efficiency.Compared with cutting slits in solid armatures,laminating the sheets radially can reduce the eddy current more efficiently.Although slitting can prevent the eddy current to a certain extent,meanwhile,it will decrease the magnetic force because of the losing of magnetized volume and the surface area.Hence,choosing the high saturation flux density material and making out the armature with radially_laminated sheets will improve the efficiency of the reluctance accelerator.In this paper,the silicon steel radially_laminated armature is a better choice for the armature design of the reluctance accelerator.展开更多
基金supported by the National Natural Science Foundation of China(Nos.12125509,12222514,11961141003,and 12005304)National Key Research and Development Project(No.2022YFA1602301)+1 种基金CAST Young Talent Support Planthe CNNC Science Fund for Talented Young Scholars Continuous support for basic scientific research projects。
文摘The Moon provides a unique environment for investigating nearby astrophysical events such as supernovae.Lunar samples retain valuable information from these events,via detectable long-lived“fingerprint”radionuclides such as^(60)Fe.In this work,we stepped up the development of an accelerator mass spectrometry(AMS)method for detecting^(60)Fe using the HI-13tandem accelerator at the China Institute of Atomic Energy(CIAE).Since interferences could not be sufficiently removed solely with the existing magnetic systems of the tandem accelerator and the following Q3D magnetic spectrograph,a Wien filter with a maximum voltage of±60 kV and a maximum magnetic field of 0.3 T was installed after the accelerator magnetic systems to lower the detection background for the low abundance nuclide^(60)Fe.A 1μm thick Si_(3)N_(4) foil was installed in front of the Q3D as an energy degrader.For particle detection,a multi-anode gas ionization chamber was mounted at the center of the focal plane of the spectrograph.Finally,an^(60)Fe sample with an abundance of 1.125×10^(-10)was used to test the new AMS system.These results indicate that^(60)Fe can be clearly distinguished from the isobar^(60)Ni.The sensitivity was assessed to be better than 4.3×10^(-14)based on blank sample measurements lasting 5.8 h,and the sensitivity could,in principle,be expected to be approximately 2.5×10^(-15)when the data were accumulated for 100 h,which is feasible for future lunar sample measurements because the main contaminants were sufficiently separated.
基金supported by the Air Force Office of Scientific Research Grant No.FA9550-17-1-0264supported by the DOE,Office of Science,Fusion Energy Sciences under Contract No.DE-SC0021125+2 种基金supported by the U.S.Department of Energy Grant No.DESC0011617.D.A.Jarozynski,E.Brunetti,B.Ersfeld,and S.Yoffe would like to acknowledge support from the U.K.EPSRC(Grant Nos.EP/J018171/1 and EP/N028694/1)the European Union’s Horizon 2020 research and innovation program under Grant Agreement No.871124 Laserlab-Europe and EuPRAXIA(Grant No.653782)funded by the N8 research partnership and EPSRC(Grant No.EP/T022167/1).
文摘An intense laser pulse focused onto a plasma can excite nonlinear plasma waves.Under appropriate conditions,electrons from the background plasma are trapped in the plasma wave and accelerated to ultra-relativistic velocities.This scheme is called a laser wakefield accelerator.In this work,we present results from a laser wakefield acceleration experiment using a petawatt-class laser to excite the wakefields as well as nanoparticles to assist the injection of electrons into the accelerating phase of the wakefields.We find that a 10-cm-long,nanoparticle-assisted laser wakefield accelerator can generate 340 pC,10±1.86 GeV electron bunches with a 3.4 GeV rms convolved energy spread and a 0.9 mrad rms divergence.It can also produce bunches with lower energies in the 4–6 GeV range.
基金supported by the National Natural Science Foundation of China(Grant No.11975214).
文摘We present a first on-chip positron accelerator based on dielectric laser acceleration.This innovative approach significantly reduces the physical dimensions of the positron acceleration apparatus,enhancing its feasibility for diverse applications.By utilizing a stacked acceleration structure and far-infrared laser technology,we are able to achieve a seven-stage acceleration structure that surpasses the distance and energy gain of using the previous dielectric laser acceleration methods.Additionally,we are able to compress the positron beam to an ultrafast sub-femtosecond scale during the acceleration process,compared with the traditional methods,the positron beam is compressed to a greater extent.We also demonstrate the robustness of the stacked acceleration structure through the successful acceleration of the positron beam.
文摘Prompt radiation emitted during accelerator operation poses a significant health risk,necessitating a thorough search and securing of hazardous areas prior to initiation.Currently,manual sweep methods are employed.However,the limitations of manual sweeps have become increasingly evident with the implementation of large-scale accelerators.By leveraging advancements in machine vision technology,the automatic identification of stranded personnel in controlled areas through camera imagery presents a viable solution for efficient search and security.Given the criticality of personal safety for stranded individuals,search and security processes must be sufficiently reliable.To ensure comprehensive coverage,180°camera groups were strategically positioned on both sides of the accelerator tunnel to eliminate blind spots within the monitoring range.The YOLOV8 network model was modified to enable the detection of small targets,such as hands and feet,as well as larger targets formed by individuals near the cameras.Furthermore,the system incorporates a pedestrian recognition model that detects human body parts,and an information fusion strategy is used to integrate the detected head,hands,and feet with the identified pedestrians as a cohesive unit.This strategy enhanced the capability of the model to identify pedestrians obstructed by equipment,resulting in a notable improvement in the recall rate.Specifically,recall rates of 0.915 and 0.82were obtained for Datasets 1 and 2,respectively.Although there was a slight decrease in accuracy,it aligned with the intended purpose of the search-and-secure software design.Experimental tests conducted within an accelerator tunnel demonstrated the effectiveness of this approach in achieving reliable recognition outcomes.
基金Supported by National Key R&D Program of China(2019YFA0405400)。
文摘In recent years,heavy ion accelerator technology has been rapidly developing worldwide and widely applied in the fields of space radiation simulation and particle therapy.Usually,a very high uniformity in the irradiation area is required for the extracted ion beams,which is crucial because it directly affects the experimental precision and therapeutic effect.Specifically,ultra-large-area and high-uniformity scanning are crucial requirements for spacecraft radiation effects assessment and serve as core specification for beamline terminal design.In the 300 MeV proton and heavy ion accelerator complex at the Space Environment Simulation and Research Infrastructure(SESRI),proton and heavy ion beams will be accelerated and ultimately delivered to three irradiation terminals.In order to achieve the required large irradiation area of 320 mm×320 mm,horizontal and vertical scanning magnets are used in the extraction beam line.However,considering the various requirements for beam species and energies,the tracking accuracy of power supplies(PSs),the eddy current effect of scanning magnets,and the fluctuation of ion bunch structure will reduce the irradiation uniformity.To mitigate these effects,a beam uniformity optimization method based on the measured beam distribution was proposed and applied in the accelerator complex at SESRI.In the experiment,the uniformity is successfully optimized from 75%to over 90%after five iterations of adjustment to the PS waveforms.In this paper,the method and experimental results were introduced.
文摘Molecular Dynamics(MD)simulation for computing Interatomic Potential(IAP)is a very important High-Performance Computing(HPC)application.MD simulation on particles of experimental relevance takes huge computation time,despite using an expensive high-end server.Heterogeneous computing,a combination of the Field Programmable Gate Array(FPGA)and a computer,is proposed as a solution to compute MD simulation efficiently.In such heterogeneous computation,communication between FPGA and Computer is necessary.One such MD simulation,explained in the paper,is the(Artificial Neural Network)ANN-based IAP computation of gold(Au_(147)&Au_(309))nanoparticles.MD simulation calculates the forces between atoms and the total energy of the chemical system.This work proposes the novel design and implementation of an ANN IAP-based MD simulation for Au_(147)&Au_(309) using communication protocols,such as Universal Asynchronous Receiver-Transmitter(UART)and Ethernet,for communication between the FPGA and the host computer.To improve the latency of MD simulation through heterogeneous computing,Universal Asynchronous Receiver-Transmitter(UART)and Ethernet communication protocols were explored to conduct MD simulation of 50,000 cycles.In this study,computation times of 17.54 and 18.70 h were achieved with UART and Ethernet,respectively,compared to the conventional server time of 29 h for Au_(147) nanoparticles.The results pave the way for the development of a Lab-on-a-chip application.
基金This work was supported by Open Fund Project of State Key Laboratory of Intelligent Vehicle Safety Technology by Grant with No.IVSTSKL-202311Key Projects of Science and Technology Research Programme of Chongqing Municipal Education Commission by Grant with No.KJZD-K202301505+1 种基金Cooperation Project between Chongqing Municipal Undergraduate Universities and Institutes Affiliated to the Chinese Academy of Sciences in 2021 by Grant with No.HZ2021015Chongqing Graduate Student Research Innovation Program by Grant with No.CYS240801.
文摘Massive computational complexity and memory requirement of artificial intelligence models impede their deploy-ability on edge computing devices of the Internet of Things(IoT).While Power-of-Two(PoT)quantization is pro-posed to improve the efficiency for edge inference of Deep Neural Networks(DNNs),existing PoT schemes require a huge amount of bit-wise manipulation and have large memory overhead,and their efficiency is bounded by the bottleneck of computation latency and memory footprint.To tackle this challenge,we present an efficient inference approach on the basis of PoT quantization and model compression.An integer-only scalar PoT quantization(IOS-PoT)is designed jointly with a distribution loss regularizer,wherein the regularizer minimizes quantization errors and training disturbances.Additionally,two-stage model compression is developed to effectively reduce memory requirement,and alleviate bandwidth usage in communications of networked heterogenous learning systems.The product look-up table(P-LUT)inference scheme is leveraged to replace bit-shifting with only indexing and addition operations for achieving low-latency computation and implementing efficient edge accelerators.Finally,comprehensive experiments on Residual Networks(ResNets)and efficient architectures with Canadian Institute for Advanced Research(CIFAR),ImageNet,and Real-world Affective Faces Database(RAF-DB)datasets,indicate that our approach achieves 2×∼10×improvement in the reduction of both weight size and computation cost in comparison to state-of-the-art methods.A P-LUT accelerator prototype is implemented on the Xilinx KV260 Field Programmable Gate Array(FPGA)platform for accelerating convolution operations,with performance results showing that P-LUT reduces memory footprint by 1.45×,achieves more than 3×power efficiency and 2×resource efficiency,compared to the conventional bit-shifting scheme.
文摘A compact 10 MeV S-band irradiation electron linear accelerator(linac)was developed to simulate electronic radiation in outer space and perform electron irradiation effect tests on spacecraft materials and devices.According to the requirements of space environment simulation,the electron beam energy can be adjusted in the range from 3.5 to 10 MeV,and the average current can be adjusted in the range from 0.1 to 1 mA.The linac should be capable of providing beam irradiation over a large area of 1 m^(2) with a uniformity greater than 90% and a scanning rate of 100 Hz.A novel method was applied to achieve such a high beam scanning rate by combining a kicker and a scanning magnet.Based on this requirement,a design for the10 MeV linac is proposed with an RF power pulse repetition rate of 500 Hz;it includes a thermal cathode electron gun,a bunching-accelerating section,and a scanning transport line.The detailed physical design and dynamic simulation results of the proposed 10 MeV electron linac are presented in this paper.
基金the National Key Research and Devecopment Program of China(No.2022YFB4501601)the National Natural Science Foundation of China(No.62102398,U20A20227,62222214,62002338,U22A2028,U19B2019)+1 种基金the Chinese Academy of Sciences Project for Young Scientists in Basic Research(YSBR-029)Youth Innovation Promotion Association Chinese Academy of Sciences。
文摘Quantized training has been proven to be a prominent method to achieve deep neural network training under limited computational resources.It uses low bit-width arithmetics with a proper scaling factor to achieve negligible accuracy loss.Cambricon-Q is the ASIC design proposed to efficiently support quantized training,and achieves significant performance improvement.However,there are still two caveats in the design.First,Cambricon-Q with different hardware specifications may lead to different numerical errors,resulting in non-reproducible behaviors which may become a major concern in critical applications.Second,Cambricon-Q cannot leverage data sparsity,where considerable cycles could still be squeezed out.To address the caveats,the acceleration core of Cambricon-Q is redesigned to support fine-grained irregular data processing.The new design not only enables acceleration on sparse data,but also enables performing local dynamic quantization by contiguous value ranges(which is hardware independent),instead of contiguous addresses(which is dependent on hardware factors).Experimental results show that the accuracy loss of the method still keeps negligible,and the accelerator achieves 1.61×performance improvement over Cambricon-Q,with about 10%energy increase.
基金Supported by the National Key R&D Program of China(No.2022ZD0119001)the National Natural Science Foundation of China(No.61834005,61802304)+1 种基金the Education Department of Shaanxi Province(No.22JY060)the Shaanxi Provincial Key Research and Devel-opment Plan(No.2024GX-YBXM-100)。
文摘With the rapid development of deep learning algorithms,the computational complexity and functional diversity are increasing rapidly.However,the gap between high computational density and insufficient memory bandwidth under the traditional von Neumann architecture is getting worse.Analyzing the algorithmic characteristics of convolutional neural network(CNN),it is found that the access characteristics of convolution(CONV)and fully connected(FC)operations are very different.Based on this feature,a dual-mode reronfigurable distributed memory architecture for CNN accelerator is designed.It can be configured in Bank mode or first input first output(FIFO)mode to accommodate the access needs of different operations.At the same time,a programmable memory control unit is designed,which can effectively control the dual-mode configurable distributed memory architecture by using customized special accessing instructions and reduce the data accessing delay.The proposed architecture is verified and tested by parallel implementation of some CNN algorithms.The experimental results show that the peak bandwidth can reach 13.44 GB·s^(-1)at an operating frequency of 120 MHz.This work can achieve 1.40,1.12,2.80 and 4.70 times the peak bandwidth compared with the existing work.
文摘The flexibility in radiotherapy can be improved if patients can be moved between any one of the department’s medical linear accelerators (LINACs) without the need to change anything in the patient’s treatment plan. For this to be possible, the dosimetric characteristics of the various accelerators must be the same, or nearly the same. The purpose of this work is to describe further and compare measurements and parameters after the initial vendor-recommended beam matching of the five LINACs. Deviations related to dose calculations and to beam matched accelerators may compromise treatment accuracy. The safest and most practical way to ensure that all accelerators are within clinical acceptable accuracy is to include TPS calculations in the LINACs matching evaluation. Treatment planning system (TPS) was used to create three photons plans with different field sizes 3 × 3 cm, 10 × 10 cm and 25 × 25 cm at a depth of 4.5 cm in Perspex. Calculated TPS plans were sent to Mosaiq to be delivered by five LINACs. TPS plans were compared with five LINACs measurements data using Gamma analyses of 2% and 2 mm. The results suggest that for four out of the five LINACs, there was generally good agreement, less than a 2% deviation between the planned dose distribution and the measured dose distribution. However, one specific LINAC named “Asterix” exhibited a deviation of 2.121% from the planned dose. The results show that all of the LINACs’ performance were within the acceptable deviation and delivering radiation dose consistently and accurately.
文摘Deep neural networks(DNN)are widely used in image recognition,image classification,and other fields.However,as the model size increases,the DNN hardware accelerators face the challenge of higher area overhead and energy consumption.In recent years,stochastic computing(SC)has been considered a way to realize deep neural networks and reduce hardware consumption.A probabilistic compensation algorithm is proposed to solve the accuracy problem of stochastic calculation,and a fully parallel neural network accelerator based on a deterministic method is designed.The software simulation results show that the accuracy of the probability compensation algorithm on the CIFAR-10 data set is 95.32%,which is 14.98%higher than that of the traditional SC algorithm.The accuracy of the deterministic algorithm on the CIFAR-10 dataset is 95.06%,which is 14.72%higher than that of the traditional SC algorithm.The results of Very Large Scale Integration Circuit(VLSI)hardware tests show that the normalized energy efficiency of the fully parallel neural network accelerator based on the deterministic method is improved by 31%compared with the circuit based on binary computing.
基金This work was supported by the Fundamental Research Funds for the Central Universities[Grant number 2019XJ01].
文摘In this paper,an improved discharging circuit was proposed to quicken the decay of the current in the drive coil in a reluctance accelerator when the armature reaches the center of the coil.The aim of this is to prevent the suck-back effect caused by the residual current in drive coil.The method is adding a reverse charging branch with a small capacitor in the traditional pulsed discharging circuit.The results under the traditional circuit and the improved circuit were compared in a simulation.The experiment then verified the simulations and they had good agreement.Simulation and experiment both demonstrated the improved circuit can effectively prevent the suck-back effect and increase the efficiency.At the voltage of 800 V,an efficiency increase of 36.34% was obtained.
基金supported by the funding from the Strategy Priority Research Program of Chinese Academy of Science (Grant No. XDA17020404)DICP&QIBEBT (DICP&QIBEBT UN201702)+8 种基金R&D Projects in Key Areas of Guangdong Province (2019B090908001)Science and Technology Innovation Foundation of Dalian (2018J11CY020)Defense Industrial Technology Development Program (JCKY2018130C107)National Natural Science Foundation of China (Grants 51872283)Liao Ning Revitalization Talents Program (Grant XLYC1807153)Natural Science Foundation of Liaoning Province (Grant 20180510038)DICP (DICP ZZBS201708, DICP ZZBS201802)DNL Cooperation FundCAS (DNL180310, DNL180308, DNL201912, and DNL201915)。
文摘Developing sulfur cathodes with high catalytic activity on accelerating the sluggish redox kinetics of lithium polysulfides(Li PSs) and unveiling their mechanisms are pivotal for advanced lithium–sulfur(Li–S)batteries. Herein, MoS2 is verified to reduce the Gibbs free energy for rate-limiting step of sulfur reduction and the dissociation energy of lithium sulfide(Li2 S) for the first time employing theoretical calculations. The Mo S2 nanosheets coated on mesoporous hollow carbon spheres(MHCS) are then reasonably designed as a sulfur host for high-capacity and long-life Li–S battery, in which MHCS can guarantee the high sulfur loading and fast electron/ion transfer. It is revealed that the shuttle effect is efficiently inhibited because of the boosted conversion of Li PSs. As a result, the coin cell based on the MHCS@Mo S2-S cathode exhibits stable cycling performance maintaining 735.7 mAh g^(-1) after 500 cycles at 1.0 C. More importantly, the pouch cell employing the MHCS@Mo S2-S cathodes achieves high specific capacity of1353.2 m Ah g^(-1) and prominent cycle stability that remaining 960.0 m Ah g^(-1) with extraordinary capacity retention of 79.8% at 0.1 C after 170 cycles. Therefore, this work paves a new avenue for developing practical high specific energy and long-life pouch-type Li–S batteries.
基金the National Science Foundations(NSFs)(1822085,1725456,1816833,1500848,1719160,and 1725447)the NSF Computing and Communication Foundations(1740352)+1 种基金the Nanoelectronics COmputing REsearch Program in the Semiconductor Research Corporation(NC-2766-A)the Center for Research in Intelligent Storage and Processing-in-Memory,one of six centers in the Joint University Microelectronics Program,a SRC program sponsored by Defense Advanced Research Projects Agency.
文摘Recently,due to the availability of big data and the rapid growth of computing power,artificial intelligence(AI)has regained tremendous attention and investment.Machine learning(ML)approaches have been successfully applied to solve many problems in academia and in industry.Although the explosion of big data applications is driving the development of ML,it also imposes severe challenges of data processing speed and scalability on conventional computer systems.Computing platforms that are dedicatedly designed for AI applications have been considered,ranging from a complement to von Neumann platforms to a“must-have”and stand-alone technical solution.These platforms,which belong to a larger category named“domain-specific computing,”focus on specific customization for AI.In this article,we focus on summarizing the recent advances in accelerator designs for deep neural networks(DNNs)-that is,DNN accelerators.We discuss various architectures that support DNN executions in terms of computing units,dataflow optimization,targeted network topologies,architectures on emerging technologies,and accelerators for emerging applications.We also provide our visions on the future trend of AI chip designs.
基金supported in part by the Major State Basic Research Development Program in China(Nos.2014CB845401 and2015CB856904)the National Natural Science Foundation of China(Nos.11421505,11520101004,11275250,11322547 and U1232206)Key Program of CAS for the Frontier Science(No.QYZDJ-SSW-SLH002)
文摘Heavy-ion collisions are powerful tools for studying hypernuclear physics.We develop a dynamical coalescence model coupled with an ART model(version1.0) to study the production rates of light nuclear clusters and hypernuclei in heavy-ion reactions,for instance,the deuteron(d),triton(t),helium(~3He),and hypertriton(_A^3H)in minimum bias(0-80%centrality)~6Li+^(12)C reactions at beam energy of 3.5A GeV.The penalty factor for light clusters is extracted from the yields,and the distributions of 0 angle of particles,which provide direct suggesetions about the location of particle detectors in the near future facility-High Intensity heavy-ion Accelerator Facility(HIAF) are investigated.Our calculation demonstrates that HIAF is suitable for studying hypernuclear physics.
基金Funded by Star Program(No.1804QB1403200)from Science and Technology Commission of Shanghai Municipality。
文摘A novel crystal nucleus-based cement-hardening accelerator was evaluated using various mortar and segment concrete experiments.The mechanism of hardening acceleration was investigated via hydration temperature variation analysis,hydration degree analysis,X-ray diffraction(XRD)and scanning electron microscopy(SEM).In the presence of accelerator,the fluidity loss of mortar was increased after 30 minuites,and a coagulation was also observed.Moreover,based on the image of SEM,the formation of C-S-H gels was enhanced in the early hydration.As a result,the hardening accelerator could significantly boost the early strength of concrete,especially within one day of pouring,and shorten steam curing time to meet the demolding strength.
文摘How to operate^(82)Sr/^(82)Rb and ^(68)Ge/^(68)Ga generators used in the positron emission tomography scan process is explained, and the importance of ^(82)Sr and ^(68)Ge radionuclides for these generators is revealed. To produce medical ^(82)Sr and ^(68)Ge by means of a proton accelerator in an irradiation time of 24 h, a proton beam current of250 l A, and an energy range E_(proton)= 100 →5 MeV, the cross sections and the neutron emission spectrum curves of(p,xn) reaction processes on Rb-85, Ga-69 and Ga-71 targets were calculated, and the activities and yields of the product were simulated for the reaction processes. Additionally, the integral yields of the reaction processes were determined via the calculated cross-sectional curves and the mass stopping power obtained from the X-PMSP program. Furthermore, based on the obtained results, the appropriate reaction processes for the production of ^(82)Sr and ^(68)Ge isotopes on Rb-85, Ga-69, and Ga-71 targets are discussed.
文摘We have developed a conceptual design of a 15-TW pulsed-power accelerator based on the linear-transformer-driver(LTD)architecture described by Stygar[W.A.Stygar et al.,Phys.Rev.ST Accel.Beams 18,110401(2015)].The driver will allow multiple,high-energy-density experiments per day in a university environment and,at the same time,will enable both fundamental and integrated experiments that are scalable to larger facilities.In this design,many individual energy storage units(bricks),each composed of two capacitors and one switch,directly drive the target load without additional pulse compression.Ten LTD modules in parallel drive the load.Each module consists of 16 LTD cavities connected in series,where each cavity is powered by 22 bricks connected in parallel.This design stores up to 2.75 MJ and delivers up to 15 TW in 100 ns to the constant-impedance,water-insulated radial transmission lines.The transmission lines in turn deliver a peak current as high as 12.5 MA to the physics load.To maximize its experimental value and flexibility,the accelerator is coupled to a modern,multibeam laser facility(four beams with up to 5 kJ in 10 ns and one beam with up to 2.6 kJ in 100 ps or less)that can provide auxiliary heating of the physics load.The lasers also enable advanced diagnostic techniques such as X-ray Thomson scattering and multiframe and three-dimensional radiography.The coupled accelerator-laser facility will be the first of its kind and be capable of conducting unprecedented high-energy-densityephysics experiments.
基金supported in part by the Fundamental Research Funds for the Central Universities,China[grant number 2682020GF03]in part by the Foundation of Key Laboratory of Magnetic Suspension Technology and Maglev Vehicle,Ministry of Education,China.
文摘The armature is an important part affecting the energy conversion efficiency of a reluctance accelerator.In this paper,six kinds of soft magnetic materials are chosen and four structures are designed for the armature.At first,the circuit and magnetic force are theoretically analyzed.Then the armatures with different materials and structures are used in the simulation,and the performances are compared and analyzed.At last,the experiment verifies the theory analysis and simulation design.It is concluded that the saturation flux density and conductivity of the material are the key factors affecting the armature force,and the optimization of armature structure can effectively restrain the eddy current,reduce negative force and improve efficiency.Compared with cutting slits in solid armatures,laminating the sheets radially can reduce the eddy current more efficiently.Although slitting can prevent the eddy current to a certain extent,meanwhile,it will decrease the magnetic force because of the losing of magnetized volume and the surface area.Hence,choosing the high saturation flux density material and making out the armature with radially_laminated sheets will improve the efficiency of the reluctance accelerator.In this paper,the silicon steel radially_laminated armature is a better choice for the armature design of the reluctance accelerator.