The continental shale reservoirs of Jurassic Lianggaoshan Formation in Sichuan Basin contain thin lamina,which is characterized by strong plasticity and developed longitudinal shell limestone interlayer.To improve the...The continental shale reservoirs of Jurassic Lianggaoshan Formation in Sichuan Basin contain thin lamina,which is characterized by strong plasticity and developed longitudinal shell limestone interlayer.To improve the production efficiency of reservoirs by multi-cluster fracturing,it is necessary to consider the unbalanced propagation of hydraulic fractures and the penetration effect of fractures.This paper constructed a numerical model of multi-fracture propagation and penetration based on the finite element coupling cohesive zone method;considering the construction cluster spacing,pump rate,lamina strength and other parameters studied the influencing factors of multi-cluster fracture interaction propagation;combined with AE energy data and fracture mode reconstruction method,quantitatively characterized the comprehensive impact of the strength of thin interlayer rock interfaces on the initiation and propagation of fractures that penetrate layers,and accurately predicted the propagation pattern of hydraulic fractures through laminated shale oil reservoirs.Simulation results revealed that in the process of multi-cluster fracturing,the proportion of shear damage is low,and mainly occurs in bedding fractures activated by outer fractures.Reducing the cluster spacing enhances the fracture system's penetration ability,though it lowers the activation efficiency of lamina.The high plasticity of the limestone interlayer may impact the vertical propagation distance of the main fracture.Improving the interface strength is beneficial to the reconstruction of the fracture height,but the interface communication effect is limited.Reasonable selection of layers with moderate lamina strength for fracturing stimulation,increasing the pump rate during fracturing and setting the cluster spacing reasonably are beneficial to improve the effect of reservoir stimulation.展开更多
This study conducted temporary plugging and diversion fracturing(TPDF)experiments using a true triaxial fracturing simulation system within a laboratory setting that replicated a lab-based horizontal well completion w...This study conducted temporary plugging and diversion fracturing(TPDF)experiments using a true triaxial fracturing simulation system within a laboratory setting that replicated a lab-based horizontal well completion with multi-cluster sand jetting perforation.The effects of temporary plugging agent(TPA)particle size,TPA concentration,single-cluster perforation number and cluster number on plugging pressure,multi-fracture diversion pattern and distribution of TPAs were investigated.A combination of TPAs with small particle sizes within the fracture and large particle sizes within the segment is conducive to increasing the plugging pressure and promoting the diversion of multi-fractures.The addition of fibers can quickly achieve ultra-high pressure,but it may lead to longitudinal fractures extending along the wellbore.The temporary plugging peak pressure increases with an increase in the concentration of the TPA,reaching a peak at a certain concentration,and further increases do not significantly improve the temporary plugging peak pressure.The breaking pressure and temporary plugging peak pressure show a decreasing trend with an increase in single-cluster perforation number.A lower number of single-cluster perforations is beneficial for increasing the breaking pressure and temporary plugging peak pressure,and it has a more significant control on the propagation of multi-cluster fractures.A lower number of clusters is not conducive to increasing the total number and complexity of artificial fractures,while a higher number of clusters makes it difficult to achieve effective plugging.The TPAs within the fracture is mainly concentrated in the complex fracture areas,especially at the intersections of fractures.Meanwhile,the TPAs within the segment are primarily distributed near the perforation cluster apertures which initiated complex fractures.展开更多
Multistage multi-cluster hydraulic fracturing has enabled the economic exploitation of shale reservoirs,but the interpretation of hydraulic fracture parameters is challenging.The pressure signals after pump shutdown a...Multistage multi-cluster hydraulic fracturing has enabled the economic exploitation of shale reservoirs,but the interpretation of hydraulic fracture parameters is challenging.The pressure signals after pump shutdown are influenced by hydraulic fractures,which can reflect the geometric features of hydraulic fracture.The shutdown pressure can be used to interpret the hydraulic fracture parameters in a real-time and cost-effective manner.In this paper,a mathematical model for shutdown pressure evolution is developed considering the effects of wellbore friction,perforation friction and fluid loss in fractures.An efficient numerical simulation method is established by using the method of characteristics.Based on this method,the impacts of fracture half-length,fracture height,opened cluster and perforation number,and filtration coefficient on the evolution of shutdown pressure are analyzed.The results indicate that a larger fracture half-length may hasten the decay of shutdown pressure,while a larger fracture height can slow down the decay of shutdown pressure.A smaller number of opened clusters and perforations can significantly increase the perforation friction and decrease the overall level of shutdown pressure.A larger filtration coefficient may accelerate the fluid filtration in the fracture and hasten the drop of the shutdown pressure.The simulation method of shutdown pressure,as well as the analysis results,has important implications for the interpretation of hydraulic fracture parameters.展开更多
As wafer circuit widths shrink less than 10 nm,stringent quality control is imposed on the wafer fabrication processes. Therefore, wafer residency time constraints and chamber cleaning operations are widely required i...As wafer circuit widths shrink less than 10 nm,stringent quality control is imposed on the wafer fabrication processes. Therefore, wafer residency time constraints and chamber cleaning operations are widely required in chemical vapor deposition, coating processes, etc. They increase scheduling complexity in cluster tools. In this paper, we focus on scheduling single-arm multi-cluster tools with chamber cleaning operations subject to wafer residency time constraints. When a chamber is being cleaned, it can be viewed as processing a virtual wafer. In this way, chamber cleaning operations can be performed while wafer residency time constraints for real wafers are not violated. Based on such a method, we present the necessary and sufficient conditions to analytically check whether a single-arm multi-cluster tool can be scheduled with a chamber cleaning operation and wafer residency time constraints. An algorithm is proposed to adjust the cycle time for a cleaning operation that lasts a long cleaning time.Meanwhile, algorithms for a feasible schedule are also derived.And an algorithm is presented for operating a multi-cluster tool back to a steady state after the cleaning. Illustrative examples are given to show the application and effectiveness of the proposed method.展开更多
The historical significance of the Stern–Gerlach(SG)experiment lies in its provision of the initial evidence for space quantization.Over time,its sequential form has evolved into an elegant paradigm that effectively ...The historical significance of the Stern–Gerlach(SG)experiment lies in its provision of the initial evidence for space quantization.Over time,its sequential form has evolved into an elegant paradigm that effectively illustrates the fundamental principles of quantum theory.To date,the practical implementation of the sequential SG experiment has not been fully achieved.In this study,we demonstrate the capability of programmable quantum processors to simulate the sequential SG experiment.The specific parametric shallow quantum circuits,which are suitable for the limitations of current noisy quantum hardware,are given to replicate the functionality of SG devices with the ability to perform measurements in different directions.Surprisingly,it has been demonstrated that Wigner’s SG interferometer can be readily implemented in our sequential quantum circuit.With the utilization of the identical circuits,it is also feasible to implement Wheeler’s delayed-choice experiment.We propose the utilization of cross-shaped programmable quantum processors to showcase sequential experiments,and the simulation results demonstrate a strong alignment with theoretical predictions.With the rapid advancement of cloud-based quantum computing,such as BAQIS Quafu,it is our belief that the proposed solution is well-suited for deployment on the cloud,allowing for public accessibility.Our findings not only expand the potential applications of quantum computers,but also contribute to a deeper comprehension of the fundamental principles underlying quantum theory.展开更多
A notable portion of cachelines in real-world workloads exhibits inner non-uniform access behaviors.However,modern cache management rarely considers this fine-grained feature,which impacts the effective cache capacity...A notable portion of cachelines in real-world workloads exhibits inner non-uniform access behaviors.However,modern cache management rarely considers this fine-grained feature,which impacts the effective cache capacity of contemporary high-performance spacecraft processors.To harness these non-uniform access behaviors,an efficient cache replacement framework featuring an auxiliary cache specifically designed to retain evicted hot data was proposed.This framework reconstructs the cache replacement policy,facilitating data migration between the main cache and the auxiliary cache.Unlike traditional cacheline-granularity policies,the approach excels at identifying and evicting infrequently used data,thereby optimizing cache utilization.The evaluation shows impressive performance improvement,especially on workloads with irregular access patterns.Benefiting from fine granularity,the proposal achieves superior storage efficiency compared with commonly used cache management schemes,providing a potential optimization opportunity for modern resource-constrained processors,such as spacecraft processors.Furthermore,the framework complements existing modern cache replacement policies and can be seamlessly integrated with minimal modifications,enhancing their overall efficacy.展开更多
A treelike hybrid multi-cluster tool is composed of both single-arm and dual-arm cluster tools with a treelike topology. Scheduling such a tool is challenging. For a hybrid treelike multi-cluster tool whose bottleneck...A treelike hybrid multi-cluster tool is composed of both single-arm and dual-arm cluster tools with a treelike topology. Scheduling such a tool is challenging. For a hybrid treelike multi-cluster tool whose bottleneck individual tool is process-bound, this work aims at finding its optimal one-wafer cyclic schedule. It is modeled with Petri nets such that a onewafer cyclic schedule is parameterized as its robots' waiting time.Based on the model, this work proves the existence of its onewafer cyclic schedule that features with the ease of industrial implementation. Then, computationally efficient algorithms are proposed to find the minimal cycle time and optimal onewafer cyclic schedule. Multi-cluster tool examples are given to illustrate the proposed approach. The use of the found schedules enables industrial multi-cluster tools to operate with their highest productivity.展开更多
Multi-cluster tools are widely used in majority of wafer fabrication processes in semiconductor industry. Smaller lot production, thinner circuit width in wafers, larger wafer size, and maintenance have resulted in a ...Multi-cluster tools are widely used in majority of wafer fabrication processes in semiconductor industry. Smaller lot production, thinner circuit width in wafers, larger wafer size, and maintenance have resulted in a large quantity of their start-up and close-down transient periods. Yet, most of existing efforts have been concentrated on scheduling their steady states.Different from such efforts, this work schedules their transient and steady-state periods subject to wafer residency constraints. It gives the schedulability conditions for the steady-state scheduling of dual-blade robotic multi-cluster tools and a corresponding algorithm for finding an optimal schedule. Based on the robot synchronization conditions, a linear program is proposed to figure out an optimal schedule for a start-up period, which ensures a tool to enter the desired optimal steady state. Another linear program is proposed to find an optimal schedule for a closedown period that evolves from the steady state period. Finally,industrial cases are presented to illustrate how the provided method outperforms the existing approach in terms of system throughput improvement.展开更多
When it comes to decreasing margins and increasing energy effi-ciency in near-threshold and sub-threshold processors,timing error resilience may be viewed as a potentially lucrative alternative to examine.On the other...When it comes to decreasing margins and increasing energy effi-ciency in near-threshold and sub-threshold processors,timing error resilience may be viewed as a potentially lucrative alternative to examine.On the other hand,the currently employed approaches have certain restrictions,including high levels of design complexity,severe time constraints on error consolidation and propagation,and uncontaminated architectural registers(ARs).The design of near-threshold circuits,often known as NT circuits,is becoming the approach of choice for the construction of energy-efficient digital circuits.As a result of the exponentially decreased driving current,there was a reduction in performance,which was one of the downsides.Numerous studies have advised the use of NT techniques to chip multiprocessors as a means to preserve outstanding energy efficiency while minimising performance loss.Over the past several years,there has been a clear growth in interest in the development of artificial intelligence hardware with low energy consumption(AI).This has resulted in both large corporations and start-ups producing items that compete on the basis of varying degrees of performance and energy use.This technology’s ultimate goal was to provide levels of efficiency and performance that could not be achieved with graphics processing units or general-purpose CPUs.To achieve this objective,the technology was created to integrate several processing units into a single chip.To accomplish this purpose,the hardware was designed with a number of unique properties.In this study,an Energy Effi-cient Hyperparameter Tuned Deep Neural Network(EEHPT-DNN)model for Variation-Tolerant Near-Threshold Processor was developed.In order to improve the energy efficiency of artificial intelligence(AI),the EEHPT-DNN model employs several AI techniques.The notion focuses mostly on the repercussions of embedded technologies positioned at the network’s edge.The presented model employs a deep stacked sparse autoencoder(DSSAE)model with the objective of creating a variation-tolerant NT processor.The time-consuming method of modifying hyperparameters through trial and error is substituted with the marine predators optimization algorithm(MPO).This method is utilised to modify the hyperparameters associated with the DSSAE model.To validate that the proposed EEHPT-DNN model has a higher degree of functionality,a full simulation study is conducted,and the results are analysed from a variety of perspectives.This was completed so that the enhanced performance could be evaluated and analysed.According to the results of the study that compared numerous DL models,the EEHPT-DNN model performed significantly better than the other models.展开更多
针对可重入制造系统多具有多品种、大规模、混流生产等特点,构建带批处理机的可重入混合流水车间调度问题(reentrant hybrid flow shop scheduling problem with batch processors,BPRHFSP)模型,提出一种改进的多目标蜉蝣算法(multi-obj...针对可重入制造系统多具有多品种、大规模、混流生产等特点,构建带批处理机的可重入混合流水车间调度问题(reentrant hybrid flow shop scheduling problem with batch processors,BPRHFSP)模型,提出一种改进的多目标蜉蝣算法(multi-objective mayfly algorithm,MOMA)进行求解。提出了单件加工阶段和批处理阶段的解码规则;设计了基于Logistic混沌映射的反向学习初始化策略、改进的蜉蝣交配和变异策略,提高了算法初始解的质量和局部搜索能力;根据编码规则设计了基于变邻域下降搜索的蜉蝣运动策略,优化了种群方向。通过对不同规模大量测试算例的仿真实验,验证了MOMA相比传统算法求解BP-RHFSP更具有效性和优越性。所提出的模型能够反映生产的基础特征,达到减少最大完工时间、机器负载和碳排放的目的。展开更多
模型深度的不断增加和处理序列长度的不一致对循环神经网络在不同处理器上的性能优化提出巨大挑战。针对自主研制的长向量处理器FT-M7032,实现了一个高效的循环神经网络加速引擎。该引擎采用行优先矩阵向量乘算法和数据感知的多核并行方...模型深度的不断增加和处理序列长度的不一致对循环神经网络在不同处理器上的性能优化提出巨大挑战。针对自主研制的长向量处理器FT-M7032,实现了一个高效的循环神经网络加速引擎。该引擎采用行优先矩阵向量乘算法和数据感知的多核并行方式,提高矩阵向量乘的计算效率;采用两级内核融合优化方法降低临时数据传输的开销;采用手写汇编优化多种算子,进一步挖掘长向量处理器的性能潜力。实验表明,长向量处理器循环神经网络推理引擎可获得较高性能,相较于多核ARM CPU以及Intel Golden CPU,类循环神经网络模型长短记忆网络可获得最高62.68倍和3.12倍的性能加速。展开更多
The automation process is a very important pillar for Industry 4.0.One of the first steps is the control of motors to improve production efficiency and generate energy savings.In mass production industries,techniques ...The automation process is a very important pillar for Industry 4.0.One of the first steps is the control of motors to improve production efficiency and generate energy savings.In mass production industries,techniques such as digital signal processing(DSP)systems are implemented to control motors.These systems are efficient but very expensive for certain applications.From this arises the need for a controller capable of handling AC and DC motors that improves efficiency and maintains low energy consumption.This project presents the design of an adaptive control system for brushless AC induction and DC motors,which is functional to any type of plant in the industry.The design was possible by implementing Matlab software and tools such as digital signal processor(DSP)and Simulink.Through an extensive investigation of the state of the art,three models needed to represent the control system have been specified.The first model for the AC motor,the second for the DC motor and the third for the DSP control;this is done in this way so that the probability of failure is lower.Subsequently,these models have been programmed in Simulink,integrating the three main models into one.In this way,the design of a controller for use in AC induction motors,specifically squirrel cage and brushless DC motors,has been achieved.The final model represents a response time of 0.25 seconds,which is optimal for this type of application,where response times of 2e-3 to 3 seconds are expected.展开更多
基金financial support by the National Key Research and Development Program of China (No.2022YFE0129800)the National Natural Science Foundation of China (No.52074311)。
文摘The continental shale reservoirs of Jurassic Lianggaoshan Formation in Sichuan Basin contain thin lamina,which is characterized by strong plasticity and developed longitudinal shell limestone interlayer.To improve the production efficiency of reservoirs by multi-cluster fracturing,it is necessary to consider the unbalanced propagation of hydraulic fractures and the penetration effect of fractures.This paper constructed a numerical model of multi-fracture propagation and penetration based on the finite element coupling cohesive zone method;considering the construction cluster spacing,pump rate,lamina strength and other parameters studied the influencing factors of multi-cluster fracture interaction propagation;combined with AE energy data and fracture mode reconstruction method,quantitatively characterized the comprehensive impact of the strength of thin interlayer rock interfaces on the initiation and propagation of fractures that penetrate layers,and accurately predicted the propagation pattern of hydraulic fractures through laminated shale oil reservoirs.Simulation results revealed that in the process of multi-cluster fracturing,the proportion of shear damage is low,and mainly occurs in bedding fractures activated by outer fractures.Reducing the cluster spacing enhances the fracture system's penetration ability,though it lowers the activation efficiency of lamina.The high plasticity of the limestone interlayer may impact the vertical propagation distance of the main fracture.Improving the interface strength is beneficial to the reconstruction of the fracture height,but the interface communication effect is limited.Reasonable selection of layers with moderate lamina strength for fracturing stimulation,increasing the pump rate during fracturing and setting the cluster spacing reasonably are beneficial to improve the effect of reservoir stimulation.
基金Supported by the National Natural Science Foundation of China(51974332).
文摘This study conducted temporary plugging and diversion fracturing(TPDF)experiments using a true triaxial fracturing simulation system within a laboratory setting that replicated a lab-based horizontal well completion with multi-cluster sand jetting perforation.The effects of temporary plugging agent(TPA)particle size,TPA concentration,single-cluster perforation number and cluster number on plugging pressure,multi-fracture diversion pattern and distribution of TPAs were investigated.A combination of TPAs with small particle sizes within the fracture and large particle sizes within the segment is conducive to increasing the plugging pressure and promoting the diversion of multi-fractures.The addition of fibers can quickly achieve ultra-high pressure,but it may lead to longitudinal fractures extending along the wellbore.The temporary plugging peak pressure increases with an increase in the concentration of the TPA,reaching a peak at a certain concentration,and further increases do not significantly improve the temporary plugging peak pressure.The breaking pressure and temporary plugging peak pressure show a decreasing trend with an increase in single-cluster perforation number.A lower number of single-cluster perforations is beneficial for increasing the breaking pressure and temporary plugging peak pressure,and it has a more significant control on the propagation of multi-cluster fractures.A lower number of clusters is not conducive to increasing the total number and complexity of artificial fractures,while a higher number of clusters makes it difficult to achieve effective plugging.The TPAs within the fracture is mainly concentrated in the complex fracture areas,especially at the intersections of fractures.Meanwhile,the TPAs within the segment are primarily distributed near the perforation cluster apertures which initiated complex fractures.
基金The work is supported by the Sub-Project of“Research on Key Technologies and Equipment of Reservoir Stimulation”of China National Petroleum Corporation Post–14th Five-Year Plan Forward-Looking Major Science and Technology Project“Research on New Technology of Monitoring and Diagnosis of Horizontal Well Hydraulic Fracture Network Distribution Pattern”(2021DJ4502).
文摘Multistage multi-cluster hydraulic fracturing has enabled the economic exploitation of shale reservoirs,but the interpretation of hydraulic fracture parameters is challenging.The pressure signals after pump shutdown are influenced by hydraulic fractures,which can reflect the geometric features of hydraulic fracture.The shutdown pressure can be used to interpret the hydraulic fracture parameters in a real-time and cost-effective manner.In this paper,a mathematical model for shutdown pressure evolution is developed considering the effects of wellbore friction,perforation friction and fluid loss in fractures.An efficient numerical simulation method is established by using the method of characteristics.Based on this method,the impacts of fracture half-length,fracture height,opened cluster and perforation number,and filtration coefficient on the evolution of shutdown pressure are analyzed.The results indicate that a larger fracture half-length may hasten the decay of shutdown pressure,while a larger fracture height can slow down the decay of shutdown pressure.A smaller number of opened clusters and perforations can significantly increase the perforation friction and decrease the overall level of shutdown pressure.A larger filtration coefficient may accelerate the fluid filtration in the fracture and hasten the drop of the shutdown pressure.The simulation method of shutdown pressure,as well as the analysis results,has important implications for the interpretation of hydraulic fracture parameters.
基金supported in part by the Natural Science Foundation of Guangdong Province,China (2022A1515011310)。
文摘As wafer circuit widths shrink less than 10 nm,stringent quality control is imposed on the wafer fabrication processes. Therefore, wafer residency time constraints and chamber cleaning operations are widely required in chemical vapor deposition, coating processes, etc. They increase scheduling complexity in cluster tools. In this paper, we focus on scheduling single-arm multi-cluster tools with chamber cleaning operations subject to wafer residency time constraints. When a chamber is being cleaned, it can be viewed as processing a virtual wafer. In this way, chamber cleaning operations can be performed while wafer residency time constraints for real wafers are not violated. Based on such a method, we present the necessary and sufficient conditions to analytically check whether a single-arm multi-cluster tool can be scheduled with a chamber cleaning operation and wafer residency time constraints. An algorithm is proposed to adjust the cycle time for a cleaning operation that lasts a long cleaning time.Meanwhile, algorithms for a feasible schedule are also derived.And an algorithm is presented for operating a multi-cluster tool back to a steady state after the cleaning. Illustrative examples are given to show the application and effectiveness of the proposed method.
基金supported by Beijing Academy of Quantum Information Sciencessupported by the State Key Laboratory of Low Dimensional Quantum Physics+2 种基金the Start-up Fund provided by Tsinghua Universitythe financial support provided by the National Natural Science Foundation of China(Grant No.92065113)the Anhui Initiative in Quantum Information Technologies。
文摘The historical significance of the Stern–Gerlach(SG)experiment lies in its provision of the initial evidence for space quantization.Over time,its sequential form has evolved into an elegant paradigm that effectively illustrates the fundamental principles of quantum theory.To date,the practical implementation of the sequential SG experiment has not been fully achieved.In this study,we demonstrate the capability of programmable quantum processors to simulate the sequential SG experiment.The specific parametric shallow quantum circuits,which are suitable for the limitations of current noisy quantum hardware,are given to replicate the functionality of SG devices with the ability to perform measurements in different directions.Surprisingly,it has been demonstrated that Wigner’s SG interferometer can be readily implemented in our sequential quantum circuit.With the utilization of the identical circuits,it is also feasible to implement Wheeler’s delayed-choice experiment.We propose the utilization of cross-shaped programmable quantum processors to showcase sequential experiments,and the simulation results demonstrate a strong alignment with theoretical predictions.With the rapid advancement of cloud-based quantum computing,such as BAQIS Quafu,it is our belief that the proposed solution is well-suited for deployment on the cloud,allowing for public accessibility.Our findings not only expand the potential applications of quantum computers,but also contribute to a deeper comprehension of the fundamental principles underlying quantum theory.
文摘A notable portion of cachelines in real-world workloads exhibits inner non-uniform access behaviors.However,modern cache management rarely considers this fine-grained feature,which impacts the effective cache capacity of contemporary high-performance spacecraft processors.To harness these non-uniform access behaviors,an efficient cache replacement framework featuring an auxiliary cache specifically designed to retain evicted hot data was proposed.This framework reconstructs the cache replacement policy,facilitating data migration between the main cache and the auxiliary cache.Unlike traditional cacheline-granularity policies,the approach excels at identifying and evicting infrequently used data,thereby optimizing cache utilization.The evaluation shows impressive performance improvement,especially on workloads with irregular access patterns.Benefiting from fine granularity,the proposal achieves superior storage efficiency compared with commonly used cache management schemes,providing a potential optimization opportunity for modern resource-constrained processors,such as spacecraft processors.Furthermore,the framework complements existing modern cache replacement policies and can be seamlessly integrated with minimal modifications,enhancing their overall efficacy.
基金supported in part by Science and Technology Development Fund(FDCT)of Macao(106/2016/A3)the National Natural Science Foundation of China(U1401240)the Delta Electronics Inc and the National Research Foundation(NRF)Singapore under the Corp Lab@University Scheme
文摘A treelike hybrid multi-cluster tool is composed of both single-arm and dual-arm cluster tools with a treelike topology. Scheduling such a tool is challenging. For a hybrid treelike multi-cluster tool whose bottleneck individual tool is process-bound, this work aims at finding its optimal one-wafer cyclic schedule. It is modeled with Petri nets such that a onewafer cyclic schedule is parameterized as its robots' waiting time.Based on the model, this work proves the existence of its onewafer cyclic schedule that features with the ease of industrial implementation. Then, computationally efficient algorithms are proposed to find the minimal cycle time and optimal onewafer cyclic schedule. Multi-cluster tool examples are given to illustrate the proposed approach. The use of the found schedules enables industrial multi-cluster tools to operate with their highest productivity.
基金the National Natural Science Foundation of China(61673123,61803397)the Science and Technology Development Fund(FDCT)of Macao(106/2016/A3,005/2018/A1,011/2017/A,0017/2019/A1)
文摘Multi-cluster tools are widely used in majority of wafer fabrication processes in semiconductor industry. Smaller lot production, thinner circuit width in wafers, larger wafer size, and maintenance have resulted in a large quantity of their start-up and close-down transient periods. Yet, most of existing efforts have been concentrated on scheduling their steady states.Different from such efforts, this work schedules their transient and steady-state periods subject to wafer residency constraints. It gives the schedulability conditions for the steady-state scheduling of dual-blade robotic multi-cluster tools and a corresponding algorithm for finding an optimal schedule. Based on the robot synchronization conditions, a linear program is proposed to figure out an optimal schedule for a start-up period, which ensures a tool to enter the desired optimal steady state. Another linear program is proposed to find an optimal schedule for a closedown period that evolves from the steady state period. Finally,industrial cases are presented to illustrate how the provided method outperforms the existing approach in terms of system throughput improvement.
文摘When it comes to decreasing margins and increasing energy effi-ciency in near-threshold and sub-threshold processors,timing error resilience may be viewed as a potentially lucrative alternative to examine.On the other hand,the currently employed approaches have certain restrictions,including high levels of design complexity,severe time constraints on error consolidation and propagation,and uncontaminated architectural registers(ARs).The design of near-threshold circuits,often known as NT circuits,is becoming the approach of choice for the construction of energy-efficient digital circuits.As a result of the exponentially decreased driving current,there was a reduction in performance,which was one of the downsides.Numerous studies have advised the use of NT techniques to chip multiprocessors as a means to preserve outstanding energy efficiency while minimising performance loss.Over the past several years,there has been a clear growth in interest in the development of artificial intelligence hardware with low energy consumption(AI).This has resulted in both large corporations and start-ups producing items that compete on the basis of varying degrees of performance and energy use.This technology’s ultimate goal was to provide levels of efficiency and performance that could not be achieved with graphics processing units or general-purpose CPUs.To achieve this objective,the technology was created to integrate several processing units into a single chip.To accomplish this purpose,the hardware was designed with a number of unique properties.In this study,an Energy Effi-cient Hyperparameter Tuned Deep Neural Network(EEHPT-DNN)model for Variation-Tolerant Near-Threshold Processor was developed.In order to improve the energy efficiency of artificial intelligence(AI),the EEHPT-DNN model employs several AI techniques.The notion focuses mostly on the repercussions of embedded technologies positioned at the network’s edge.The presented model employs a deep stacked sparse autoencoder(DSSAE)model with the objective of creating a variation-tolerant NT processor.The time-consuming method of modifying hyperparameters through trial and error is substituted with the marine predators optimization algorithm(MPO).This method is utilised to modify the hyperparameters associated with the DSSAE model.To validate that the proposed EEHPT-DNN model has a higher degree of functionality,a full simulation study is conducted,and the results are analysed from a variety of perspectives.This was completed so that the enhanced performance could be evaluated and analysed.According to the results of the study that compared numerous DL models,the EEHPT-DNN model performed significantly better than the other models.
文摘模型深度的不断增加和处理序列长度的不一致对循环神经网络在不同处理器上的性能优化提出巨大挑战。针对自主研制的长向量处理器FT-M7032,实现了一个高效的循环神经网络加速引擎。该引擎采用行优先矩阵向量乘算法和数据感知的多核并行方式,提高矩阵向量乘的计算效率;采用两级内核融合优化方法降低临时数据传输的开销;采用手写汇编优化多种算子,进一步挖掘长向量处理器的性能潜力。实验表明,长向量处理器循环神经网络推理引擎可获得较高性能,相较于多核ARM CPU以及Intel Golden CPU,类循环神经网络模型长短记忆网络可获得最高62.68倍和3.12倍的性能加速。
文摘The automation process is a very important pillar for Industry 4.0.One of the first steps is the control of motors to improve production efficiency and generate energy savings.In mass production industries,techniques such as digital signal processing(DSP)systems are implemented to control motors.These systems are efficient but very expensive for certain applications.From this arises the need for a controller capable of handling AC and DC motors that improves efficiency and maintains low energy consumption.This project presents the design of an adaptive control system for brushless AC induction and DC motors,which is functional to any type of plant in the industry.The design was possible by implementing Matlab software and tools such as digital signal processor(DSP)and Simulink.Through an extensive investigation of the state of the art,three models needed to represent the control system have been specified.The first model for the AC motor,the second for the DC motor and the third for the DSP control;this is done in this way so that the probability of failure is lower.Subsequently,these models have been programmed in Simulink,integrating the three main models into one.In this way,the design of a controller for use in AC induction motors,specifically squirrel cage and brushless DC motors,has been achieved.The final model represents a response time of 0.25 seconds,which is optimal for this type of application,where response times of 2e-3 to 3 seconds are expected.