The amount of oxygen blown into the converter is one of the key parameters for the control of the converter blowing process,which directly affects the tap-to-tap time of converter. In this study, a hybrid model based ...The amount of oxygen blown into the converter is one of the key parameters for the control of the converter blowing process,which directly affects the tap-to-tap time of converter. In this study, a hybrid model based on oxygen balance mechanism (OBM) and deep neural network (DNN) was established for predicting oxygen blowing time in converter. A three-step method was utilized in the hybrid model. First, the oxygen consumption volume was predicted by the OBM model and DNN model, respectively. Second, a more accurate oxygen consumption volume was obtained by integrating the OBM model and DNN model. Finally, the converter oxygen blowing time was calculated according to the oxygen consumption volume and the oxygen supply intensity of each heat. The proposed hybrid model was verified using the actual data collected from an integrated steel plant in China, and compared with multiple linear regression model, OBM model, and neural network model including extreme learning machine, back propagation neural network, and DNN. The test results indicate that the hybrid model with a network structure of 3 hidden layer layers, 32-16-8 neurons per hidden layer, and 0.1 learning rate has the best prediction accuracy and stronger generalization ability compared with other models. The predicted hit ratio of oxygen consumption volume within the error±300 m^(3)is 96.67%;determination coefficient (R^(2)) and root mean square error (RMSE) are0.6984 and 150.03 m^(3), respectively. The oxygen blow time prediction hit ratio within the error±0.6 min is 89.50%;R2and RMSE are0.9486 and 0.3592 min, respectively. As a result, the proposed model can effectively predict the oxygen consumption volume and oxygen blowing time in the converter.展开更多
Neural networks are often viewed as pure‘black box’models,lacking interpretability and extrapolation capabilities of pure mechanistic models.This work proposes a new approach that,with the help of neural networks,im...Neural networks are often viewed as pure‘black box’models,lacking interpretability and extrapolation capabilities of pure mechanistic models.This work proposes a new approach that,with the help of neural networks,improves the conformity of the first-principal model to the actual plant.The final result is still a first-principal model rather than a hybrid model,which maintains the advantage of the high interpretability of first-principal model.This work better simulates industrial batch distillation which separates four components:water,ethylene glycol,diethylene glycol,and triethylene glycol.GRU(gated recurrent neural network)and LSTM(long short-term memory)were used to obtain empirical parameters of mechanistic model that are difficult to measure directly.These were used to improve the empirical processes in mechanistic model,thus correcting unreasonable model assumptions and achieving better predictability for batch distillation.The proposed method was verified using a case study from one industrial plant case,and the results show its advancement in improving model predictions and the potential to extend to other similar systems.展开更多
Implanted neural probes can detect weak discharges of neurons in the brain by piercing soft brain tissue,thus as important tools for brain science research,as well as diagnosis and treatment of brain diseases.However,...Implanted neural probes can detect weak discharges of neurons in the brain by piercing soft brain tissue,thus as important tools for brain science research,as well as diagnosis and treatment of brain diseases.However,the rigid neural probes,such as Utah arrays,Michigan probes,and metal microfilament electrodes,are mechanically unmatched with brain tissue and are prone to rejection and glial scarring after implantation,which leads to a significant degradation in the signal quality with the implantation time.In recent years,flexible neural electrodes are rapidly developed with less damage to biological tissues,excellent biocompatibility,and mechanical compliance to alleviate scarring.Among them,the mechanical modeling is important for the optimization of the structure and the implantation process.In this review,the theoretical calculation of the flexible neural probes is firstly summarized with the processes of buckling,insertion,and relative interaction with soft brain tissue for flexible probes from outside to inside.Then,the corresponding mechanical simulation methods are organized considering multiple impact factors to realize minimally invasive implantation.Finally,the technical difficulties and future trends of mechanical modeling are discussed for the next-generation flexible neural probes,which is critical to realize low-invasiveness and long-term coexistence in vivo.展开更多
The present study proposes a sub-grid scale model for the one-dimensional Burgers turbulence based on the neuralnetwork and deep learning method.The filtered data of the direct numerical simulation is used to establis...The present study proposes a sub-grid scale model for the one-dimensional Burgers turbulence based on the neuralnetwork and deep learning method.The filtered data of the direct numerical simulation is used to establish thetraining data set,the validation data set,and the test data set.The artificial neural network(ANN)methodand Back Propagation method are employed to train parameters in the ANN.The developed ANN is applied toconstruct the sub-grid scale model for the large eddy simulation of the Burgers turbulence in the one-dimensionalspace.The proposed model well predicts the time correlation and the space correlation of the Burgers turbulence.展开更多
Wheat is a critical crop,extensively consumed worldwide,and its production enhancement is essential to meet escalating demand.The presence of diseases like stem rust,leaf rust,yellow rust,and tan spot significantly di...Wheat is a critical crop,extensively consumed worldwide,and its production enhancement is essential to meet escalating demand.The presence of diseases like stem rust,leaf rust,yellow rust,and tan spot significantly diminishes wheat yield,making the early and precise identification of these diseases vital for effective disease management.With advancements in deep learning algorithms,researchers have proposed many methods for the automated detection of disease pathogens;however,accurately detectingmultiple disease pathogens simultaneously remains a challenge.This challenge arises due to the scarcity of RGB images for multiple diseases,class imbalance in existing public datasets,and the difficulty in extracting features that discriminate between multiple classes of disease pathogens.In this research,a novel method is proposed based on Transfer Generative Adversarial Networks for augmenting existing data,thereby overcoming the problems of class imbalance and data scarcity.This study proposes a customized architecture of Vision Transformers(ViT),where the feature vector is obtained by concatenating features extracted from the custom ViT and Graph Neural Networks.This paper also proposes a Model AgnosticMeta Learning(MAML)based ensemble classifier for accurate classification.The proposedmodel,validated on public datasets for wheat disease pathogen classification,achieved a test accuracy of 99.20%and an F1-score of 97.95%.Compared with existing state-of-the-art methods,this proposed model outperforms in terms of accuracy,F1-score,and the number of disease pathogens detection.In future,more diseases can be included for detection along with some other modalities like pests and weed.展开更多
Modeling of unsteady aerodynamic loads at high angles of attack using a small amount of experimental or simulation data to construct predictive models for unknown states can greatly improve the efficiency of aircraft ...Modeling of unsteady aerodynamic loads at high angles of attack using a small amount of experimental or simulation data to construct predictive models for unknown states can greatly improve the efficiency of aircraft unsteady aerodynamic design and flight dynamics analysis.In this paper,aiming at the problems of poor generalization of traditional aerodynamic models and intelligent models,an intelligent aerodynamic modeling method based on gated neural units is proposed.The time memory characteristics of the gated neural unit is fully utilized,thus the nonlinear flow field characterization ability of the learning and training process is enhanced,and the generalization ability of the whole prediction model is improved.The prediction and verification of the model are carried out under the maneuvering flight condition of NACA0015 airfoil.The results show that the model has good adaptability.In the interpolation prediction,the maximum prediction error of the lift and drag coefficients and the moment coefficient does not exceed 10%,which can basically represent the variation characteristics of the entire flow field.In the construction of extrapolation models,the training model based on the strong nonlinear data has good accuracy for weak nonlinear prediction.Furthermore,the error is larger,even exceeding 20%,which indicates that the extrapolation and generalization capabilities need to be further optimized by integrating physical models.Compared with the conventional state space equation model,the proposed method can improve the extrapolation accuracy and efficiency by 78%and 60%,respectively,which demonstrates the applied potential of this method in aerodynamic modeling.展开更多
Blades are essential components of wind turbines.Reducing their fatigue loads during operation helps to extend their lifespan,but it is difficult to quickly and accurately calculate the fatigue loads of blades.To solv...Blades are essential components of wind turbines.Reducing their fatigue loads during operation helps to extend their lifespan,but it is difficult to quickly and accurately calculate the fatigue loads of blades.To solve this problem,this paper innovatively designs a data-driven blade load modeling method based on a deep learning framework through mechanism analysis,feature selection,and model construction.In the mechanism analysis part,the generation mechanism of blade loads and the load theoretical calculationmethod based on material damage theory are analyzed,and four measurable operating state parameters related to blade loads are screened;in the feature extraction part,15 characteristic indicators of each screened parameter are extracted in the time and frequency domain,and feature selection is completed through correlation analysis with blade loads to determine the input parameters of data-driven modeling;in the model construction part,a deep neural network based on feedforward and feedback propagation is designed to construct the nonlinear coupling relationship between the unit operating parameter characteristics and blade loads.The results show that the proposed method mines the wind turbine operating state characteristics highly correlated with the blade load,such as the standard deviation of wind speed.The model built using these characteristics has reasonable calculation and fitting capabilities for the blade load and shows a better fitting level for untrained out-of-sample data than the traditional scheme.Based on the mean absolute percentage error calculation,the modeling accuracy of the two blade loads can reach more than 90%and 80%,respectively,providing a good foundation for the subsequent optimization control to suppress the blade load.展开更多
Deep learning, especially through convolutional neural networks (CNN) such as the U-Net 3D model, has revolutionized fault identification from seismic data, representing a significant leap over traditional methods. Ou...Deep learning, especially through convolutional neural networks (CNN) such as the U-Net 3D model, has revolutionized fault identification from seismic data, representing a significant leap over traditional methods. Our review traces the evolution of CNN, emphasizing the adaptation and capabilities of the U-Net 3D model in automating seismic fault delineation with unprecedented accuracy. We find: 1) The transition from basic neural networks to sophisticated CNN has enabled remarkable advancements in image recognition, which are directly applicable to analyzing seismic data. The U-Net 3D model, with its innovative architecture, exemplifies this progress by providing a method for detailed and accurate fault detection with reduced manual interpretation bias. 2) The U-Net 3D model has demonstrated its superiority over traditional fault identification methods in several key areas: it has enhanced interpretation accuracy, increased operational efficiency, and reduced the subjectivity of manual methods. 3) Despite these achievements, challenges such as the need for effective data preprocessing, acquisition of high-quality annotated datasets, and achieving model generalization across different geological conditions remain. Future research should therefore focus on developing more complex network architectures and innovative training strategies to refine fault identification performance further. Our findings confirm the transformative potential of deep learning, particularly CNN like the U-Net 3D model, in geosciences, advocating for its broader integration to revolutionize geological exploration and seismic analysis.展开更多
This study proposes a novel approach for estimating automobile insurance loss reserves utilizing Artificial Neural Network (ANN) techniques integrated with actuarial data intelligence. The model aims to address the ch...This study proposes a novel approach for estimating automobile insurance loss reserves utilizing Artificial Neural Network (ANN) techniques integrated with actuarial data intelligence. The model aims to address the challenges of accurately predicting insurance claim frequencies, severities, and overall loss reserves while accounting for inflation adjustments. Through comprehensive data analysis and model development, this research explores the effectiveness of ANN methodologies in capturing complex nonlinear relationships within insurance data. The study leverages a data set comprising automobile insurance policyholder information, claim history, and economic indicators to train and validate the ANN-based reserving model. Key aspects of the methodology include data preprocessing techniques such as one-hot encoding and scaling, followed by the construction of frequency, severity, and overall loss reserving models using ANN architectures. Moreover, the model incorporates inflation adjustment factors to ensure the accurate estimation of future loss reserves in real terms. Results from the study demonstrate the superior predictive performance of the ANN-based reserving model compared to traditional actuarial methods, with substantial improvements in accuracy and robustness. Furthermore, the model’s ability to adapt to changing market conditions and regulatory requirements, such as IFRS17, highlights its practical relevance in the insurance industry. The findings of this research contribute to the advancement of actuarial science and provide valuable insights for insurance companies seeking more accurate and efficient loss reserving techniques. The proposed ANN-based approach offers a promising avenue for enhancing risk management practices and optimizing financial decision-making processes in the automobile insurance sector.展开更多
The goals of this study are to assess the viability of waste tire-derived char(WTDC)as a sustainable,low-cost fine aggregate surrogate material for asphalt mixtures and to develop the statistically coupled neural netw...The goals of this study are to assess the viability of waste tire-derived char(WTDC)as a sustainable,low-cost fine aggregate surrogate material for asphalt mixtures and to develop the statistically coupled neural network(SCNN)model for predicting volumetric and Marshall properties of asphalt mixtures modified with WTDC.The study is based on experimental data acquired from laboratory volumetric and Marshall properties testing on WTDCmodified asphalt mixtures(WTDC-MAM).The input variables comprised waste tire char content and asphalt binder content.The output variables comprised mixture unit weight,total voids,voids filled with asphalt,Marshall stability,and flow.Statistical coupled neural networks were utilized to predict the volumetric and Marshall properties of asphalt mixtures.For predictive modeling,the SCNN model is employed,incorporating a three-layer neural network and preprocessing techniques to enhance accuracy and reliability.The optimal network architecture,using the collected dataset,was a 2:6:5 structure,and the neural network was trained with 60%of the data,whereas the other 20%was used for cross-validation and testing respectively.The network employed a hyperbolic tangent(tanh)activation function and a feed-forward backpropagation.According to the results,the network model could accurately predict the volumetric and Marshall properties.The predicted accuracy of SCNN was found to be as high value>98%and low prediction errors for both volumetric and Marshall properties.This study demonstrates WTDC's potential as a low-cost,sustainable aggregate replacement.The SCNN-based predictive model proves its efficiency and versatility and promotes sustainable practices.展开更多
A hybrid identification model based on multilayer artificial neural networks(ANNs) and particle swarm optimization(PSO) algorithm is developed to improve the simultaneous identification efficiency of thermal conductiv...A hybrid identification model based on multilayer artificial neural networks(ANNs) and particle swarm optimization(PSO) algorithm is developed to improve the simultaneous identification efficiency of thermal conductivity and effective absorption coefficient of semitransparent materials.For the direct model,the spherical harmonic method and the finite volume method are used to solve the coupled conduction-radiation heat transfer problem in an absorbing,emitting,and non-scattering 2D axisymmetric gray medium in the background of laser flash method.For the identification part,firstly,the temperature field and the incident radiation field in different positions are chosen as observables.Then,a traditional identification model based on PSO algorithm is established.Finally,multilayer ANNs are built to fit and replace the direct model in the traditional identification model to speed up the identification process.The results show that compared with the traditional identification model,the time cost of the hybrid identification model is reduced by about 1 000 times.Besides,the hybrid identification model remains a high level of accuracy even with measurement errors.展开更多
This study explores the factors influencing metro passengers’ arrival volume in Wuhan, China, and Lagos, Nigeria, by examining weather, time of day, waiting time, travel behavior, arrival patterns, and metro satisfac...This study explores the factors influencing metro passengers’ arrival volume in Wuhan, China, and Lagos, Nigeria, by examining weather, time of day, waiting time, travel behavior, arrival patterns, and metro satisfaction. It addresses a significant research gap in understanding metro passengers’ dynamics across cultural and geographical contexts. It employs questionnaires, field observations, and advanced data analysis techniques like association rule mining and neural network modeling. Key findings include a correlation between rainy weather, shorter waiting times, and higher arrival volumes. Neural network models showed high predictive accuracy, with waiting time, metro satisfaction, and weather being significant factors in Lagos Light Rail Blue Line Metro. In contrast, arrival patterns, weather, and time of day were more influential in Wuhan Metro Line 5. Results suggest that improving metro satisfaction and reducing waiting times could increase arrival volumes in Lagos Metro while adjusting schedules for weather and peak times could optimize flow in Wuhan Metro. These insights are valuable for transportation planning, passenger arrival volume management, and enhancing user experiences, potentially benefiting urban transportation sustainability and development goals.展开更多
This study introduces an innovative“Big Model”strategy to enhance Bridge Structural Health Monitoring(SHM)using a Convolutional Neural Network(CNN),time-frequency analysis,and fine element analysis.Leveraging ensemb...This study introduces an innovative“Big Model”strategy to enhance Bridge Structural Health Monitoring(SHM)using a Convolutional Neural Network(CNN),time-frequency analysis,and fine element analysis.Leveraging ensemble methods,collaborative learning,and distributed computing,the approach effectively manages the complexity and scale of large-scale bridge data.The CNN employs transfer learning,fine-tuning,and continuous monitoring to optimize models for adaptive and accurate structural health assessments,focusing on extracting meaningful features through time-frequency analysis.By integrating Finite Element Analysis,time-frequency analysis,and CNNs,the strategy provides a comprehensive understanding of bridge health.Utilizing diverse sensor data,sophisticated feature extraction,and advanced CNN architecture,the model is optimized through rigorous preprocessing and hyperparameter tuning.This approach significantly enhances the ability to make accurate predictions,monitor structural health,and support proactive maintenance practices,thereby ensuring the safety and longevity of critical infrastructure.展开更多
A mapping function between the Reynolds-averaged Navier-Stokes mean flow variables and transition intermittency factor is constructed by fully connected artificial neural network(ANN),which replaces the governing equa...A mapping function between the Reynolds-averaged Navier-Stokes mean flow variables and transition intermittency factor is constructed by fully connected artificial neural network(ANN),which replaces the governing equation of the intermittency factor in transition-predictive Spalart-Allmaras(SA)-γmodel.By taking SA-γmodel as the benchmark,the present ANN model is trained at two airfoils with various angles of attack,Mach numbers and Reynolds numbers,and tested with unseen airfoils in different flow states.The a posteriori tests manifest that the mean pressure coefficient,skin friction coefficient,size of laminar separation bubble,mean streamwise velocity,Reynolds shear stress and lift/drag/moment coefficient from the present two-way coupling ANN model almost coincide with those from the benchmark SA-γmodel.Furthermore,the ANN model proves to exhibit a higher calculation efficiency and better convergence quality than traditional SA-γmodel.展开更多
Fully connected neural networks(FCNNs)have been developed for the closure of subgrid-scale(SGS)stress and SGS heat flux in large-eddy simulations of compressible turbulent channel flow.The FCNNbased SGS model trained ...Fully connected neural networks(FCNNs)have been developed for the closure of subgrid-scale(SGS)stress and SGS heat flux in large-eddy simulations of compressible turbulent channel flow.The FCNNbased SGS model trained using data with Mach number Ma=3.0 and Reynolds number Re=3000 was applied to situations with different Mach numbers and Reynolds numbers.The input variables of the neural network model were the filtered velocity gradients and temperature gradients at a single spatial grid point.The a priori test showed that the FCNN model had a correlation coefficient larger than 0.91 and a relative error smaller than 0.43,with much better reconstructions of SGS unclosed terms than the dynamic Smagorinsky model(DSM).In a posteriori test,the behavior of the FCNN model was marginally better than that of the DSM in predicting the mean velocity profiles,mean temperature profiles,turbulent intensities,total Reynolds stress,total Reynolds heat flux,and mean SGS flux of kinetic energy,and outperformed the Smagorinsky model.展开更多
This works intends to provide numerical solutions based on the nonlinear fractional order derivatives of the classical White and Comiskey model(NFD-WCM).The fractional order derivatives have provided authentic and acc...This works intends to provide numerical solutions based on the nonlinear fractional order derivatives of the classical White and Comiskey model(NFD-WCM).The fractional order derivatives have provided authentic and accurate solutions for the NDF-WCM.The solutions of the fractional NFD-WCM are provided using the stochastic computing supervised algorithm named Levenberg-Marquard Backpropagation(LMB)based on neural networks(NNs).This regression approach combines gradient descent and Gauss-Newton iterative methods,which means finding a solution through the sequences of different calculations.WCM is used to demonstrate the heroin epidemics.Heroin has been on-growth world wide,mainly in Asia,Europe,and the USA.It is the fourth foremost cause of death due to taking an overdose in the USA.The nonlinear mathematical system NFD-WCM discusses the overall circumstance of different drug users,such as suspected groups,drug users without treatment,and drug users with treatment.The numerical results of NFD-WCM via LMB-NNs have been substantiated through training,testing,and validation measures.The stability and accuracy are then checked through the statistical tool,such asmean square error(MSE),error histogram,and fitness curves.The suggested methodology’s strength is demonstrated by the high convergence between the reference solutions and the solutions generated by adding the efficacy of a constructed solver LMB-NNs,with accuracy levels ranging from 10?9 to 10?10.展开更多
The motivation for this study is that the quality of deep fakes is constantly improving,which leads to the need to develop new methods for their detection.The proposed Customized Convolutional Neural Network method in...The motivation for this study is that the quality of deep fakes is constantly improving,which leads to the need to develop new methods for their detection.The proposed Customized Convolutional Neural Network method involves extracting structured data from video frames using facial landmark detection,which is then used as input to the CNN.The customized Convolutional Neural Network method is the date augmented-based CNN model to generate‘fake data’or‘fake images’.This study was carried out using Python and its libraries.We used 242 films from the dataset gathered by the Deep Fake Detection Challenge,of which 199 were made up and the remaining 53 were real.Ten seconds were allotted for each video.There were 318 videos used in all,199 of which were fake and 119 of which were real.Our proposedmethod achieved a testing accuracy of 91.47%,loss of 0.342,and AUC score of 0.92,outperforming two alternative approaches,CNN and MLP-CNN.Furthermore,our method succeeded in greater accuracy than contemporary models such as XceptionNet,Meso-4,EfficientNet-BO,MesoInception-4,VGG-16,and DST-Net.The novelty of this investigation is the development of a new Convolutional Neural Network(CNN)learning model that can accurately detect deep fake face photos.展开更多
In recent years,with the great success of pre-trained language models,the pre-trained BERT model has been gradually applied to the field of source code understanding.However,the time cost of training a language model ...In recent years,with the great success of pre-trained language models,the pre-trained BERT model has been gradually applied to the field of source code understanding.However,the time cost of training a language model from zero is very high,and how to transfer the pre-trained language model to the field of smart contract vulnerability detection is a hot research direction at present.In this paper,we propose a hybrid model to detect common vulnerabilities in smart contracts based on a lightweight pre-trained languagemodel BERT and connected to a bidirectional gate recurrent unitmodel.The downstream neural network adopts the bidirectional gate recurrent unit neural network model with a hierarchical attention mechanism to mine more semantic features contained in the source code of smart contracts by using their characteristics.Our experiments show that our proposed hybrid neural network model SolBERT-BiGRU-Attention is fitted by a large number of data samples with smart contract vulnerabilities,and it is found that compared with the existing methods,the accuracy of our model can reach 93.85%,and the Micro-F1 Score is 94.02%.展开更多
The motive of these investigations is to provide the importance and significance of the fractional order(FO)derivatives in the nonlinear environmental and economic(NEE)model,i.e.,FO-NEE model.The dynamics of the NEE m...The motive of these investigations is to provide the importance and significance of the fractional order(FO)derivatives in the nonlinear environmental and economic(NEE)model,i.e.,FO-NEE model.The dynamics of the NEE model achieves more precise by using the form of the FO derivative.The investigations through the non-integer and nonlinear mathematical form to define the FO-NEE model are also provided in this study.The composition of the FO-NEEmodel is classified into three classes,execution cost of control,system competence of industrial elements and a new diagnostics technical exclusion cost.The mathematical FO-NEE system is numerically studied by using the artificial neural networks(ANNs)along with the Levenberg-Marquardt backpropagation method(ANNs-LMBM).Three different cases using the FO derivative have been examined to present the numerical performances of the FO-NEE model.The data is selected to solve the mathematical FO-NEE system is executed as 70%for training and 15%for both testing and certification.The exactness of the proposed ANNs-LMBM is observed through the comparison of the obtained and the Adams-Bashforth-Moulton database results.To ratify the aptitude,validity,constancy,exactness,and competence of the ANNs-LMBM,the numerical replications using the state transitions,regression,correlation,error histograms and mean square error are also described.展开更多
To obtain excellent regression results under the condition of small sample hyperspectral data,a deep neural network with simulated annealing(SA-DNN)is proposed.According to the characteristics of data,the attention me...To obtain excellent regression results under the condition of small sample hyperspectral data,a deep neural network with simulated annealing(SA-DNN)is proposed.According to the characteristics of data,the attention mechanism was applied to make the network pay more attention to effective features,thereby improving the operating efficiency.By introducing an improved activation function,the data correlation was reduced based on increasing the operation rate,and the problem of over-fitting was alleviated.By introducing simulated annealing,the network chose the optimal learning rate by itself,which avoided falling into the local optimum to the greatest extent.To evaluate the performance of the SA-DNN,the coefficient of determination(R^(2)),root mean square error(RMSE),and other metrics were used to evaluate the model.The results show that the performance of the SA-DNN is significantly better than other traditional methods.展开更多
基金financially supported by the National Natural Science Foundation of China (Nos.51974023 and52374321)the funding of State Key Laboratory of Advanced Metallurgy,University of Science and Technology Beijing,China (No.41620007)。
文摘The amount of oxygen blown into the converter is one of the key parameters for the control of the converter blowing process,which directly affects the tap-to-tap time of converter. In this study, a hybrid model based on oxygen balance mechanism (OBM) and deep neural network (DNN) was established for predicting oxygen blowing time in converter. A three-step method was utilized in the hybrid model. First, the oxygen consumption volume was predicted by the OBM model and DNN model, respectively. Second, a more accurate oxygen consumption volume was obtained by integrating the OBM model and DNN model. Finally, the converter oxygen blowing time was calculated according to the oxygen consumption volume and the oxygen supply intensity of each heat. The proposed hybrid model was verified using the actual data collected from an integrated steel plant in China, and compared with multiple linear regression model, OBM model, and neural network model including extreme learning machine, back propagation neural network, and DNN. The test results indicate that the hybrid model with a network structure of 3 hidden layer layers, 32-16-8 neurons per hidden layer, and 0.1 learning rate has the best prediction accuracy and stronger generalization ability compared with other models. The predicted hit ratio of oxygen consumption volume within the error±300 m^(3)is 96.67%;determination coefficient (R^(2)) and root mean square error (RMSE) are0.6984 and 150.03 m^(3), respectively. The oxygen blow time prediction hit ratio within the error±0.6 min is 89.50%;R2and RMSE are0.9486 and 0.3592 min, respectively. As a result, the proposed model can effectively predict the oxygen consumption volume and oxygen blowing time in the converter.
基金supported by Beijing Natural Science Foundation(2222037)by the Fundamental Research Funds for the Central Universities.
文摘Neural networks are often viewed as pure‘black box’models,lacking interpretability and extrapolation capabilities of pure mechanistic models.This work proposes a new approach that,with the help of neural networks,improves the conformity of the first-principal model to the actual plant.The final result is still a first-principal model rather than a hybrid model,which maintains the advantage of the high interpretability of first-principal model.This work better simulates industrial batch distillation which separates four components:water,ethylene glycol,diethylene glycol,and triethylene glycol.GRU(gated recurrent neural network)and LSTM(long short-term memory)were used to obtain empirical parameters of mechanistic model that are difficult to measure directly.These were used to improve the empirical processes in mechanistic model,thus correcting unreasonable model assumptions and achieving better predictability for batch distillation.The proposed method was verified using a case study from one industrial plant case,and the results show its advancement in improving model predictions and the potential to extend to other similar systems.
基金support received from the National Natural Science Foundation of China(GrantNos.62204204 and 52175148)Science and Technology Innovation 2030-Major Project(Grant No.2022ZD0208601)+1 种基金Shanghai Sailing Program(Grant No.21YF1451000)Presidential Foundation of CAEP(Grant No.YZJJZQ2022001).
文摘Implanted neural probes can detect weak discharges of neurons in the brain by piercing soft brain tissue,thus as important tools for brain science research,as well as diagnosis and treatment of brain diseases.However,the rigid neural probes,such as Utah arrays,Michigan probes,and metal microfilament electrodes,are mechanically unmatched with brain tissue and are prone to rejection and glial scarring after implantation,which leads to a significant degradation in the signal quality with the implantation time.In recent years,flexible neural electrodes are rapidly developed with less damage to biological tissues,excellent biocompatibility,and mechanical compliance to alleviate scarring.Among them,the mechanical modeling is important for the optimization of the structure and the implantation process.In this review,the theoretical calculation of the flexible neural probes is firstly summarized with the processes of buckling,insertion,and relative interaction with soft brain tissue for flexible probes from outside to inside.Then,the corresponding mechanical simulation methods are organized considering multiple impact factors to realize minimally invasive implantation.Finally,the technical difficulties and future trends of mechanical modeling are discussed for the next-generation flexible neural probes,which is critical to realize low-invasiveness and long-term coexistence in vivo.
基金supported by the National Key R&D Program of China(Grant No.2022YFB3303500).
文摘The present study proposes a sub-grid scale model for the one-dimensional Burgers turbulence based on the neuralnetwork and deep learning method.The filtered data of the direct numerical simulation is used to establish thetraining data set,the validation data set,and the test data set.The artificial neural network(ANN)methodand Back Propagation method are employed to train parameters in the ANN.The developed ANN is applied toconstruct the sub-grid scale model for the large eddy simulation of the Burgers turbulence in the one-dimensionalspace.The proposed model well predicts the time correlation and the space correlation of the Burgers turbulence.
基金Researchers Supporting Project Number(RSPD2024R 553),King Saud University,Riyadh,Saudi Arabia.
文摘Wheat is a critical crop,extensively consumed worldwide,and its production enhancement is essential to meet escalating demand.The presence of diseases like stem rust,leaf rust,yellow rust,and tan spot significantly diminishes wheat yield,making the early and precise identification of these diseases vital for effective disease management.With advancements in deep learning algorithms,researchers have proposed many methods for the automated detection of disease pathogens;however,accurately detectingmultiple disease pathogens simultaneously remains a challenge.This challenge arises due to the scarcity of RGB images for multiple diseases,class imbalance in existing public datasets,and the difficulty in extracting features that discriminate between multiple classes of disease pathogens.In this research,a novel method is proposed based on Transfer Generative Adversarial Networks for augmenting existing data,thereby overcoming the problems of class imbalance and data scarcity.This study proposes a customized architecture of Vision Transformers(ViT),where the feature vector is obtained by concatenating features extracted from the custom ViT and Graph Neural Networks.This paper also proposes a Model AgnosticMeta Learning(MAML)based ensemble classifier for accurate classification.The proposedmodel,validated on public datasets for wheat disease pathogen classification,achieved a test accuracy of 99.20%and an F1-score of 97.95%.Compared with existing state-of-the-art methods,this proposed model outperforms in terms of accuracy,F1-score,and the number of disease pathogens detection.In future,more diseases can be included for detection along with some other modalities like pests and weed.
基金supported in part by the National Natural Science Foundation of China (No. 12202363)。
文摘Modeling of unsteady aerodynamic loads at high angles of attack using a small amount of experimental or simulation data to construct predictive models for unknown states can greatly improve the efficiency of aircraft unsteady aerodynamic design and flight dynamics analysis.In this paper,aiming at the problems of poor generalization of traditional aerodynamic models and intelligent models,an intelligent aerodynamic modeling method based on gated neural units is proposed.The time memory characteristics of the gated neural unit is fully utilized,thus the nonlinear flow field characterization ability of the learning and training process is enhanced,and the generalization ability of the whole prediction model is improved.The prediction and verification of the model are carried out under the maneuvering flight condition of NACA0015 airfoil.The results show that the model has good adaptability.In the interpolation prediction,the maximum prediction error of the lift and drag coefficients and the moment coefficient does not exceed 10%,which can basically represent the variation characteristics of the entire flow field.In the construction of extrapolation models,the training model based on the strong nonlinear data has good accuracy for weak nonlinear prediction.Furthermore,the error is larger,even exceeding 20%,which indicates that the extrapolation and generalization capabilities need to be further optimized by integrating physical models.Compared with the conventional state space equation model,the proposed method can improve the extrapolation accuracy and efficiency by 78%and 60%,respectively,which demonstrates the applied potential of this method in aerodynamic modeling.
基金supported by Science and Technology Project funding from China Southern Power Grid Corporation No.GDKJXM20230245(031700KC23020003).
文摘Blades are essential components of wind turbines.Reducing their fatigue loads during operation helps to extend their lifespan,but it is difficult to quickly and accurately calculate the fatigue loads of blades.To solve this problem,this paper innovatively designs a data-driven blade load modeling method based on a deep learning framework through mechanism analysis,feature selection,and model construction.In the mechanism analysis part,the generation mechanism of blade loads and the load theoretical calculationmethod based on material damage theory are analyzed,and four measurable operating state parameters related to blade loads are screened;in the feature extraction part,15 characteristic indicators of each screened parameter are extracted in the time and frequency domain,and feature selection is completed through correlation analysis with blade loads to determine the input parameters of data-driven modeling;in the model construction part,a deep neural network based on feedforward and feedback propagation is designed to construct the nonlinear coupling relationship between the unit operating parameter characteristics and blade loads.The results show that the proposed method mines the wind turbine operating state characteristics highly correlated with the blade load,such as the standard deviation of wind speed.The model built using these characteristics has reasonable calculation and fitting capabilities for the blade load and shows a better fitting level for untrained out-of-sample data than the traditional scheme.Based on the mean absolute percentage error calculation,the modeling accuracy of the two blade loads can reach more than 90%and 80%,respectively,providing a good foundation for the subsequent optimization control to suppress the blade load.
文摘Deep learning, especially through convolutional neural networks (CNN) such as the U-Net 3D model, has revolutionized fault identification from seismic data, representing a significant leap over traditional methods. Our review traces the evolution of CNN, emphasizing the adaptation and capabilities of the U-Net 3D model in automating seismic fault delineation with unprecedented accuracy. We find: 1) The transition from basic neural networks to sophisticated CNN has enabled remarkable advancements in image recognition, which are directly applicable to analyzing seismic data. The U-Net 3D model, with its innovative architecture, exemplifies this progress by providing a method for detailed and accurate fault detection with reduced manual interpretation bias. 2) The U-Net 3D model has demonstrated its superiority over traditional fault identification methods in several key areas: it has enhanced interpretation accuracy, increased operational efficiency, and reduced the subjectivity of manual methods. 3) Despite these achievements, challenges such as the need for effective data preprocessing, acquisition of high-quality annotated datasets, and achieving model generalization across different geological conditions remain. Future research should therefore focus on developing more complex network architectures and innovative training strategies to refine fault identification performance further. Our findings confirm the transformative potential of deep learning, particularly CNN like the U-Net 3D model, in geosciences, advocating for its broader integration to revolutionize geological exploration and seismic analysis.
文摘This study proposes a novel approach for estimating automobile insurance loss reserves utilizing Artificial Neural Network (ANN) techniques integrated with actuarial data intelligence. The model aims to address the challenges of accurately predicting insurance claim frequencies, severities, and overall loss reserves while accounting for inflation adjustments. Through comprehensive data analysis and model development, this research explores the effectiveness of ANN methodologies in capturing complex nonlinear relationships within insurance data. The study leverages a data set comprising automobile insurance policyholder information, claim history, and economic indicators to train and validate the ANN-based reserving model. Key aspects of the methodology include data preprocessing techniques such as one-hot encoding and scaling, followed by the construction of frequency, severity, and overall loss reserving models using ANN architectures. Moreover, the model incorporates inflation adjustment factors to ensure the accurate estimation of future loss reserves in real terms. Results from the study demonstrate the superior predictive performance of the ANN-based reserving model compared to traditional actuarial methods, with substantial improvements in accuracy and robustness. Furthermore, the model’s ability to adapt to changing market conditions and regulatory requirements, such as IFRS17, highlights its practical relevance in the insurance industry. The findings of this research contribute to the advancement of actuarial science and provide valuable insights for insurance companies seeking more accurate and efficient loss reserving techniques. The proposed ANN-based approach offers a promising avenue for enhancing risk management practices and optimizing financial decision-making processes in the automobile insurance sector.
基金the University of Teknologi PETRONAS(UTP),Malaysia,and Ahmadu Bello University,Nigeria,for their vital help and availability of laboratory facilities that allowed this work to be conducted successfully.
文摘The goals of this study are to assess the viability of waste tire-derived char(WTDC)as a sustainable,low-cost fine aggregate surrogate material for asphalt mixtures and to develop the statistically coupled neural network(SCNN)model for predicting volumetric and Marshall properties of asphalt mixtures modified with WTDC.The study is based on experimental data acquired from laboratory volumetric and Marshall properties testing on WTDCmodified asphalt mixtures(WTDC-MAM).The input variables comprised waste tire char content and asphalt binder content.The output variables comprised mixture unit weight,total voids,voids filled with asphalt,Marshall stability,and flow.Statistical coupled neural networks were utilized to predict the volumetric and Marshall properties of asphalt mixtures.For predictive modeling,the SCNN model is employed,incorporating a three-layer neural network and preprocessing techniques to enhance accuracy and reliability.The optimal network architecture,using the collected dataset,was a 2:6:5 structure,and the neural network was trained with 60%of the data,whereas the other 20%was used for cross-validation and testing respectively.The network employed a hyperbolic tangent(tanh)activation function and a feed-forward backpropagation.According to the results,the network model could accurately predict the volumetric and Marshall properties.The predicted accuracy of SCNN was found to be as high value>98%and low prediction errors for both volumetric and Marshall properties.This study demonstrates WTDC's potential as a low-cost,sustainable aggregate replacement.The SCNN-based predictive model proves its efficiency and versatility and promotes sustainable practices.
基金supported by the Fundamental Research Funds for the Central Universities (No.3122020072)the Multi-investment Project of Tianjin Applied Basic Research(No.23JCQNJC00250)。
文摘A hybrid identification model based on multilayer artificial neural networks(ANNs) and particle swarm optimization(PSO) algorithm is developed to improve the simultaneous identification efficiency of thermal conductivity and effective absorption coefficient of semitransparent materials.For the direct model,the spherical harmonic method and the finite volume method are used to solve the coupled conduction-radiation heat transfer problem in an absorbing,emitting,and non-scattering 2D axisymmetric gray medium in the background of laser flash method.For the identification part,firstly,the temperature field and the incident radiation field in different positions are chosen as observables.Then,a traditional identification model based on PSO algorithm is established.Finally,multilayer ANNs are built to fit and replace the direct model in the traditional identification model to speed up the identification process.The results show that compared with the traditional identification model,the time cost of the hybrid identification model is reduced by about 1 000 times.Besides,the hybrid identification model remains a high level of accuracy even with measurement errors.
文摘This study explores the factors influencing metro passengers’ arrival volume in Wuhan, China, and Lagos, Nigeria, by examining weather, time of day, waiting time, travel behavior, arrival patterns, and metro satisfaction. It addresses a significant research gap in understanding metro passengers’ dynamics across cultural and geographical contexts. It employs questionnaires, field observations, and advanced data analysis techniques like association rule mining and neural network modeling. Key findings include a correlation between rainy weather, shorter waiting times, and higher arrival volumes. Neural network models showed high predictive accuracy, with waiting time, metro satisfaction, and weather being significant factors in Lagos Light Rail Blue Line Metro. In contrast, arrival patterns, weather, and time of day were more influential in Wuhan Metro Line 5. Results suggest that improving metro satisfaction and reducing waiting times could increase arrival volumes in Lagos Metro while adjusting schedules for weather and peak times could optimize flow in Wuhan Metro. These insights are valuable for transportation planning, passenger arrival volume management, and enhancing user experiences, potentially benefiting urban transportation sustainability and development goals.
文摘This study introduces an innovative“Big Model”strategy to enhance Bridge Structural Health Monitoring(SHM)using a Convolutional Neural Network(CNN),time-frequency analysis,and fine element analysis.Leveraging ensemble methods,collaborative learning,and distributed computing,the approach effectively manages the complexity and scale of large-scale bridge data.The CNN employs transfer learning,fine-tuning,and continuous monitoring to optimize models for adaptive and accurate structural health assessments,focusing on extracting meaningful features through time-frequency analysis.By integrating Finite Element Analysis,time-frequency analysis,and CNNs,the strategy provides a comprehensive understanding of bridge health.Utilizing diverse sensor data,sophisticated feature extraction,and advanced CNN architecture,the model is optimized through rigorous preprocessing and hyperparameter tuning.This approach significantly enhances the ability to make accurate predictions,monitor structural health,and support proactive maintenance practices,thereby ensuring the safety and longevity of critical infrastructure.
基金the financial supports provided by the National Natural Science Foundation of China(Nos.91852112 and 11988102)。
文摘A mapping function between the Reynolds-averaged Navier-Stokes mean flow variables and transition intermittency factor is constructed by fully connected artificial neural network(ANN),which replaces the governing equation of the intermittency factor in transition-predictive Spalart-Allmaras(SA)-γmodel.By taking SA-γmodel as the benchmark,the present ANN model is trained at two airfoils with various angles of attack,Mach numbers and Reynolds numbers,and tested with unseen airfoils in different flow states.The a posteriori tests manifest that the mean pressure coefficient,skin friction coefficient,size of laminar separation bubble,mean streamwise velocity,Reynolds shear stress and lift/drag/moment coefficient from the present two-way coupling ANN model almost coincide with those from the benchmark SA-γmodel.Furthermore,the ANN model proves to exhibit a higher calculation efficiency and better convergence quality than traditional SA-γmodel.
基金Financial support provided by the National Natural Science Foundation of China(Grant Nos.11702042 and 91952104)。
文摘Fully connected neural networks(FCNNs)have been developed for the closure of subgrid-scale(SGS)stress and SGS heat flux in large-eddy simulations of compressible turbulent channel flow.The FCNNbased SGS model trained using data with Mach number Ma=3.0 and Reynolds number Re=3000 was applied to situations with different Mach numbers and Reynolds numbers.The input variables of the neural network model were the filtered velocity gradients and temperature gradients at a single spatial grid point.The a priori test showed that the FCNN model had a correlation coefficient larger than 0.91 and a relative error smaller than 0.43,with much better reconstructions of SGS unclosed terms than the dynamic Smagorinsky model(DSM).In a posteriori test,the behavior of the FCNN model was marginally better than that of the DSM in predicting the mean velocity profiles,mean temperature profiles,turbulent intensities,total Reynolds stress,total Reynolds heat flux,and mean SGS flux of kinetic energy,and outperformed the Smagorinsky model.
基金National Research Council of Thailand(NRCT)and Khon Kaen University:N42A650291.
文摘This works intends to provide numerical solutions based on the nonlinear fractional order derivatives of the classical White and Comiskey model(NFD-WCM).The fractional order derivatives have provided authentic and accurate solutions for the NDF-WCM.The solutions of the fractional NFD-WCM are provided using the stochastic computing supervised algorithm named Levenberg-Marquard Backpropagation(LMB)based on neural networks(NNs).This regression approach combines gradient descent and Gauss-Newton iterative methods,which means finding a solution through the sequences of different calculations.WCM is used to demonstrate the heroin epidemics.Heroin has been on-growth world wide,mainly in Asia,Europe,and the USA.It is the fourth foremost cause of death due to taking an overdose in the USA.The nonlinear mathematical system NFD-WCM discusses the overall circumstance of different drug users,such as suspected groups,drug users without treatment,and drug users with treatment.The numerical results of NFD-WCM via LMB-NNs have been substantiated through training,testing,and validation measures.The stability and accuracy are then checked through the statistical tool,such asmean square error(MSE),error histogram,and fitness curves.The suggested methodology’s strength is demonstrated by the high convergence between the reference solutions and the solutions generated by adding the efficacy of a constructed solver LMB-NNs,with accuracy levels ranging from 10?9 to 10?10.
基金Science and Technology Funds from the Liaoning Education Department(Serial Number:LJKZ0104).
文摘The motivation for this study is that the quality of deep fakes is constantly improving,which leads to the need to develop new methods for their detection.The proposed Customized Convolutional Neural Network method involves extracting structured data from video frames using facial landmark detection,which is then used as input to the CNN.The customized Convolutional Neural Network method is the date augmented-based CNN model to generate‘fake data’or‘fake images’.This study was carried out using Python and its libraries.We used 242 films from the dataset gathered by the Deep Fake Detection Challenge,of which 199 were made up and the remaining 53 were real.Ten seconds were allotted for each video.There were 318 videos used in all,199 of which were fake and 119 of which were real.Our proposedmethod achieved a testing accuracy of 91.47%,loss of 0.342,and AUC score of 0.92,outperforming two alternative approaches,CNN and MLP-CNN.Furthermore,our method succeeded in greater accuracy than contemporary models such as XceptionNet,Meso-4,EfficientNet-BO,MesoInception-4,VGG-16,and DST-Net.The novelty of this investigation is the development of a new Convolutional Neural Network(CNN)learning model that can accurately detect deep fake face photos.
基金supported by the National Natural Science Foundation of China(Grant Nos.62272120,62106030,U20B2046,62272119,61972105)the Technology Innovation and Application Development Projects of Chongqing(Grant Nos.cstc2021jscx-gksbX0032,cstc2021jscxgksbX0029).
文摘In recent years,with the great success of pre-trained language models,the pre-trained BERT model has been gradually applied to the field of source code understanding.However,the time cost of training a language model from zero is very high,and how to transfer the pre-trained language model to the field of smart contract vulnerability detection is a hot research direction at present.In this paper,we propose a hybrid model to detect common vulnerabilities in smart contracts based on a lightweight pre-trained languagemodel BERT and connected to a bidirectional gate recurrent unitmodel.The downstream neural network adopts the bidirectional gate recurrent unit neural network model with a hierarchical attention mechanism to mine more semantic features contained in the source code of smart contracts by using their characteristics.Our experiments show that our proposed hybrid neural network model SolBERT-BiGRU-Attention is fitted by a large number of data samples with smart contract vulnerabilities,and it is found that compared with the existing methods,the accuracy of our model can reach 93.85%,and the Micro-F1 Score is 94.02%.
基金funded by National Research Council of Thailand(NRCT)and Khon Kaen University:N42A650291.
文摘The motive of these investigations is to provide the importance and significance of the fractional order(FO)derivatives in the nonlinear environmental and economic(NEE)model,i.e.,FO-NEE model.The dynamics of the NEE model achieves more precise by using the form of the FO derivative.The investigations through the non-integer and nonlinear mathematical form to define the FO-NEE model are also provided in this study.The composition of the FO-NEEmodel is classified into three classes,execution cost of control,system competence of industrial elements and a new diagnostics technical exclusion cost.The mathematical FO-NEE system is numerically studied by using the artificial neural networks(ANNs)along with the Levenberg-Marquardt backpropagation method(ANNs-LMBM).Three different cases using the FO derivative have been examined to present the numerical performances of the FO-NEE model.The data is selected to solve the mathematical FO-NEE system is executed as 70%for training and 15%for both testing and certification.The exactness of the proposed ANNs-LMBM is observed through the comparison of the obtained and the Adams-Bashforth-Moulton database results.To ratify the aptitude,validity,constancy,exactness,and competence of the ANNs-LMBM,the numerical replications using the state transitions,regression,correlation,error histograms and mean square error are also described.
基金supported by the National Natural Science Foundation of China(Nos.62001023,61922013)Beijing Natural Science Foundation(No.4232013).
文摘To obtain excellent regression results under the condition of small sample hyperspectral data,a deep neural network with simulated annealing(SA-DNN)is proposed.According to the characteristics of data,the attention mechanism was applied to make the network pay more attention to effective features,thereby improving the operating efficiency.By introducing an improved activation function,the data correlation was reduced based on increasing the operation rate,and the problem of over-fitting was alleviated.By introducing simulated annealing,the network chose the optimal learning rate by itself,which avoided falling into the local optimum to the greatest extent.To evaluate the performance of the SA-DNN,the coefficient of determination(R^(2)),root mean square error(RMSE),and other metrics were used to evaluate the model.The results show that the performance of the SA-DNN is significantly better than other traditional methods.