The layered pavements usually exhibit complicated mechanical properties with the effect of complex material properties under external environment.In some cases,such as launching missiles or rockets,layered pavements a...The layered pavements usually exhibit complicated mechanical properties with the effect of complex material properties under external environment.In some cases,such as launching missiles or rockets,layered pavements are required to bear large impulse load.However,traditional methods cannot non-destructively and quickly detect the internal structural of pavements.Thus,accurate and fast prediction of the mechanical properties of layered pavements is of great importance and necessity.In recent years,machine learning has shown great superiority in solving nonlinear problems.In this work,we present a method of predicting the maximum deflection and damage factor of layered pavements under instantaneous large impact based on random forest regression with the deflection basin parameters obtained from falling weight deflection testing.The regression coefficient R^(2)of testing datasets are above 0.94 in the process of predicting the elastic moduli of structural layers and mechanical responses,which indicates that the prediction results have great consistency with finite element simulation results.This paper provides a novel method for fast and accurate prediction of pavement mechanical responses under instantaneous large impact load using partial structural parameters of pavements,and has application potential in non-destructive evaluation of pavement structure.展开更多
Remaining useful life(RUL) prediction is one of the most crucial elements in prognostics and health management(PHM). Aiming at the imperfect prior information, this paper proposes an RUL prediction method based on a n...Remaining useful life(RUL) prediction is one of the most crucial elements in prognostics and health management(PHM). Aiming at the imperfect prior information, this paper proposes an RUL prediction method based on a nonlinear random coefficient regression(RCR) model with fusing failure time data.Firstly, some interesting natures of parameters estimation based on the nonlinear RCR model are given. Based on these natures,the failure time data can be fused as the prior information reasonably. Specifically, the fixed parameters are calculated by the field degradation data of the evaluated equipment and the prior information of random coefficient is estimated with fusing the failure time data of congeneric equipment. Then, the prior information of the random coefficient is updated online under the Bayesian framework, the probability density function(PDF) of the RUL with considering the limitation of the failure threshold is performed. Finally, two case studies are used for experimental verification. Compared with the traditional Bayesian method, the proposed method can effectively reduce the influence of imperfect prior information and improve the accuracy of RUL prediction.展开更多
Given the challenge of estimating or calculating quantities of waste electrical and electronic equipment(WEEE)in developing countries,this article focuses on predicting the WEEE generated by Cameroonian small and medi...Given the challenge of estimating or calculating quantities of waste electrical and electronic equipment(WEEE)in developing countries,this article focuses on predicting the WEEE generated by Cameroonian small and medium enterprises(SMEs)that are engaged in ISO 14001:2015 initiatives and consume electrical and electronic equipment(EEE)to enhance their performance and profitability.The methodology employed an exploratory approach involving the application of general equilibrium theory(GET)to contextualize the study and generate relevant parameters for deploying the random forest regression learning algorithm for predictions.Machine learning was applied to 80%of the samples for training,while simulation was conducted on the remaining 20%of samples based on quantities of EEE utilized over a specific period,utilization rates,repair rates,and average lifespans.The results demonstrate that the model’s predicted values are significantly close to the actual quantities of generated WEEE,and the model’s performance was evaluated using the mean squared error(MSE)and yielding satisfactory results.Based on this model,both companies and stakeholders can set realistic objectives for managing companies’WEEE,fostering sustainable socio-environmental practices.展开更多
Given the rapid urbanization worldwide, Urban Heat Island(UHI) effect has been a severe issue limiting urban sustainability in both large and small cities. In order to study the spatial pattern of Surface urban heat i...Given the rapid urbanization worldwide, Urban Heat Island(UHI) effect has been a severe issue limiting urban sustainability in both large and small cities. In order to study the spatial pattern of Surface urban heat island(SUHI) in China’s Meihekou City, a combination method of Monte Carlo and Random Forest Regression(MC-RFR) is developed to construct the relationship between landscape pattern indices and Land Surface Temperature(LST). In this method, Monte Carlo acceptance-rejection sampling was added to the bootstrap layer of RFR to ensure the sensitivity of RFR to outliners of SUHI effect. The SHUI in 2030 was predicted by using this MC-RFR and the modeled future landscape pattern by Cellular Automata and Markov combination model(CA-Markov). Results reveal that forestland can greatly alleviate the impact of SUHI effect, while reasonable construction of urban land can also slow down the rising trend of SUHI. MC-RFR performs better for characterizing the relationship between landscape pattern and LST than single RFR or Linear Regression model. By 2030, the overall SUHI effect of Meihekou will be greatly enhanced, and the center of urban development will gradually shift to the central and western regions of the city. We suggest that urban designer and managers should concentrate vegetation and disperse built-up land to weaken the SUHI in the construction of new urban areas for its sustainability.展开更多
Due to the coarse scale of soil moisture products retrieved from passive microwave observations(SMPMW),several downscaling methods have been developed to enable regional scale applications.However,it can be challengin...Due to the coarse scale of soil moisture products retrieved from passive microwave observations(SMPMW),several downscaling methods have been developed to enable regional scale applications.However,it can be challenging for users to access final data products and algorithms,as well as managing different data sources and formats,various data processing methods,and the complexity of the workflows from raw data to information products.Here,the Google Earth Engine(GEE),which as of late offers SMPMW,is used to implement a workflow for retrieving 1 km SM at a depth of 0-5 cm using MODIS optical/thermal measurements,the SM_(PMW)coarse scale product,and a random forest regression.The proposed method was implemented on the African continent to estimate weekly SM maps.The results of this study were evaluated against in-situ measurements of three validation networks.Overall,in comparison to the original SM_(PMW)product,which was limited by a spatial resolution of only 9 km,this method is able to estimate SM at 1 km spatial resolution with acceptable accuracy(an average correlation coefficient of 0.64 and a ubRMSD of 0.069 m^(3)/m^(3)).The results show that the proposed method in GEE provides a precise estimation of SM with a higher spatial resolution across the entire continent.展开更多
Despite exploration and production success in Niger Delta,several failed wells have been encountered due to overpressures.Hence,it is very essential to understand the spatial distribution of pore pressure and the gene...Despite exploration and production success in Niger Delta,several failed wells have been encountered due to overpressures.Hence,it is very essential to understand the spatial distribution of pore pressure and the generating mechanisms in order to mitigate the pitfalls that might arise during drilling.This research provides estimates of pore pressure along three offshore wells using the Eaton's transit time method,multi-layer perceptron artificial neural network(MLP-ANN)and random forest regression(RFR)algorithms.Our results show that there are three pressure magnitude regimes:normal pressure zone(hydrostatic pressure),transition pressure zone(slightly above hydrostatic pressure),and over pressured zone(significantly above hydrostatic pressure).The top of the geopressured zone(2873 mbRT or 9425.853 ft)averagely marks the onset of overpressurization with the excess pore pressure above hydrostatic pressure(P∗)varying averagely along the three wells between 1.06−24.75 MPa.The results from the three methods are self-consistent with strong correlation between the Eaton's method and the two machine learning models.The models have high accuracy of about>97%,low mean absolute percentage error(MAPE<3%)and coefficient of determination(R2>0.98).Our results have also shown that the principal generating mechanisms responsible for high pore pressure in the offshore Niger Delta are disequilibrium compaction,unloading(fluid expansion)and shale diagenesis.展开更多
The aviation industry has seen significant advancements in safety procedures over the past few decades, resulting in a steady decline in aviation deaths worldwide. However, the safety standards in General Aviation (GA...The aviation industry has seen significant advancements in safety procedures over the past few decades, resulting in a steady decline in aviation deaths worldwide. However, the safety standards in General Aviation (GA) are still lower compared to those in commercial aviation. With the anticipated growth in air travel, there is an imminent need to improve operational safety in GA. One way to improve aircraft and operational safety is through trajectory prediction. Trajectory prediction plays a key role in optimizing air traffic control and improving overall flight safety. This paper proposes a meta-learning approach to predict short- to mid-term trajectories of aircraft using historical real flight data collected from multiple GA aircraft. The proposed solution brings together multiple models to improve prediction accuracy. In this paper, we are combining two models, Random Forest Regression (RFR) and Long Short-term Memory (LSTM), using k-Nearest Neighbors (k-NN), to output the final prediction based on the combined output of the individual models. This approach gives our model an edge over single-model predictions. We present the results of our meta-learner and evaluate its performance against individual models using the Mean Absolute Error (MAE), Absolute Altitude Error (AAE), and Root Mean Squared Error (RMSE) evaluation metrics. The proposed methodology for aircraft trajectory forecasting is discussed in detail, accompanied by a literature review and an overview of the data preprocessing techniques used. The results demonstrate that the proposed meta-learner outperforms individual models in terms of accuracy, providing a more robust and proactive approach to improve operational safety in GA.展开更多
Covariance functions have been proposed as an alternative to model longitudinal data in animal breeding because of their various merits in comparison to the classical analytical methods.In practical estimation,differe...Covariance functions have been proposed as an alternative to model longitudinal data in animal breeding because of their various merits in comparison to the classical analytical methods.In practical estimation,different models and polynomial orders fitted can influence the estimates of covariance functions and thus genetic parameters.The objective of this study was to select model for estimation of covariance functions for body weights of Angora goats at 7 time points.Covariance functions were estimated by fitting 6 random regression models with birth year,birth month,sex,age of dam,birth type,and relative birth date as fixed effects.Random effects involved were direct and maternal additive genetic,and animal and maternal permanent environmental effects with different orders of fit.Selection of model and orders of fit were carried out by likelihood ratio test and 4 types of information criteria.The results showed that model with 6 orders of polynomial fit for direct additive genetic and animal permanent environmental effects and 4 and 5 orders for maternal genetic and permanent environmental effects,respectively,were preferable for estimation of covariance functions.Models with and without maternal effects influenced the estimates of covariance functions greatly.Maternal permanent environmental effect does not explain the variation of all permanent environments,well suggesting different sources of permanent environmental effects also has large influence on covariance function estimates.展开更多
This paper adopts the NGI-ADP soil model to carry out finite element analysis,based on which the effects of soft clay anisotropy on the diaphragm wall deflections in the braced excavation were evaluated.More than one ...This paper adopts the NGI-ADP soil model to carry out finite element analysis,based on which the effects of soft clay anisotropy on the diaphragm wall deflections in the braced excavation were evaluated.More than one thousand finite element cases were numerically analyzed,followed by extensive parametric studies.Surrogate models were developed via ensemble learning methods(ELMs),including the e Xtreme Gradient Boosting(XGBoost),and Random Forest Regression(RFR)to predict the maximum lateral wall deformation(δhmax).Then the results of ELMs were compared with conventional soft computing methods such as Decision Tree Regression(DTR),Multilayer Perceptron Regression(MLPR),and Multivariate Adaptive Regression Splines(MARS).This study presents a cutting-edge application of ensemble learning in geotechnical engineering and a reasonable methodology that allows engineers to determine the wall deflection in a fast,alternative way.展开更多
Estimating spatial variation in crop transpiration coefficients(CTc) and aboveground biomass(AGB)rapidly and accurately by remote sensing can facilitate precision irrigation management in semiarid regions. This study ...Estimating spatial variation in crop transpiration coefficients(CTc) and aboveground biomass(AGB)rapidly and accurately by remote sensing can facilitate precision irrigation management in semiarid regions. This study developed and assessed a novel machine learning(ML) method for estimating CTc and AGB using time-series unmanned aerial vehicle(UAV)-based multispectral vegetation indices(VIs)of maize under several irrigation treatments at the field scale. Four ML regression methods: multiple linear regression(MLR), support vector regression(SVR), random forest regression(RFR), and adaptive boosting regression(ABR), were used to address the complex relationship between CTcand VIs. AGB was then estimated using exponential, logistic, sigmoid, and linear equations because of their clear mathematical formulations based on the optimal CTcestimation model. The UAV VIs-derived CTcusing the RFR estimation model yielded the highest accuracy(R^(2)= 0.91, RMSE = 0.0526, and n RMSE = 9.07%). The normalized difference red-edge index, transformed chlorophyll absorption in reflectance index, and simple ratio contributed significantly to the RFR-based CTcmodel. The accuracy of AGB estimation using nonlinear methods was higher than that using the linear method. The exponential method yielded the highest accuracy(R^(2)= 0.76, RMSE = 282.8 g m, and n RMSE = 39.24%) in both the 2018 and 2019 growing seasons. The study confirms that AGB estimation models based on cumulative CTcperformed well under several irrigation treatments using high-resolution time-series UAV multispectral VIs and can support irrigation management with high spatial precision at a field scale.展开更多
Objective:To study the application of a machine learning algorithm for predicting gestational diabetes mellitus(GDM)in early pregnancy.Methods:This study identified indicators related to GDM through a literature revie...Objective:To study the application of a machine learning algorithm for predicting gestational diabetes mellitus(GDM)in early pregnancy.Methods:This study identified indicators related to GDM through a literature review and expert discussion.Pregnant women who had attended medical institutions for an antenatal examination from November 2017 to August 2018 were selected for analysis,and the collected indicators were retrospectively analyzed.Based on Python,the indicators were classified and modeled using a random forest regression algorithm,and the performance of the prediction model was analyzed.Results:We obtained 4806 analyzable data from 1625 pregnant women.Among these,3265 samples with all 67 indicators were used to establish data set F1;4806 samples with 38 identical indicators were used to establish data set F2.Each of F1 and F2 was used for training the random forest algorithm.The overall predictive accuracy of the F1 model was 93.10%,area under the receiver operating characteristic curve(AUC)was 0.66,and the predictive accuracy of GDM-positive cases was 37.10%.The corresponding values for the F2 model were 88.70%,0.87,and 79.44%.The results thus showed that the F2 prediction model performed better than the F1 model.To explore the impact of sacrificial indicators on GDM prediction,the F3 data set was established using 3265 samples(F1)with 38 indicators(F2).After training,the overall predictive accuracy of the F3 model was 91.60%,AUC was 0.58,and the predictive accuracy of positive cases was 15.85%.Conclusions:In this study,a model for predicting GDM with several input variables(e.g.,physical examination,past history,personal history,family history,and laboratory indicators)was established using a random forest regression algorithm.The trained prediction model exhibited a good performance and is valuable as a reference for predicting GDM in women at an early stage of pregnancy.In addition,there are cer tain requirements for the propor tions of negative and positive cases in sample data sets when the random forest algorithm is applied to the early prediction of GDM.展开更多
To study riding safety at intersection entrance,video recognition technology is used to build vehicle-bicycle conflict models based on the Bayesian method.It is analyzed the relationship among the width of nonmotorize...To study riding safety at intersection entrance,video recognition technology is used to build vehicle-bicycle conflict models based on the Bayesian method.It is analyzed the relationship among the width of nonmotorized lanes at the entrance lane of the intersection,the vehicle-bicycle soft isolation form of the entrance lane of intersection,the traffic volume of right-turning motor vehicles and straight-going non-motor vehicles,the speed of right-turning motor vehicles,and straight-going non-motor vehicles,and the conflict between right-turning motor vehicles and straight-going nonmotor vehicles.Due to the traditional statistical methods,to overcome the discreteness of vehicle-bicycle conflict data and the differences of influencing factors,the Bayesian random effect Poisson-log-normal model and random effect negative binomial regression model are established.The results show that the random effect Poisson-log-normal model is better than the negative binomial distribution of random effects;The width of non-motorized lanes,the form of vehicle-bicycle soft isolation,the traffic volume of right-turning motor vehicles,and the coefficients of straight traffic volume obey a normal distribution.Among them,the type of vehicle-bicycle soft isolation facilities and the vehicle-bicycle traffic volumes are significantly positively correlated with the number of vehicle-bicycle conflicts.The width of non-motorized lanes is significantly negatively correlated with the number of vehicle-bicycle conflicts.Peak periods and flat periods,the average speed of right-turning motor vehicles,and the average speed of straight-going non-motor vehicles have no significant influence on the number of vehicle-bicycle conflicts.展开更多
Artificial intelligence(AI)and machine learning(ML)help in making predictions and businesses to make key decisions that are beneficial for them.In the case of the online shopping business,it’s very important to find ...Artificial intelligence(AI)and machine learning(ML)help in making predictions and businesses to make key decisions that are beneficial for them.In the case of the online shopping business,it’s very important to find trends in the data and get knowledge of features that helps drive the success of the business.In this research,a dataset of 12,330 records of customers has been analyzedwho visited an online shoppingwebsite over a period of one year.The main objective of this research is to find features that are relevant in terms of correctly predicting the purchasing decisions made by visiting customers and build ML models which could make correct predictions on unseen data in the future.The permutation feature importance approach has been used to get the importance of features according to the output variable(Revenue).Five ML models i.e.,decision tree(DT),random forest(RF),extra tree(ET)classifier,Neural networks(NN),and Logistic regression(LR)have been used to make predictions on the unseen data in the future.The performance of each model has been discussed in detail using performance measurement techniques such as accuracy score,precision,recall,F1 score,and ROC-AUC curve.RF model is the bestmodel among all five chosen based on accuracy score of 90%and F1 score of 79%followed by extra tree classifier.Hence,our study indicates that RF model can be used by online retailing businesses for predicting consumer buying behaviour.Our research also reveals the importance of page value as a key feature for capturing online purchasing trends.This may give a clue to future businesses who can focus on this specific feature and can find key factors behind page value success which in turn will help the online shopping business.展开更多
Due to unforeseen climate change,complicated chronic diseases,and mutation of viruses’hospital administration’s top challenge is to know about the Length of stay(LOS)of different diseased patients in the hospitals.H...Due to unforeseen climate change,complicated chronic diseases,and mutation of viruses’hospital administration’s top challenge is to know about the Length of stay(LOS)of different diseased patients in the hospitals.Hospital management does not exactly know when the existing patient leaves the hospital;this information could be crucial for hospital management.It could allow them to take more patients for admission.As a result,hospitals face many problems managing available resources and new patients in getting entries for their prompt treatment.Therefore,a robust model needs to be designed to help hospital administration predict patients’LOS to resolve these issues.For this purpose,a very large-sized data(more than 2.3 million patients’data)related to New-York Hospitals patients and containing information about a wide range of diseases including Bone-Marrow,Tuberculosis,Intestinal Transplant,Mental illness,Leukaemia,Spinal cord injury,Trauma,Rehabilitation,Kidney and Alcoholic Patients,HIV Patients,Malignant Breast disorder,Asthma,Respiratory distress syndrome,etc.have been analyzed to predict the LOS.We selected six Machine learning(ML)models named:Multiple linear regression(MLR),Lasso regression(LR),Ridge regression(RR),Decision tree regression(DTR),Extreme gradient boosting regression(XGBR),and Random Forest regression(RFR).The selected models’predictive performance was checked using R square andMean square error(MSE)as the performance evaluation criteria.Our results revealed the superior predictive performance of the RFRmodel,both in terms of RS score(92%)and MSE score(5),among all selected models.By Exploratory data analysis(EDA),we conclude that maximumstay was between 0 to 5 days with the meantime of each patient 5.3 days and more than 50 years old patients spent more days in the hospital.Based on the average LOS,results revealed that the patients with diagnoses related to birth complications spent more days in the hospital than other diseases.This finding could help predict the future length of hospital stay of new patients,which will help the hospital administration estimate and manage their resources efficiently.展开更多
Major fields such as military applications,medical fields,weather forecasting,and environmental applications use wireless sensor networks for major computing processes.Sensors play a vital role in emerging technologie...Major fields such as military applications,medical fields,weather forecasting,and environmental applications use wireless sensor networks for major computing processes.Sensors play a vital role in emerging technologies of the 20th century.Localization of sensors in needed locations is a very serious problem.The environment is home to every living being in the world.The growth of industries after the industrial revolution increased pollution across the environment.Owing to recent uncontrolled growth and development,sensors to measure pollution levels across industries and surroundings are needed.An interesting and challenging task is choosing the place to fit the sensors.Many meta-heuristic techniques have been introduced in node localization.Swarm intelligent algorithms have proven their efficiency in many studies on localization problems.In this article,we introduce an industrial-centric approach to solve the problem of node localization in the sensor network.First,our work aims at selecting industrial areas in the sensed location.We use random forest regression methodology to select the polluted area.Then,the elephant herding algorithm is used in sensor node localization.These two algorithms are combined to produce the best standard result in localizing the sensor nodes.To check the proposed performance,experiments are conducted with data from the KDD Cup 2018,which contain the name of 35 stations with concentrations of air pollutants such as PM,SO_(2),CO,NO_(2),and O_(3).These data are normalized and tested with algorithms.The results are comparatively analyzed with other swarm intelligence algorithms such as the elephant herding algorithm,particle swarm optimization,and machine learning algorithms such as decision tree regression and multi-layer perceptron.Results can indicate our proposed algorithm can suggest more meaningful locations for localizing the sensors in the topology.Our proposed method achieves a lower root mean square value with 0.06 to 0.08 for localizing with Stations 1 to 5.展开更多
In this paper, we have constructed a random weighting statistic to approximate the distribution of studentized least square estimator in a linear regression model with ideal accuracy o(n<sup>-1/2</sup>). T...In this paper, we have constructed a random weighting statistic to approximate the distribution of studentized least square estimator in a linear regression model with ideal accuracy o(n<sup>-1/2</sup>). Thus, we have provided a more practical distribution approximating method.展开更多
Nitrogen(N)as a pivotal factor in influencing the growth,development,and yield of maize.Monitoring the N status of maize rapidly and non-destructive and real-time is meaningful in fertilization management of agricultu...Nitrogen(N)as a pivotal factor in influencing the growth,development,and yield of maize.Monitoring the N status of maize rapidly and non-destructive and real-time is meaningful in fertilization management of agriculture,based on unmanned aerial vehicle(UAV)remote sensing technology.In this study,the hyperspectral images were acquired by UAV and the leaf nitrogen content(LNC)and leaf nitrogen accumulation(LNA)were measured to estimate the N nutrition status of maize.24 vegetation indices(VIs)were constructed using hyperspectral images,and four prediction models were used to estimate the LNC and LNA of maize.The models include a single linear regression model,multivariable linear regression(MLR)model,random forest regression(RFR)model,and support vector regression(SVR)model.Moreover,the model with the highest prediction accuracy was applied to invert the LNC and LNA of maize in breeding fields.The results of the single linear regression model with 24 VIs showed that normalized difference chlorophyll(NDchl)had the highest prediction accuracy for LNC(R^(2),RMSE,and RE were 0.72,0.21,and 12.19%,respectively)and LNA(R^(2),RMSE,and RE were 0.77,0.26,and 14.34%,respectively).And then,24 VIs were divided into 13 important VIs and 11 unimportant VIs.Three prediction models for LNC and LNA were constructed using 13 important VIs,and the results showed that RFR and SVR models significantly enhanced the prediction accuracy of LNC and LNA compared to the multivariable linear regression model,in which RFR model had the highest prediction accuracy for the validation dataset of LNC(R^(2),RMSE,and RE were 0.78,0.16,and 8.83%,respectively)and LNA(R^(2),RMSE,and RE were 0.85,0.19,and 9.88%,respectively).This study provides a theoretical basis for N diagnosis and precise management of crop production based on hyperspectral remote sensing in precision agriculture.展开更多
Flexible electrohydrodynamic(EHD)pumps have been developed and applied in many fields due to no transmission structure,no wear,easy manipulation,and no noise.Physical simulation is often used to predict the output per...Flexible electrohydrodynamic(EHD)pumps have been developed and applied in many fields due to no transmission structure,no wear,easy manipulation,and no noise.Physical simulation is often used to predict the output performance of flexible EHD pumps.However,this method neglects fluid–solid interaction and energy loss caused by flexible materials,which are both difficult to calculate when the flexible pumps deform.Therefore,this study proposes a flexible pump output performance prediction using machine learning algorithms.We used three different types of machine learning:random forest regression,ridge regression,and neural network to predict the critical parameters(pressure,flow rate,and power)of the flexible EHD pump.Voltage,angle,gap,overlap,and channel height are selected as five input data of the neural network.In addition,we optimized essential parameters in the three networks.Finally,we adopt the best predictive model and evaluate the significance of five input parameters to the output performance of the flexible EHD pumps.Among the three methods,the MLP model has exceptionally high accuracy in predicting pressure and flow.Our work is beneficial for the design process of fluid sources in flexible soft actuators and soft hydraulic sources in microfluidic chips.展开更多
HVDC system can realize a very fast frequency response to the disturbed system under a contingency because its active power control is decoupled from the frequency deviation.However,most of existing HVDC frequency con...HVDC system can realize a very fast frequency response to the disturbed system under a contingency because its active power control is decoupled from the frequency deviation.However,most of existing HVDC frequency control strategies are coupled with system primary frequency control and secondary frequency control.Since the traditional system frequency control is dominated by the thermal generators,the advantage of the fast response of the HVDC system is not made fully used.The development of a frequency response estimation based on a machine learning algorithm provides another approach to improve the frequency response capability of the HVDC system.Different from other frequency deviation tracking strategies,a machine learning based HVDC frequency response control can directly increase the power flow of a HVDC system by estimation of the system generator or load lost.In this paper,a fast frequency response control using a HVDC system for a large power system disturbance based on the multivariate random forest regression(MRFR)algorithm is proposed.The simulation is carried out with an integrated power system model based on the North American interconnections.The simulation results indicate that the proposed MRFR based frequency response control can significantly improve the frequency low point during an event,while stabilizing the frequency in advance.展开更多
The unprecedented coronavirus disease 2019(COVID-19)pandemic is still raging(in year 2021)in many countries worldwide.Various response strategies to study the characteristics and distributions of the virus in various ...The unprecedented coronavirus disease 2019(COVID-19)pandemic is still raging(in year 2021)in many countries worldwide.Various response strategies to study the characteristics and distributions of the virus in various regions of the world have been developed to assist in the prevention and control of this epidemic.Descriptive statistics and regression analysis on COVID-19 data from different countries were conducted in this study to compare and evaluate various regression models.Results showed that the extreme random forest regression(ERFR)model had the best performance,and factors such as population density,ozone,median age,life expectancy,and Human Development Index(HDI)were relatively influential on the spread and diffusion of COVID-19 in the ERFR model.In addition,the epidemic clustering characteristics were analyzed through the spectral clustering algorithm.The visualization results of spectral clustering showed that the geographical distribution of global COVID-19 pandemic spread formation was highly clustered,and its clustering characteristics and influencing factors also exhibited some consistency in distribution.This study aims to deepen the understanding of the international community regarding the global COVID-19 pandemic to develop measures for countries worldwide to mitigate potential large-scale outbreaks and improve the ability to respond to such public health emergencies.展开更多
基金Project supported in part by the National Natural Science Foundation of China(Grant No.12075168)the Fund from the Science and Technology Commission of Shanghai Municipality(Grant No.21JC1405600)。
文摘The layered pavements usually exhibit complicated mechanical properties with the effect of complex material properties under external environment.In some cases,such as launching missiles or rockets,layered pavements are required to bear large impulse load.However,traditional methods cannot non-destructively and quickly detect the internal structural of pavements.Thus,accurate and fast prediction of the mechanical properties of layered pavements is of great importance and necessity.In recent years,machine learning has shown great superiority in solving nonlinear problems.In this work,we present a method of predicting the maximum deflection and damage factor of layered pavements under instantaneous large impact based on random forest regression with the deflection basin parameters obtained from falling weight deflection testing.The regression coefficient R^(2)of testing datasets are above 0.94 in the process of predicting the elastic moduli of structural layers and mechanical responses,which indicates that the prediction results have great consistency with finite element simulation results.This paper provides a novel method for fast and accurate prediction of pavement mechanical responses under instantaneous large impact load using partial structural parameters of pavements,and has application potential in non-destructive evaluation of pavement structure.
基金supported by National Natural Science Foundation of China (61703410,61873175,62073336,61873273,61773386,61922089)。
文摘Remaining useful life(RUL) prediction is one of the most crucial elements in prognostics and health management(PHM). Aiming at the imperfect prior information, this paper proposes an RUL prediction method based on a nonlinear random coefficient regression(RCR) model with fusing failure time data.Firstly, some interesting natures of parameters estimation based on the nonlinear RCR model are given. Based on these natures,the failure time data can be fused as the prior information reasonably. Specifically, the fixed parameters are calculated by the field degradation data of the evaluated equipment and the prior information of random coefficient is estimated with fusing the failure time data of congeneric equipment. Then, the prior information of the random coefficient is updated online under the Bayesian framework, the probability density function(PDF) of the RUL with considering the limitation of the failure threshold is performed. Finally, two case studies are used for experimental verification. Compared with the traditional Bayesian method, the proposed method can effectively reduce the influence of imperfect prior information and improve the accuracy of RUL prediction.
文摘Given the challenge of estimating or calculating quantities of waste electrical and electronic equipment(WEEE)in developing countries,this article focuses on predicting the WEEE generated by Cameroonian small and medium enterprises(SMEs)that are engaged in ISO 14001:2015 initiatives and consume electrical and electronic equipment(EEE)to enhance their performance and profitability.The methodology employed an exploratory approach involving the application of general equilibrium theory(GET)to contextualize the study and generate relevant parameters for deploying the random forest regression learning algorithm for predictions.Machine learning was applied to 80%of the samples for training,while simulation was conducted on the remaining 20%of samples based on quantities of EEE utilized over a specific period,utilization rates,repair rates,and average lifespans.The results demonstrate that the model’s predicted values are significantly close to the actual quantities of generated WEEE,and the model’s performance was evaluated using the mean squared error(MSE)and yielding satisfactory results.Based on this model,both companies and stakeholders can set realistic objectives for managing companies’WEEE,fostering sustainable socio-environmental practices.
基金Under the auspices of National Natural Science Foundation of China(No.41977411,41771383)Technology Research Project of the Education Department of Jilin Province(No.JJKH20210445KJ)。
文摘Given the rapid urbanization worldwide, Urban Heat Island(UHI) effect has been a severe issue limiting urban sustainability in both large and small cities. In order to study the spatial pattern of Surface urban heat island(SUHI) in China’s Meihekou City, a combination method of Monte Carlo and Random Forest Regression(MC-RFR) is developed to construct the relationship between landscape pattern indices and Land Surface Temperature(LST). In this method, Monte Carlo acceptance-rejection sampling was added to the bootstrap layer of RFR to ensure the sensitivity of RFR to outliners of SUHI effect. The SHUI in 2030 was predicted by using this MC-RFR and the modeled future landscape pattern by Cellular Automata and Markov combination model(CA-Markov). Results reveal that forestland can greatly alleviate the impact of SUHI effect, while reasonable construction of urban land can also slow down the rising trend of SUHI. MC-RFR performs better for characterizing the relationship between landscape pattern and LST than single RFR or Linear Regression model. By 2030, the overall SUHI effect of Meihekou will be greatly enhanced, and the center of urban development will gradually shift to the central and western regions of the city. We suggest that urban designer and managers should concentrate vegetation and disperse built-up land to weaken the SUHI in the construction of new urban areas for its sustainability.
基金funded by the Deutsche Forschungsgemeinschaft(DFG,German Research Foundation)-SFB 1502/1-2022-project number:450058266.
文摘Due to the coarse scale of soil moisture products retrieved from passive microwave observations(SMPMW),several downscaling methods have been developed to enable regional scale applications.However,it can be challenging for users to access final data products and algorithms,as well as managing different data sources and formats,various data processing methods,and the complexity of the workflows from raw data to information products.Here,the Google Earth Engine(GEE),which as of late offers SMPMW,is used to implement a workflow for retrieving 1 km SM at a depth of 0-5 cm using MODIS optical/thermal measurements,the SM_(PMW)coarse scale product,and a random forest regression.The proposed method was implemented on the African continent to estimate weekly SM maps.The results of this study were evaluated against in-situ measurements of three validation networks.Overall,in comparison to the original SM_(PMW)product,which was limited by a spatial resolution of only 9 km,this method is able to estimate SM at 1 km spatial resolution with acceptable accuracy(an average correlation coefficient of 0.64 and a ubRMSD of 0.069 m^(3)/m^(3)).The results show that the proposed method in GEE provides a precise estimation of SM with a higher spatial resolution across the entire continent.
文摘Despite exploration and production success in Niger Delta,several failed wells have been encountered due to overpressures.Hence,it is very essential to understand the spatial distribution of pore pressure and the generating mechanisms in order to mitigate the pitfalls that might arise during drilling.This research provides estimates of pore pressure along three offshore wells using the Eaton's transit time method,multi-layer perceptron artificial neural network(MLP-ANN)and random forest regression(RFR)algorithms.Our results show that there are three pressure magnitude regimes:normal pressure zone(hydrostatic pressure),transition pressure zone(slightly above hydrostatic pressure),and over pressured zone(significantly above hydrostatic pressure).The top of the geopressured zone(2873 mbRT or 9425.853 ft)averagely marks the onset of overpressurization with the excess pore pressure above hydrostatic pressure(P∗)varying averagely along the three wells between 1.06−24.75 MPa.The results from the three methods are self-consistent with strong correlation between the Eaton's method and the two machine learning models.The models have high accuracy of about>97%,low mean absolute percentage error(MAPE<3%)and coefficient of determination(R2>0.98).Our results have also shown that the principal generating mechanisms responsible for high pore pressure in the offshore Niger Delta are disequilibrium compaction,unloading(fluid expansion)and shale diagenesis.
文摘The aviation industry has seen significant advancements in safety procedures over the past few decades, resulting in a steady decline in aviation deaths worldwide. However, the safety standards in General Aviation (GA) are still lower compared to those in commercial aviation. With the anticipated growth in air travel, there is an imminent need to improve operational safety in GA. One way to improve aircraft and operational safety is through trajectory prediction. Trajectory prediction plays a key role in optimizing air traffic control and improving overall flight safety. This paper proposes a meta-learning approach to predict short- to mid-term trajectories of aircraft using historical real flight data collected from multiple GA aircraft. The proposed solution brings together multiple models to improve prediction accuracy. In this paper, we are combining two models, Random Forest Regression (RFR) and Long Short-term Memory (LSTM), using k-Nearest Neighbors (k-NN), to output the final prediction based on the combined output of the individual models. This approach gives our model an edge over single-model predictions. We present the results of our meta-learner and evaluate its performance against individual models using the Mean Absolute Error (MAE), Absolute Altitude Error (AAE), and Root Mean Squared Error (RMSE) evaluation metrics. The proposed methodology for aircraft trajectory forecasting is discussed in detail, accompanied by a literature review and an overview of the data preprocessing techniques used. The results demonstrate that the proposed meta-learner outperforms individual models in terms of accuracy, providing a more robust and proactive approach to improve operational safety in GA.
基金funded by the Young Academic Leaders Supporting Project in Institutions of Higher Education of Shanxi Province,China
文摘Covariance functions have been proposed as an alternative to model longitudinal data in animal breeding because of their various merits in comparison to the classical analytical methods.In practical estimation,different models and polynomial orders fitted can influence the estimates of covariance functions and thus genetic parameters.The objective of this study was to select model for estimation of covariance functions for body weights of Angora goats at 7 time points.Covariance functions were estimated by fitting 6 random regression models with birth year,birth month,sex,age of dam,birth type,and relative birth date as fixed effects.Random effects involved were direct and maternal additive genetic,and animal and maternal permanent environmental effects with different orders of fit.Selection of model and orders of fit were carried out by likelihood ratio test and 4 types of information criteria.The results showed that model with 6 orders of polynomial fit for direct additive genetic and animal permanent environmental effects and 4 and 5 orders for maternal genetic and permanent environmental effects,respectively,were preferable for estimation of covariance functions.Models with and without maternal effects influenced the estimates of covariance functions greatly.Maternal permanent environmental effect does not explain the variation of all permanent environments,well suggesting different sources of permanent environmental effects also has large influence on covariance function estimates.
基金supported by the High-end Foreign Expert Introduction program(No.G20190022002)Chongqing Construction Science and Technology Plan Project(2019-0045)+1 种基金the Science and Technology Research Program of Chongqing Municipal Education Commission(Grant No.KJZD-K201900102)The financial support is gratefully acknowledged。
文摘This paper adopts the NGI-ADP soil model to carry out finite element analysis,based on which the effects of soft clay anisotropy on the diaphragm wall deflections in the braced excavation were evaluated.More than one thousand finite element cases were numerically analyzed,followed by extensive parametric studies.Surrogate models were developed via ensemble learning methods(ELMs),including the e Xtreme Gradient Boosting(XGBoost),and Random Forest Regression(RFR)to predict the maximum lateral wall deformation(δhmax).Then the results of ELMs were compared with conventional soft computing methods such as Decision Tree Regression(DTR),Multilayer Perceptron Regression(MLPR),and Multivariate Adaptive Regression Splines(MARS).This study presents a cutting-edge application of ensemble learning in geotechnical engineering and a reasonable methodology that allows engineers to determine the wall deflection in a fast,alternative way.
基金funded by the National Natural Science Foundation of China (51979233)the Natural Science Basic Research Plan in Shaanxi Province of China (2022JQ-363)。
文摘Estimating spatial variation in crop transpiration coefficients(CTc) and aboveground biomass(AGB)rapidly and accurately by remote sensing can facilitate precision irrigation management in semiarid regions. This study developed and assessed a novel machine learning(ML) method for estimating CTc and AGB using time-series unmanned aerial vehicle(UAV)-based multispectral vegetation indices(VIs)of maize under several irrigation treatments at the field scale. Four ML regression methods: multiple linear regression(MLR), support vector regression(SVR), random forest regression(RFR), and adaptive boosting regression(ABR), were used to address the complex relationship between CTcand VIs. AGB was then estimated using exponential, logistic, sigmoid, and linear equations because of their clear mathematical formulations based on the optimal CTcestimation model. The UAV VIs-derived CTcusing the RFR estimation model yielded the highest accuracy(R^(2)= 0.91, RMSE = 0.0526, and n RMSE = 9.07%). The normalized difference red-edge index, transformed chlorophyll absorption in reflectance index, and simple ratio contributed significantly to the RFR-based CTcmodel. The accuracy of AGB estimation using nonlinear methods was higher than that using the linear method. The exponential method yielded the highest accuracy(R^(2)= 0.76, RMSE = 282.8 g m, and n RMSE = 39.24%) in both the 2018 and 2019 growing seasons. The study confirms that AGB estimation models based on cumulative CTcperformed well under several irrigation treatments using high-resolution time-series UAV multispectral VIs and can support irrigation management with high spatial precision at a field scale.
基金supported by the Qingdao Municipal Bureau of Science and Technology(No.19-6-1-55-nsh)。
文摘Objective:To study the application of a machine learning algorithm for predicting gestational diabetes mellitus(GDM)in early pregnancy.Methods:This study identified indicators related to GDM through a literature review and expert discussion.Pregnant women who had attended medical institutions for an antenatal examination from November 2017 to August 2018 were selected for analysis,and the collected indicators were retrospectively analyzed.Based on Python,the indicators were classified and modeled using a random forest regression algorithm,and the performance of the prediction model was analyzed.Results:We obtained 4806 analyzable data from 1625 pregnant women.Among these,3265 samples with all 67 indicators were used to establish data set F1;4806 samples with 38 identical indicators were used to establish data set F2.Each of F1 and F2 was used for training the random forest algorithm.The overall predictive accuracy of the F1 model was 93.10%,area under the receiver operating characteristic curve(AUC)was 0.66,and the predictive accuracy of GDM-positive cases was 37.10%.The corresponding values for the F2 model were 88.70%,0.87,and 79.44%.The results thus showed that the F2 prediction model performed better than the F1 model.To explore the impact of sacrificial indicators on GDM prediction,the F3 data set was established using 3265 samples(F1)with 38 indicators(F2).After training,the overall predictive accuracy of the F3 model was 91.60%,AUC was 0.58,and the predictive accuracy of positive cases was 15.85%.Conclusions:In this study,a model for predicting GDM with several input variables(e.g.,physical examination,past history,personal history,family history,and laboratory indicators)was established using a random forest regression algorithm.The trained prediction model exhibited a good performance and is valuable as a reference for predicting GDM in women at an early stage of pregnancy.In addition,there are cer tain requirements for the propor tions of negative and positive cases in sample data sets when the random forest algorithm is applied to the early prediction of GDM.
基金This work was supported in part by the Ministry of Education of the People’s Republic of China Project of Humanities and Social Sciences under Grant No.19YJCZH208,author X.X,http://www.moe.gov.cn/in part by the Social Sciences Federation Think Tank Project of Hunan Province under Grant No.ZK2019025,author X.X,http://www.hnsk.gov.cn/+3 种基金in part by the Education Bureau Research Foundation Project of Hunan Province under Grant No.20A531,author X.X,http://jyt.hunan.gov.cn/in part by the Science and Technology Project of Changsha City,under Grant No.kq2004092,author X.X,http://kjj.changsha.gov.cn/in part by Key Subjects of the State Forestry Bureau in China under Grant No.[2016]21,author X.X,http://www.forestry.gov.cn/and in part by“Double First-Class”Cultivation Discipline of Hunan Province in China under Grant No.[2018]469,author X.X,http://jyt.hunan.gov.cn/.
文摘To study riding safety at intersection entrance,video recognition technology is used to build vehicle-bicycle conflict models based on the Bayesian method.It is analyzed the relationship among the width of nonmotorized lanes at the entrance lane of the intersection,the vehicle-bicycle soft isolation form of the entrance lane of intersection,the traffic volume of right-turning motor vehicles and straight-going non-motor vehicles,the speed of right-turning motor vehicles,and straight-going non-motor vehicles,and the conflict between right-turning motor vehicles and straight-going nonmotor vehicles.Due to the traditional statistical methods,to overcome the discreteness of vehicle-bicycle conflict data and the differences of influencing factors,the Bayesian random effect Poisson-log-normal model and random effect negative binomial regression model are established.The results show that the random effect Poisson-log-normal model is better than the negative binomial distribution of random effects;The width of non-motorized lanes,the form of vehicle-bicycle soft isolation,the traffic volume of right-turning motor vehicles,and the coefficients of straight traffic volume obey a normal distribution.Among them,the type of vehicle-bicycle soft isolation facilities and the vehicle-bicycle traffic volumes are significantly positively correlated with the number of vehicle-bicycle conflicts.The width of non-motorized lanes is significantly negatively correlated with the number of vehicle-bicycle conflicts.Peak periods and flat periods,the average speed of right-turning motor vehicles,and the average speed of straight-going non-motor vehicles have no significant influence on the number of vehicle-bicycle conflicts.
文摘Artificial intelligence(AI)and machine learning(ML)help in making predictions and businesses to make key decisions that are beneficial for them.In the case of the online shopping business,it’s very important to find trends in the data and get knowledge of features that helps drive the success of the business.In this research,a dataset of 12,330 records of customers has been analyzedwho visited an online shoppingwebsite over a period of one year.The main objective of this research is to find features that are relevant in terms of correctly predicting the purchasing decisions made by visiting customers and build ML models which could make correct predictions on unseen data in the future.The permutation feature importance approach has been used to get the importance of features according to the output variable(Revenue).Five ML models i.e.,decision tree(DT),random forest(RF),extra tree(ET)classifier,Neural networks(NN),and Logistic regression(LR)have been used to make predictions on the unseen data in the future.The performance of each model has been discussed in detail using performance measurement techniques such as accuracy score,precision,recall,F1 score,and ROC-AUC curve.RF model is the bestmodel among all five chosen based on accuracy score of 90%and F1 score of 79%followed by extra tree classifier.Hence,our study indicates that RF model can be used by online retailing businesses for predicting consumer buying behaviour.Our research also reveals the importance of page value as a key feature for capturing online purchasing trends.This may give a clue to future businesses who can focus on this specific feature and can find key factors behind page value success which in turn will help the online shopping business.
文摘Due to unforeseen climate change,complicated chronic diseases,and mutation of viruses’hospital administration’s top challenge is to know about the Length of stay(LOS)of different diseased patients in the hospitals.Hospital management does not exactly know when the existing patient leaves the hospital;this information could be crucial for hospital management.It could allow them to take more patients for admission.As a result,hospitals face many problems managing available resources and new patients in getting entries for their prompt treatment.Therefore,a robust model needs to be designed to help hospital administration predict patients’LOS to resolve these issues.For this purpose,a very large-sized data(more than 2.3 million patients’data)related to New-York Hospitals patients and containing information about a wide range of diseases including Bone-Marrow,Tuberculosis,Intestinal Transplant,Mental illness,Leukaemia,Spinal cord injury,Trauma,Rehabilitation,Kidney and Alcoholic Patients,HIV Patients,Malignant Breast disorder,Asthma,Respiratory distress syndrome,etc.have been analyzed to predict the LOS.We selected six Machine learning(ML)models named:Multiple linear regression(MLR),Lasso regression(LR),Ridge regression(RR),Decision tree regression(DTR),Extreme gradient boosting regression(XGBR),and Random Forest regression(RFR).The selected models’predictive performance was checked using R square andMean square error(MSE)as the performance evaluation criteria.Our results revealed the superior predictive performance of the RFRmodel,both in terms of RS score(92%)and MSE score(5),among all selected models.By Exploratory data analysis(EDA),we conclude that maximumstay was between 0 to 5 days with the meantime of each patient 5.3 days and more than 50 years old patients spent more days in the hospital.Based on the average LOS,results revealed that the patients with diagnoses related to birth complications spent more days in the hospital than other diseases.This finding could help predict the future length of hospital stay of new patients,which will help the hospital administration estimate and manage their resources efficiently.
文摘Major fields such as military applications,medical fields,weather forecasting,and environmental applications use wireless sensor networks for major computing processes.Sensors play a vital role in emerging technologies of the 20th century.Localization of sensors in needed locations is a very serious problem.The environment is home to every living being in the world.The growth of industries after the industrial revolution increased pollution across the environment.Owing to recent uncontrolled growth and development,sensors to measure pollution levels across industries and surroundings are needed.An interesting and challenging task is choosing the place to fit the sensors.Many meta-heuristic techniques have been introduced in node localization.Swarm intelligent algorithms have proven their efficiency in many studies on localization problems.In this article,we introduce an industrial-centric approach to solve the problem of node localization in the sensor network.First,our work aims at selecting industrial areas in the sensed location.We use random forest regression methodology to select the polluted area.Then,the elephant herding algorithm is used in sensor node localization.These two algorithms are combined to produce the best standard result in localizing the sensor nodes.To check the proposed performance,experiments are conducted with data from the KDD Cup 2018,which contain the name of 35 stations with concentrations of air pollutants such as PM,SO_(2),CO,NO_(2),and O_(3).These data are normalized and tested with algorithms.The results are comparatively analyzed with other swarm intelligence algorithms such as the elephant herding algorithm,particle swarm optimization,and machine learning algorithms such as decision tree regression and multi-layer perceptron.Results can indicate our proposed algorithm can suggest more meaningful locations for localizing the sensors in the topology.Our proposed method achieves a lower root mean square value with 0.06 to 0.08 for localizing with Stations 1 to 5.
基金Supported by the Doctoral Program Foundation of the Institute of Higher Educationthe National Natural Science Foundation of China.
文摘In this paper, we have constructed a random weighting statistic to approximate the distribution of studentized least square estimator in a linear regression model with ideal accuracy o(n<sup>-1/2</sup>). Thus, we have provided a more practical distribution approximating method.
基金financially supported by the Hainan Province Science and Technology Special Fund(Grant No.ZDYF2021GXJS038 and Grant No.ZDYF2024XDNY196)Hainan Provincial Natural Science Foundation of China(Grant No.320RC486)the National Natural Science Foundation of China(Grant No.42167011).
文摘Nitrogen(N)as a pivotal factor in influencing the growth,development,and yield of maize.Monitoring the N status of maize rapidly and non-destructive and real-time is meaningful in fertilization management of agriculture,based on unmanned aerial vehicle(UAV)remote sensing technology.In this study,the hyperspectral images were acquired by UAV and the leaf nitrogen content(LNC)and leaf nitrogen accumulation(LNA)were measured to estimate the N nutrition status of maize.24 vegetation indices(VIs)were constructed using hyperspectral images,and four prediction models were used to estimate the LNC and LNA of maize.The models include a single linear regression model,multivariable linear regression(MLR)model,random forest regression(RFR)model,and support vector regression(SVR)model.Moreover,the model with the highest prediction accuracy was applied to invert the LNC and LNA of maize in breeding fields.The results of the single linear regression model with 24 VIs showed that normalized difference chlorophyll(NDchl)had the highest prediction accuracy for LNC(R^(2),RMSE,and RE were 0.72,0.21,and 12.19%,respectively)and LNA(R^(2),RMSE,and RE were 0.77,0.26,and 14.34%,respectively).And then,24 VIs were divided into 13 important VIs and 11 unimportant VIs.Three prediction models for LNC and LNA were constructed using 13 important VIs,and the results showed that RFR and SVR models significantly enhanced the prediction accuracy of LNC and LNA compared to the multivariable linear regression model,in which RFR model had the highest prediction accuracy for the validation dataset of LNC(R^(2),RMSE,and RE were 0.78,0.16,and 8.83%,respectively)and LNA(R^(2),RMSE,and RE were 0.85,0.19,and 9.88%,respectively).This study provides a theoretical basis for N diagnosis and precise management of crop production based on hyperspectral remote sensing in precision agriculture.
基金supported by Grant-in-Aid for Early-Career Scientists from the Japan Society for the Promotion of Science(23K13290),Japan.
文摘Flexible electrohydrodynamic(EHD)pumps have been developed and applied in many fields due to no transmission structure,no wear,easy manipulation,and no noise.Physical simulation is often used to predict the output performance of flexible EHD pumps.However,this method neglects fluid–solid interaction and energy loss caused by flexible materials,which are both difficult to calculate when the flexible pumps deform.Therefore,this study proposes a flexible pump output performance prediction using machine learning algorithms.We used three different types of machine learning:random forest regression,ridge regression,and neural network to predict the critical parameters(pressure,flow rate,and power)of the flexible EHD pump.Voltage,angle,gap,overlap,and channel height are selected as five input data of the neural network.In addition,we optimized essential parameters in the three networks.Finally,we adopt the best predictive model and evaluate the significance of five input parameters to the output performance of the flexible EHD pumps.Among the three methods,the MLP model has exceptionally high accuracy in predicting pressure and flow.Our work is beneficial for the design process of fluid sources in flexible soft actuators and soft hydraulic sources in microfluidic chips.
基金supported primarily by the Engineering Research Center Program of the National Science Foundation and the Department of Energy under NSF Award Number EEC-1041877 and the CURENT Industry Partnership Program.
文摘HVDC system can realize a very fast frequency response to the disturbed system under a contingency because its active power control is decoupled from the frequency deviation.However,most of existing HVDC frequency control strategies are coupled with system primary frequency control and secondary frequency control.Since the traditional system frequency control is dominated by the thermal generators,the advantage of the fast response of the HVDC system is not made fully used.The development of a frequency response estimation based on a machine learning algorithm provides another approach to improve the frequency response capability of the HVDC system.Different from other frequency deviation tracking strategies,a machine learning based HVDC frequency response control can directly increase the power flow of a HVDC system by estimation of the system generator or load lost.In this paper,a fast frequency response control using a HVDC system for a large power system disturbance based on the multivariate random forest regression(MRFR)algorithm is proposed.The simulation is carried out with an integrated power system model based on the North American interconnections.The simulation results indicate that the proposed MRFR based frequency response control can significantly improve the frequency low point during an event,while stabilizing the frequency in advance.
基金This work was supported in part by the National University Student Innovation and Entrepreneurship in Training Program of China(No.202110373044)the National Natural Science Foundation of China(No.71801108)+1 种基金the Laboratory Opening Project Fund of Huaibei Normal University(No.2021sykf041)the Special Needs Project of Huaibei Normal University(No.2021zlgc147)。
文摘The unprecedented coronavirus disease 2019(COVID-19)pandemic is still raging(in year 2021)in many countries worldwide.Various response strategies to study the characteristics and distributions of the virus in various regions of the world have been developed to assist in the prevention and control of this epidemic.Descriptive statistics and regression analysis on COVID-19 data from different countries were conducted in this study to compare and evaluate various regression models.Results showed that the extreme random forest regression(ERFR)model had the best performance,and factors such as population density,ozone,median age,life expectancy,and Human Development Index(HDI)were relatively influential on the spread and diffusion of COVID-19 in the ERFR model.In addition,the epidemic clustering characteristics were analyzed through the spectral clustering algorithm.The visualization results of spectral clustering showed that the geographical distribution of global COVID-19 pandemic spread formation was highly clustered,and its clustering characteristics and influencing factors also exhibited some consistency in distribution.This study aims to deepen the understanding of the international community regarding the global COVID-19 pandemic to develop measures for countries worldwide to mitigate potential large-scale outbreaks and improve the ability to respond to such public health emergencies.