BACKGROUND Intensive care unit-acquired weakness(ICU-AW)is a common complication that significantly impacts the patient's recovery process,even leading to adverse outcomes.Currently,there is a lack of effective pr...BACKGROUND Intensive care unit-acquired weakness(ICU-AW)is a common complication that significantly impacts the patient's recovery process,even leading to adverse outcomes.Currently,there is a lack of effective preventive measures.AIM To identify significant risk factors for ICU-AW through iterative machine learning techniques and offer recommendations for its prevention and treatment.METHODS Patients were categorized into ICU-AW and non-ICU-AW groups on the 14th day post-ICU admission.Relevant data from the initial 14 d of ICU stay,such as age,comorbidities,sedative dosage,vasopressor dosage,duration of mechanical ventilation,length of ICU stay,and rehabilitation therapy,were gathered.The relationships between these variables and ICU-AW were examined.Utilizing iterative machine learning techniques,a multilayer perceptron neural network model was developed,and its predictive performance for ICU-AW was assessed using the receiver operating characteristic curve.RESULTS Within the ICU-AW group,age,duration of mechanical ventilation,lorazepam dosage,adrenaline dosage,and length of ICU stay were significantly higher than in the non-ICU-AW group.Additionally,sepsis,multiple organ dysfunction syndrome,hypoalbuminemia,acute heart failure,respiratory failure,acute kidney injury,anemia,stress-related gastrointestinal bleeding,shock,hypertension,coronary artery disease,malignant tumors,and rehabilitation therapy ratios were significantly higher in the ICU-AW group,demonstrating statistical significance.The most influential factors contributing to ICU-AW were identified as the length of ICU stay(100.0%)and the duration of mechanical ventilation(54.9%).The neural network model predicted ICU-AW with an area under the curve of 0.941,sensitivity of 92.2%,and specificity of 82.7%.CONCLUSION The main factors influencing ICU-AW are the length of ICU stay and the duration of mechanical ventilation.A primary preventive strategy,when feasible,involves minimizing both ICU stay and mechanical ventilation duration.展开更多
In response to the United Nations Sustainable Development Goals and China’s“Dual Carbon”Goals(DCGs means the goals of“Carbon Peak and carbon neutrality”),this paper from the perspective of the construction of Ch...In response to the United Nations Sustainable Development Goals and China’s“Dual Carbon”Goals(DCGs means the goals of“Carbon Peak and carbon neutrality”),this paper from the perspective of the construction of China’s Innovation Demonstration Zones for Sustainable Development Agenda(IDZSDAs),combines carbon emission-related metrics to construct a comprehensive assessment system for Urban Sustainable Development Capacity(USDC).After obtaining USDC assessment results through the assessment system,an approach combining Least Absolute Shrinkage and Selection Operator(LASSO)regression and Random Forest(RF)based on machine learning is proposed for identifying influencing factors and characterizing key issues.Combining Coupling Coordination Degree(CCD)analysis,the study further summarizes the systemic patterns and future directions of urban sustainable development.A case study on the IDZSDAs from 2015 to 2022 reveals that:(1)the combined identification method based on machine learning and CCD models effectively quantifies influencing factors and key issues in the urban sustainable development process;(2)the correspondence between influencing factors and key subsystems identified by the LASSO-RF combination model is generally consistent with the development situations in various cities;and(3)the machine learning-based combined recognition method is scalable and dynamic.It enables decision-makers to accurately identify influencing factors and characterize key issues based on actual urban development needs.展开更多
Cardiovascular disease(CVD)has gradually become one of the main causes of harm to the life and health of residents.Exploring the influencing factors and risk assessment methods of CVD has become a general trend.In thi...Cardiovascular disease(CVD)has gradually become one of the main causes of harm to the life and health of residents.Exploring the influencing factors and risk assessment methods of CVD has become a general trend.In this paper,a machine learning-based decision-making mechanism for risk assessment of CVD is designed.In this mechanism,the logistics regression analysismethod and factor analysismodel are used to select age,obesity degree,blood pressure,blood fat,blood sugar,smoking status,drinking status,and exercise status as the main pathogenic factors of CVD,and an index systemof risk assessment for CVD is established.Then,a two-stage model combining K-means cluster analysis and random forest(RF)is proposed to evaluate and predict the risk of CVD,and the predicted results are compared with the methods of Bayesian discrimination,K-means cluster analysis and RF.The results show that thepredictioneffect of theproposedtwo-stagemodel is better than that of the comparedmethods.Moreover,several suggestions for the government,the medical industry and the public are provided based on the research results.展开更多
To perform landslide susceptibility prediction(LSP),it is important to select appropriate mapping unit and landslide-related conditioning factors.The efficient and automatic multi-scale segmentation(MSS)method propose...To perform landslide susceptibility prediction(LSP),it is important to select appropriate mapping unit and landslide-related conditioning factors.The efficient and automatic multi-scale segmentation(MSS)method proposed by the authors promotes the application of slope units.However,LSP modeling based on these slope units has not been performed.Moreover,the heterogeneity of conditioning factors in slope units is neglected,leading to incomplete input variables of LSP modeling.In this study,the slope units extracted by the MSS method are used to construct LSP modeling,and the heterogeneity of conditioning factors is represented by the internal variations of conditioning factors within slope unit using the descriptive statistics features of mean,standard deviation and range.Thus,slope units-based machine learning models considering internal variations of conditioning factors(variant slope-machine learning)are proposed.The Chongyi County is selected as the case study and is divided into 53,055 slope units.Fifteen original slope unit-based conditioning factors are expanded to 38 slope unit-based conditioning factors through considering their internal variations.Random forest(RF)and multi-layer perceptron(MLP)machine learning models are used to construct variant Slope-RF and Slope-MLP models.Meanwhile,the Slope-RF and Slope-MLP models without considering the internal variations of conditioning factors,and conventional grid units-based machine learning(Grid-RF and MLP)models are built for comparisons through the LSP performance assessments.Results show that the variant Slopemachine learning models have higher LSP performances than Slope-machine learning models;LSP results of variant Slope-machine learning models have stronger directivity and practical application than Grid-machine learning models.It is concluded that slope units extracted by MSS method can be appropriate for LSP modeling,and the heterogeneity of conditioning factors within slope units can more comprehensively reflect the relationships between conditioning factors and landslides.The research results have important reference significance for land use and landslide prevention.展开更多
Multi-phase machines are so attractive for electrical machine designers because of their valuable advantages such as high reliability and fault tolerant ability.Meanwhile,fractional slot concentrated windings(FSCW)are...Multi-phase machines are so attractive for electrical machine designers because of their valuable advantages such as high reliability and fault tolerant ability.Meanwhile,fractional slot concentrated windings(FSCW)are well known because of short end winding length,simple structure,field weakening sufficiency,fault tolerant capability and higher slot fill factor.The five-phase machines equipped with FSCW,are very good candidates for the purpose of designing motors for high reliable applications,like electric cars,major transporting buses,high speed trains and massive trucks.But,in comparison to the general distributed windings,the FSCWs contain high magnetomotive force(MMF)space harmonic contents,which cause unwanted effects on the machine ability,such as localized iron saturation and core losses.This manuscript introduces several new five-phase fractional slot winding layouts,by the means of slot shifting concept in order to design the new types of synchronous reluctance motors(SynRels).In order to examine the proposed winding’s performances,three sample machines are designed as case studies,and analytical study and finite element analysis(FEA)is used for validation.展开更多
The Indian Himalayan region is frequently experiencing climate change-induced landslides.Thus,landslide susceptibility assessment assumes greater significance for lessening the impact of a landslide hazard.This paper ...The Indian Himalayan region is frequently experiencing climate change-induced landslides.Thus,landslide susceptibility assessment assumes greater significance for lessening the impact of a landslide hazard.This paper makes an attempt to assess landslide susceptibility in Shimla district of the northwest Indian Himalayan region.It examined the effectiveness of random forest(RF),multilayer perceptron(MLP),sequential minimal optimization regression(SMOreg)and bagging ensemble(B-RF,BSMOreg,B-MLP)models.A landslide inventory map comprising 1052 locations of past landslide occurrences was classified into training(70%)and testing(30%)datasets.The site-specific influencing factors were selected by employing a multicollinearity test.The relationship between past landslide occurrences and influencing factors was established using the frequency ratio method.The effectiveness of machine learning models was verified through performance assessors.The landslide susceptibility maps were validated by the area under the receiver operating characteristic curves(ROC-AUC),accuracy,precision,recall and F1-score.The key performance metrics and map validation demonstrated that the BRF model(correlation coefficient:0.988,mean absolute error:0.010,root mean square error:0.058,relative absolute error:2.964,ROC-AUC:0.947,accuracy:0.778,precision:0.819,recall:0.917 and F-1 score:0.865)outperformed the single classifiers and other bagging ensemble models for landslide susceptibility.The results show that the largest area was found under the very high susceptibility zone(33.87%),followed by the low(27.30%),high(20.68%)and moderate(18.16%)susceptibility zones.The factors,namely average annual rainfall,slope,lithology,soil texture and earthquake magnitude have been identified as the influencing factors for very high landslide susceptibility.Soil texture,lineament density and elevation have been attributed to high and moderate susceptibility.Thus,the study calls for devising suitable landslide mitigation measures in the study area.Structural measures,an immediate response system,community participation and coordination among stakeholders may help lessen the detrimental impact of landslides.The findings from this study could aid decision-makers in mitigating future catastrophes and devising suitable strategies in other geographical regions with similar geological characteristics.展开更多
Intensive care unit-acquired weakness(ICU-AW)significantly hampers patient recovery and increases morbidity.With the absence of established preventive strategies,this study utilizes advanced machine learning methodolo...Intensive care unit-acquired weakness(ICU-AW)significantly hampers patient recovery and increases morbidity.With the absence of established preventive strategies,this study utilizes advanced machine learning methodologies to unearth key predictors of ICU-AW.Employing a sophisticated multilayer perceptron neural network,the research methodically assesses the predictive power for ICU-AW,pinpointing the length of ICU stay and duration of mechanical ventilation as pivotal risk factors.The findings advocate for minimizing these elements as a preventive approach,offering a novel perspective on combating ICU-AW.This research illuminates critical risk factors and lays the groundwork for future explorations into effective prevention and intervention strategies.展开更多
Underwater direction of arrival(DOA)estimation has always been a very challenging theoretical and practical problem.Due to the serious non-stationary,non-linear,and non-Gaussian characteristics,machine learning based ...Underwater direction of arrival(DOA)estimation has always been a very challenging theoretical and practical problem.Due to the serious non-stationary,non-linear,and non-Gaussian characteristics,machine learning based DOA estimation methods trained on simulated Gaussian noised array data cannot be directly applied to actual underwater DOA estimation tasks.In order to deal with this problem,environmental data with no target echoes can be employed to analyze the non-Gaussian components.Then,the obtained information about non-Gaussian components can be used to whiten the array data.Based on these considerations,a novel practical sonar array whitening method was proposed.Specifically,based on a weak assumption that the non-Gaussian components in adjacent patches with and without target echoes are almost the same,canonical cor-relation analysis(CCA)and non-negative matrix factorization(NMF)techniques are employed for whitening the array data.With the whitened array data,machine learning based DOA estimation models trained on simulated Gaussian noised datasets can be used to perform underwater DOA estimation tasks.Experimental results illustrated that,using actual underwater datasets for testing with known machine learning based DOA estimation models,accurate and robust DOA estimation performance can be achieved by using the proposed whitening method in different underwater con-ditions.展开更多
Software-Defined Network(SDN)decouples the control plane of network devices from the data plane.While alleviating the problems presented in traditional network architectures,it also brings potential security risks,par...Software-Defined Network(SDN)decouples the control plane of network devices from the data plane.While alleviating the problems presented in traditional network architectures,it also brings potential security risks,particularly network Denial-of-Service(DoS)attacks.While many research efforts have been devoted to identifying new features for DoS attack detection,detection methods are less accurate in detecting DoS attacks against client hosts due to the high stealth of such attacks.To solve this problem,a new method of DoS attack detection based on Deep Factorization Machine(DeepFM)is proposed in SDN.Firstly,we select the Growth Rate of Max Matched Packets(GRMMP)in SDN as detection feature.Then,the DeepFM algorithm is used to extract features from flow rules and classify them into dense and discrete features to detect DoS attacks.After training,the model can be used to infer whether SDN is under DoS attacks,and a DeepFM-based detection method for DoS attacks against client host is implemented.Simulation results show that our method can effectively detect DoS attacks in SDN.Compared with the K-Nearest Neighbor(K-NN),Artificial Neural Network(ANN)models,Support Vector Machine(SVM)and Random Forest models,our proposed method outperforms in accuracy,precision and F1 values.展开更多
A comprehensive and precise analysis of shale gas production performance is crucial for evaluating resource potential,designing a field development plan,and making investment decisions.However,quantitative analysis ca...A comprehensive and precise analysis of shale gas production performance is crucial for evaluating resource potential,designing a field development plan,and making investment decisions.However,quantitative analysis can be challenging because production performance is dominated by the complex interaction among a series of geological and engineering factors.In fact,each factor can be viewed as a player who makes cooperative contributions to the production payoff within the constraints of physical laws and models.Inspired by the idea,we propose a hybrid data-driven analysis framework in this study,where the contributions of dominant factors are quantitatively evaluated,the productions are precisely forecasted,and the development optimization suggestions are comprehensively generated.More specifically,game theory and machine learning models are coupled to determine the dominating geological and engineering factors.The Shapley value with definite physical meaning is employed to quantitatively measure the effects of individual factors.A multi-model-fused stacked model is trained for production forecast,which provides the basis for derivative-free optimization algorithms to optimize the development plan.The complete workflow is validated with actual production data collected from the Fuling shale gas field,Sichuan Basin,China.The validation results show that the proposed procedure can draw rigorous conclusions with quantified evidence and thereby provide specific and reliable suggestions for development plan optimization.Comparing with traditional and experience-based approaches,the hybrid data-driven procedure is advanced in terms of both efficiency and accuracy.展开更多
A procedure to recognize individual discontinuities in rock mass from measurement while drilling(MWD)technology is developed,using the binary pattern of structural rock characteristics obtained from in-hole images for...A procedure to recognize individual discontinuities in rock mass from measurement while drilling(MWD)technology is developed,using the binary pattern of structural rock characteristics obtained from in-hole images for calibration.Data from two underground operations with different drilling technology and different rock mass characteristics are considered,which generalizes the application of the methodology to different sites and ensures the full operational integration of MWD data analysis.Two approaches are followed for site-specific structural model building:a discontinuity index(DI)built from variations in MWD parameters,and a machine learning(ML)classifier as function of the drilling parameters and their variability.The prediction ability of the models is quantitatively assessed as the rate of recognition of discontinuities observed in borehole logs.Differences between the parameters involved in the models for each site,and differences in their weights,highlight the site-dependence of the resulting models.The ML approach offers better performance than the classical DI,with recognition rates in the range 89%to 96%.However,the simpler DI still yields fairly accurate results,with recognition rates 70%to 90%.These results validate the adaptive MWD-based methodology as an engineering solution to predict rock structural condition in underground mining operations.展开更多
Landslide is a serious natural disaster next only to earthquake and flood,which will cause a great threat to people’s lives and property safety.The traditional research of landslide disaster based on experience-drive...Landslide is a serious natural disaster next only to earthquake and flood,which will cause a great threat to people’s lives and property safety.The traditional research of landslide disaster based on experience-driven or statistical model and its assessment results are subjective,difficult to quantify,and no pertinence.As a new research method for landslide susceptibility assessment,machine learning can greatly improve the landslide susceptibility model’s accuracy by constructing statistical models.Taking Western Henan for example,the study selected 16 landslide influencing factors such as topography,geological environment,hydrological conditions,and human activities,and 11 landslide factors with the most significant influence on the landslide were selected by the recursive feature elimination(RFE)method.Five machine learning methods[Support Vector Machines(SVM),Logistic Regression(LR),Random Forest(RF),Extreme Gradient Boosting(XGBoost),and Linear Discriminant Analysis(LDA)]were used to construct the spatial distribution model of landslide susceptibility.The models were evaluated by the receiver operating characteristic curve and statistical index.After analysis and comparison,the XGBoost model(AUC 0.8759)performed the best and was suitable for dealing with regression problems.The model had a high adaptability to landslide data.According to the landslide susceptibility map of the five models,the overall distribution can be observed.The extremely high and high susceptibility areas are distributed in the Funiu Mountain range in the southwest,the Xiaoshan Mountain range in the west,and the Yellow River Basin in the north.These areas have large terrain fluctuations,complicated geological structural environments and frequent human engineering activities.The extremely high and highly prone areas were 12043.3 km^(2)and 3087.45 km^(2),accounting for 47.61%and 12.20%of the total area of the study area,respectively.Our study reflects the distribution of landslide susceptibility in western Henan Province,which provides a scientific basis for regional disaster warning,prediction,and resource protection.The study has important practical significance for subsequent landslide disaster management.展开更多
BACKGROUND Geriatric hip fractures are one of the most common fractures in elderly individuals,and prolonged hospital stays increase the risk of death and complications.Machine learning(ML)has become prevalent in clin...BACKGROUND Geriatric hip fractures are one of the most common fractures in elderly individuals,and prolonged hospital stays increase the risk of death and complications.Machine learning(ML)has become prevalent in clinical data processing and predictive models.This study aims to develop ML models for predicting extended length of stay(eLOS)among geriatric patients with hip fractures and to identify the associated risk factors.AIM To develop ML models for predicting the eLOS among geriatric patients with hip fractures,identify associated risk factors,and compare the performance of each model.METHODS A retrospective study was conducted at a single orthopaedic trauma centre,enrolling all patients who underwent hip fracture surgery between January 2018 and December 2022.The study collected various patient characteristics,encompassing demographic data,general health status,injury-related data,laboratory examinations,surgery-related data,and length of stay.Features that exhibited significant differences in univariate analysis were integrated into the ML model establishment and subsequently cross-verified.The study compared the performance of the ML models and determined the risk factors for eLOS.RESULTS The study included 763 patients,with 380 experiencing eLOS.Among the models,the decision tree,random forest,and extreme Gradient Boosting models demonstrated the most robust performance.Notably,the artificial neural network model also exhibited impressive results.After cross-validation,the support vector machine and logistic regression models demonstrated superior performance.Predictors for eLOS included delayed surgery,D-dimer level,American Society of Anaesthesiologists(ASA)classification,type of surgery,and sex.CONCLUSION ML proved to be highly accurate in predicting the eLOS for geriatric patients with hip fractures.The identified key risk factors were delayed surgery,D-dimer level,ASA classification,type of surgery,and sex.This valuable information can aid clinicians in allocating resources more efficiently to meet patient demand effectively.展开更多
Harmonic analysis, the traditional tidal forecasting method, cannot take into account the impact of noncyclical factors, and is also based on the BP neural network tidal prediction model which is easily limited by the...Harmonic analysis, the traditional tidal forecasting method, cannot take into account the impact of noncyclical factors, and is also based on the BP neural network tidal prediction model which is easily limited by the amount of data. According to the movement of celestial bodies, and considering the insufficient tidal characteristics of historical data which are impacted by the nonperiodic weather, a tidal prediction method is designed based on support vector machine (SVM) to carry out the simulation experiment by using tidal data from Xiamen Tide Gauge, Luchaogang Tide Gauge and Weifang Tide Gauge individually. And the results show that the model satisfactorily carries out the tide prediction which is influenced by noncyclical factors. At the same time, it also proves that the proposed prediction method, which when compared with harmonic analysis method and the BP neural network method, has faster modeling speed, higher prediction precision and stronger generalization ability.展开更多
This investigation assessed the efficacy of 10 widely used machine learning algorithms(MLA)comprising the least absolute shrinkage and selection operator(LASSO),generalized linear model(GLM),stepwise generalized linea...This investigation assessed the efficacy of 10 widely used machine learning algorithms(MLA)comprising the least absolute shrinkage and selection operator(LASSO),generalized linear model(GLM),stepwise generalized linear model(SGLM),elastic net(ENET),partial least square(PLS),ridge regression,support vector machine(SVM),classification and regression trees(CART),bagged CART,and random forest(RF)for gully erosion susceptibility mapping(GESM)in Iran.The location of 462 previously existing gully erosion sites were mapped through widespread field investigations,of which 70%(323)and 30%(139)of observations were arbitrarily divided for algorithm calibration and validation.Twelve controlling factors for gully erosion,namely,soil texture,annual mean rainfall,digital elevation model(DEM),drainage density,slope,lithology,topographic wetness index(TWI),distance from rivers,aspect,distance from roads,plan curvature,and profile curvature were ranked in terms of their importance using each MLA.The MLA were compared using a training dataset for gully erosion and statistical measures such as RMSE(root mean square error),MAE(mean absolute error),and R-squared.Based on the comparisons among MLA,the RF algorithm exhibited the minimum RMSE and MAE and the maximum value of R-squared,and was therefore selected as the best model.The variable importance evaluation using the RF model revealed that distance from rivers had the highest significance in influencing the occurrence of gully erosion whereas plan curvature had the least importance.According to the GESM generated using RF,most of the study area is predicted to have a low(53.72%)or moderate(29.65%)susceptibility to gully erosion,whereas only a small area is identified to have a high(12.56%)or very high(4.07%)susceptibility.The outcome generated by RF model is validated using the ROC(Receiver Operating Characteristics)curve approach,which returned an area under the curve(AUC)of 0.985,proving the excellent forecasting ability of the model.The GESM prepared using the RF algorithm can aid decision-makers in targeting remedial actions for minimizing the damage caused by gully erosion.展开更多
Background: Nonalcoholic fatty liver disease(NAFLD) had become the most prevalent liver disease worldwide. Early diagnosis could effectively reduce NAFLD-related morbidity and mortality. This study aimed to combine th...Background: Nonalcoholic fatty liver disease(NAFLD) had become the most prevalent liver disease worldwide. Early diagnosis could effectively reduce NAFLD-related morbidity and mortality. This study aimed to combine the risk factors to develop and validate a novel model for predicting NAFLD. Methods: We enrolled 578 participants completing abdominal ultrasound into the training set. The least absolute shrinkage and selection operator(LASSO) regression combined with random forest(RF) was conducted to screen significant predictors for NAFLD risk. Five machine learning models including logistic regression(LR), RF, extreme gradient boosting(XGBoost), gradient boosting machine(GBM), and support vector machine(SVM) were developed. To further improve model performance, we conducted hyperparameter tuning with train function in Python package ‘sklearn’. We included 131 participants completing magnetic resonance imaging into the testing set for external validation. Results: There were 329 participants with NAFLD and 249 without in the training set, while 96 with NAFLD and 35 without were in the testing set. Visceral adiposity index, abdominal circumference, body mass index, alanine aminotransferase(ALT), ALT/AST(aspartate aminotransferase), age, high-density lipoprotein cholesterol(HDL-C) and elevated triglyceride(TG) were important predictors for NAFLD risk. The area under curve(AUC) of LR, RF, XGBoost, GBM, SVM were 0.915 [95% confidence interval(CI): 0.886–0.937], 0.907(95% CI: 0.856–0.938), 0.928(95% CI: 0.873–0.944), 0.924(95% CI: 0.875–0.939), and 0.900(95% CI: 0.883–0.913), respectively. XGBoost model presented the best predictive performance, and its AUC was enhanced to 0.938(95% CI: 0.870–0.950) with further parameter tuning. Conclusions: This study developed and validated five novel machine learning models for NAFLD prediction, among which XGBoost presented the best performance and was considered a reliable reference for early identification of high-risk patients with NAFLD in clinical practice.展开更多
The power factor is the ratio between the active and apparent power,and it is available to determine the operational capability of the intended circuit or the parts.The excitation current of the synchronous motor is a...The power factor is the ratio between the active and apparent power,and it is available to determine the operational capability of the intended circuit or the parts.The excitation current of the synchronous motor is an essential parameter required for adjusting the power factor because it determines whether the motor is under the optimal operating status.Although the excitation current should predict with the experimental devices,such a method is unsuitable for online real-time prediction.The artificial intelligence algorithm can compensate for the defect of conventional measurement methods requiring the measuring devices and the model optimization is compared during the research process.In this article,the load current,power factor,and power factor errors available in the existing dataset are used as the input parameters for training the proposed artificial intelligence algorithms to select the optimal algorithm according to the training result,for this algorithm to have higher accuracy.The SMOGN(Synthetic Minority Over-Sampling Technique for Regression with Gaussian Noise)is selected for the research by which the data and the MFO(Moth-flame optimization algorithm)are created for the model to adjust and optimize the parameters automatically.In addition to enhancing the prediction accuracy for the excitation current,the automatic parameter adjusting method also allows the researchers not specializing in the professional algorithm to apply such application method more efficiently.The final result indicated that the prediction accuracy has reached“Mean Absolute Error(MAE)=0.0057,Root Mean Square Error(RMSE)=0.0093 andR2 score=0.9973”.Applying this method to themotor control would be much easier for the power factor adjustment in the future because it allows the motor to operate under the optimal power status to reduce energy consumption while enhancing working efficiency.展开更多
In the live broadcast process,eye movement characteristics can reflect people’s attention to the product.However,the existing interest degree predictive model research does not consider the eye movement characteristi...In the live broadcast process,eye movement characteristics can reflect people’s attention to the product.However,the existing interest degree predictive model research does not consider the eye movement characteristics.In order to obtain the users’interest in the product more effectively,we will consider the key eye movement indicators.We first collect eye movement characteristics based on the self-developed data processing algorithm fast discriminative model prediction for tracking(FDIMP),and then we add data dimensions to the original data set through information filling.In addition,we apply the deep factorization machine(DeepFM)architecture to simultaneously learn the combination of low-level and high-level features.In order to effectively learn important features and emphasize relatively important features,the multi-head attention mechanism is applied in the interest model.The experimental results on the public data set Criteo show that,compared with the original DeepFM algorithm,the area under curve(AUC)value was improved by up to 9.32%.展开更多
In this article, research was conducted to improve Linter machines that remove short fibers remaining in ginned cotton seeds at cotton ginneries. The study examined the effect of changing the dimensions of the brush d...In this article, research was conducted to improve Linter machines that remove short fibers remaining in ginned cotton seeds at cotton ginneries. The study examined the effect of changing the dimensions of the brush drum, guide and mesh surface in the cleaning device proposed for the linting machine on the movement of the peg and the cleaning efficiency, and the highest level of efficiency in separating impurities from the peg was determined. During the study, the main factors influencing the effective operation of the improved linting machine were identified, the limits of their values were determined, and studies were carried out using the mathematical modeling method. As a result, at the values of the given coefficients, efficient operation of the improved linting machine was observed, that is, the lint cleaning efficiency reached 55.1%.展开更多
The proliferation of internet communication channels has increased telecom fraud,causing billions of euros in losses for customers and the industry each year.Fraudsters constantly find new ways to engage in illegal ac...The proliferation of internet communication channels has increased telecom fraud,causing billions of euros in losses for customers and the industry each year.Fraudsters constantly find new ways to engage in illegal activity on the network.To reduce these losses,a new fraud detection approach is required.Telecom fraud detection involves identifying a small number of fraudulent calls from a vast amount of call traffic.Developing an effective strategy to combat fraud has become challenging.Although much effort has been made to detect fraud,most existing methods are designed for batch processing,not real-time detection.To solve this problem,we propose an online fraud detection model using a Neural Factorization Autoencoder(NFA),which analyzes customer calling patterns to detect fraudulent calls.The model employs Neural Factorization Machines(NFM)and an Autoencoder(AE)to model calling patterns and a memory module to adapt to changing customer behaviour.We evaluate our approach on a large dataset of real-world call detail records and compare it with several state-of-the-art methods.Our results show that our approach outperforms the baselines,with an AUC of 91.06%,a TPR of 91.89%,an FPR of 14.76%,and an F1-score of 95.45%.These results demonstrate the effectiveness of our approach in detecting fraud in real-time and suggest that it can be a valuable tool for preventing fraud in telecommunications networks.展开更多
基金Supported by Science and Technology Support Program of Qiandongnan Prefecture,No.Qiandongnan Sci-Tech Support[2021]12Guizhou Province High-Level Innovative Talent Training Program,No.Qiannan Thousand Talents[2022]201701.
文摘BACKGROUND Intensive care unit-acquired weakness(ICU-AW)is a common complication that significantly impacts the patient's recovery process,even leading to adverse outcomes.Currently,there is a lack of effective preventive measures.AIM To identify significant risk factors for ICU-AW through iterative machine learning techniques and offer recommendations for its prevention and treatment.METHODS Patients were categorized into ICU-AW and non-ICU-AW groups on the 14th day post-ICU admission.Relevant data from the initial 14 d of ICU stay,such as age,comorbidities,sedative dosage,vasopressor dosage,duration of mechanical ventilation,length of ICU stay,and rehabilitation therapy,were gathered.The relationships between these variables and ICU-AW were examined.Utilizing iterative machine learning techniques,a multilayer perceptron neural network model was developed,and its predictive performance for ICU-AW was assessed using the receiver operating characteristic curve.RESULTS Within the ICU-AW group,age,duration of mechanical ventilation,lorazepam dosage,adrenaline dosage,and length of ICU stay were significantly higher than in the non-ICU-AW group.Additionally,sepsis,multiple organ dysfunction syndrome,hypoalbuminemia,acute heart failure,respiratory failure,acute kidney injury,anemia,stress-related gastrointestinal bleeding,shock,hypertension,coronary artery disease,malignant tumors,and rehabilitation therapy ratios were significantly higher in the ICU-AW group,demonstrating statistical significance.The most influential factors contributing to ICU-AW were identified as the length of ICU stay(100.0%)and the duration of mechanical ventilation(54.9%).The neural network model predicted ICU-AW with an area under the curve of 0.941,sensitivity of 92.2%,and specificity of 82.7%.CONCLUSION The main factors influencing ICU-AW are the length of ICU stay and the duration of mechanical ventilation.A primary preventive strategy,when feasible,involves minimizing both ICU stay and mechanical ventilation duration.
基金supported by the National Key Research and Development Program of China under the sub-theme“Research on the Path of Enhancing the Sustainable Development Capacity of Cities and Towns under the Carbon Neutral Goal”[Grant No.2022YFC3802902-04].
文摘In response to the United Nations Sustainable Development Goals and China’s“Dual Carbon”Goals(DCGs means the goals of“Carbon Peak and carbon neutrality”),this paper from the perspective of the construction of China’s Innovation Demonstration Zones for Sustainable Development Agenda(IDZSDAs),combines carbon emission-related metrics to construct a comprehensive assessment system for Urban Sustainable Development Capacity(USDC).After obtaining USDC assessment results through the assessment system,an approach combining Least Absolute Shrinkage and Selection Operator(LASSO)regression and Random Forest(RF)based on machine learning is proposed for identifying influencing factors and characterizing key issues.Combining Coupling Coordination Degree(CCD)analysis,the study further summarizes the systemic patterns and future directions of urban sustainable development.A case study on the IDZSDAs from 2015 to 2022 reveals that:(1)the combined identification method based on machine learning and CCD models effectively quantifies influencing factors and key issues in the urban sustainable development process;(2)the correspondence between influencing factors and key subsystems identified by the LASSO-RF combination model is generally consistent with the development situations in various cities;and(3)the machine learning-based combined recognition method is scalable and dynamic.It enables decision-makers to accurately identify influencing factors and characterize key issues based on actual urban development needs.
基金This work is supported by the National Natural Science Foundation of China(Nos.72071150,71871174).
文摘Cardiovascular disease(CVD)has gradually become one of the main causes of harm to the life and health of residents.Exploring the influencing factors and risk assessment methods of CVD has become a general trend.In this paper,a machine learning-based decision-making mechanism for risk assessment of CVD is designed.In this mechanism,the logistics regression analysismethod and factor analysismodel are used to select age,obesity degree,blood pressure,blood fat,blood sugar,smoking status,drinking status,and exercise status as the main pathogenic factors of CVD,and an index systemof risk assessment for CVD is established.Then,a two-stage model combining K-means cluster analysis and random forest(RF)is proposed to evaluate and predict the risk of CVD,and the predicted results are compared with the methods of Bayesian discrimination,K-means cluster analysis and RF.The results show that thepredictioneffect of theproposedtwo-stagemodel is better than that of the comparedmethods.Moreover,several suggestions for the government,the medical industry and the public are provided based on the research results.
基金funded by the Natural Science Foundation of China(Grant Nos.41807285,41972280 and 52179103).
文摘To perform landslide susceptibility prediction(LSP),it is important to select appropriate mapping unit and landslide-related conditioning factors.The efficient and automatic multi-scale segmentation(MSS)method proposed by the authors promotes the application of slope units.However,LSP modeling based on these slope units has not been performed.Moreover,the heterogeneity of conditioning factors in slope units is neglected,leading to incomplete input variables of LSP modeling.In this study,the slope units extracted by the MSS method are used to construct LSP modeling,and the heterogeneity of conditioning factors is represented by the internal variations of conditioning factors within slope unit using the descriptive statistics features of mean,standard deviation and range.Thus,slope units-based machine learning models considering internal variations of conditioning factors(variant slope-machine learning)are proposed.The Chongyi County is selected as the case study and is divided into 53,055 slope units.Fifteen original slope unit-based conditioning factors are expanded to 38 slope unit-based conditioning factors through considering their internal variations.Random forest(RF)and multi-layer perceptron(MLP)machine learning models are used to construct variant Slope-RF and Slope-MLP models.Meanwhile,the Slope-RF and Slope-MLP models without considering the internal variations of conditioning factors,and conventional grid units-based machine learning(Grid-RF and MLP)models are built for comparisons through the LSP performance assessments.Results show that the variant Slopemachine learning models have higher LSP performances than Slope-machine learning models;LSP results of variant Slope-machine learning models have stronger directivity and practical application than Grid-machine learning models.It is concluded that slope units extracted by MSS method can be appropriate for LSP modeling,and the heterogeneity of conditioning factors within slope units can more comprehensively reflect the relationships between conditioning factors and landslides.The research results have important reference significance for land use and landslide prevention.
文摘Multi-phase machines are so attractive for electrical machine designers because of their valuable advantages such as high reliability and fault tolerant ability.Meanwhile,fractional slot concentrated windings(FSCW)are well known because of short end winding length,simple structure,field weakening sufficiency,fault tolerant capability and higher slot fill factor.The five-phase machines equipped with FSCW,are very good candidates for the purpose of designing motors for high reliable applications,like electric cars,major transporting buses,high speed trains and massive trucks.But,in comparison to the general distributed windings,the FSCWs contain high magnetomotive force(MMF)space harmonic contents,which cause unwanted effects on the machine ability,such as localized iron saturation and core losses.This manuscript introduces several new five-phase fractional slot winding layouts,by the means of slot shifting concept in order to design the new types of synchronous reluctance motors(SynRels).In order to examine the proposed winding’s performances,three sample machines are designed as case studies,and analytical study and finite element analysis(FEA)is used for validation.
文摘The Indian Himalayan region is frequently experiencing climate change-induced landslides.Thus,landslide susceptibility assessment assumes greater significance for lessening the impact of a landslide hazard.This paper makes an attempt to assess landslide susceptibility in Shimla district of the northwest Indian Himalayan region.It examined the effectiveness of random forest(RF),multilayer perceptron(MLP),sequential minimal optimization regression(SMOreg)and bagging ensemble(B-RF,BSMOreg,B-MLP)models.A landslide inventory map comprising 1052 locations of past landslide occurrences was classified into training(70%)and testing(30%)datasets.The site-specific influencing factors were selected by employing a multicollinearity test.The relationship between past landslide occurrences and influencing factors was established using the frequency ratio method.The effectiveness of machine learning models was verified through performance assessors.The landslide susceptibility maps were validated by the area under the receiver operating characteristic curves(ROC-AUC),accuracy,precision,recall and F1-score.The key performance metrics and map validation demonstrated that the BRF model(correlation coefficient:0.988,mean absolute error:0.010,root mean square error:0.058,relative absolute error:2.964,ROC-AUC:0.947,accuracy:0.778,precision:0.819,recall:0.917 and F-1 score:0.865)outperformed the single classifiers and other bagging ensemble models for landslide susceptibility.The results show that the largest area was found under the very high susceptibility zone(33.87%),followed by the low(27.30%),high(20.68%)and moderate(18.16%)susceptibility zones.The factors,namely average annual rainfall,slope,lithology,soil texture and earthquake magnitude have been identified as the influencing factors for very high landslide susceptibility.Soil texture,lineament density and elevation have been attributed to high and moderate susceptibility.Thus,the study calls for devising suitable landslide mitigation measures in the study area.Structural measures,an immediate response system,community participation and coordination among stakeholders may help lessen the detrimental impact of landslides.The findings from this study could aid decision-makers in mitigating future catastrophes and devising suitable strategies in other geographical regions with similar geological characteristics.
文摘Intensive care unit-acquired weakness(ICU-AW)significantly hampers patient recovery and increases morbidity.With the absence of established preventive strategies,this study utilizes advanced machine learning methodologies to unearth key predictors of ICU-AW.Employing a sophisticated multilayer perceptron neural network,the research methodically assesses the predictive power for ICU-AW,pinpointing the length of ICU stay and duration of mechanical ventilation as pivotal risk factors.The findings advocate for minimizing these elements as a preventive approach,offering a novel perspective on combating ICU-AW.This research illuminates critical risk factors and lays the groundwork for future explorations into effective prevention and intervention strategies.
基金supported by the National Natural Science Foundation of China(No.51279033).
文摘Underwater direction of arrival(DOA)estimation has always been a very challenging theoretical and practical problem.Due to the serious non-stationary,non-linear,and non-Gaussian characteristics,machine learning based DOA estimation methods trained on simulated Gaussian noised array data cannot be directly applied to actual underwater DOA estimation tasks.In order to deal with this problem,environmental data with no target echoes can be employed to analyze the non-Gaussian components.Then,the obtained information about non-Gaussian components can be used to whiten the array data.Based on these considerations,a novel practical sonar array whitening method was proposed.Specifically,based on a weak assumption that the non-Gaussian components in adjacent patches with and without target echoes are almost the same,canonical cor-relation analysis(CCA)and non-negative matrix factorization(NMF)techniques are employed for whitening the array data.With the whitened array data,machine learning based DOA estimation models trained on simulated Gaussian noised datasets can be used to perform underwater DOA estimation tasks.Experimental results illustrated that,using actual underwater datasets for testing with known machine learning based DOA estimation models,accurate and robust DOA estimation performance can be achieved by using the proposed whitening method in different underwater con-ditions.
基金This work was funded by the Researchers Supporting Project No.(RSP-2021/102)King Saud University,Riyadh,Saudi ArabiaThis work was supported by the Research Project on Teaching Reform of General Colleges and Universities in Hunan Province(Grant No.HNJG-2020-0261),China.
文摘Software-Defined Network(SDN)decouples the control plane of network devices from the data plane.While alleviating the problems presented in traditional network architectures,it also brings potential security risks,particularly network Denial-of-Service(DoS)attacks.While many research efforts have been devoted to identifying new features for DoS attack detection,detection methods are less accurate in detecting DoS attacks against client hosts due to the high stealth of such attacks.To solve this problem,a new method of DoS attack detection based on Deep Factorization Machine(DeepFM)is proposed in SDN.Firstly,we select the Growth Rate of Max Matched Packets(GRMMP)in SDN as detection feature.Then,the DeepFM algorithm is used to extract features from flow rules and classify them into dense and discrete features to detect DoS attacks.After training,the model can be used to infer whether SDN is under DoS attacks,and a DeepFM-based detection method for DoS attacks against client host is implemented.Simulation results show that our method can effectively detect DoS attacks in SDN.Compared with the K-Nearest Neighbor(K-NN),Artificial Neural Network(ANN)models,Support Vector Machine(SVM)and Random Forest models,our proposed method outperforms in accuracy,precision and F1 values.
基金This work was supported by the National Natural Science Foundation of China(Grant No.42050104)the Science Foundation of SINOPEC Group(Grant No.P20030).
文摘A comprehensive and precise analysis of shale gas production performance is crucial for evaluating resource potential,designing a field development plan,and making investment decisions.However,quantitative analysis can be challenging because production performance is dominated by the complex interaction among a series of geological and engineering factors.In fact,each factor can be viewed as a player who makes cooperative contributions to the production payoff within the constraints of physical laws and models.Inspired by the idea,we propose a hybrid data-driven analysis framework in this study,where the contributions of dominant factors are quantitatively evaluated,the productions are precisely forecasted,and the development optimization suggestions are comprehensively generated.More specifically,game theory and machine learning models are coupled to determine the dominating geological and engineering factors.The Shapley value with definite physical meaning is employed to quantitatively measure the effects of individual factors.A multi-model-fused stacked model is trained for production forecast,which provides the basis for derivative-free optimization algorithms to optimize the development plan.The complete workflow is validated with actual production data collected from the Fuling shale gas field,Sichuan Basin,China.The validation results show that the proposed procedure can draw rigorous conclusions with quantified evidence and thereby provide specific and reliable suggestions for development plan optimization.Comparing with traditional and experience-based approaches,the hybrid data-driven procedure is advanced in terms of both efficiency and accuracy.
基金conducted under the illu MINEation project, funded by the European Union’s Horizon 2020 research and innovation program under grant agreement (No. 869379)supported by the China Scholarship Council (No. 202006370006)
文摘A procedure to recognize individual discontinuities in rock mass from measurement while drilling(MWD)technology is developed,using the binary pattern of structural rock characteristics obtained from in-hole images for calibration.Data from two underground operations with different drilling technology and different rock mass characteristics are considered,which generalizes the application of the methodology to different sites and ensures the full operational integration of MWD data analysis.Two approaches are followed for site-specific structural model building:a discontinuity index(DI)built from variations in MWD parameters,and a machine learning(ML)classifier as function of the drilling parameters and their variability.The prediction ability of the models is quantitatively assessed as the rate of recognition of discontinuities observed in borehole logs.Differences between the parameters involved in the models for each site,and differences in their weights,highlight the site-dependence of the resulting models.The ML approach offers better performance than the classical DI,with recognition rates in the range 89%to 96%.However,the simpler DI still yields fairly accurate results,with recognition rates 70%to 90%.These results validate the adaptive MWD-based methodology as an engineering solution to predict rock structural condition in underground mining operations.
基金This work was financially supported by National Natural Science Foundation of China(41972262)Hebei Natural Science Foundation for Excellent Young Scholars(D2020504032)+1 种基金Central Plains Science and technology innovation leader Project(214200510030)Key research and development Project of Henan province(221111321500).
文摘Landslide is a serious natural disaster next only to earthquake and flood,which will cause a great threat to people’s lives and property safety.The traditional research of landslide disaster based on experience-driven or statistical model and its assessment results are subjective,difficult to quantify,and no pertinence.As a new research method for landslide susceptibility assessment,machine learning can greatly improve the landslide susceptibility model’s accuracy by constructing statistical models.Taking Western Henan for example,the study selected 16 landslide influencing factors such as topography,geological environment,hydrological conditions,and human activities,and 11 landslide factors with the most significant influence on the landslide were selected by the recursive feature elimination(RFE)method.Five machine learning methods[Support Vector Machines(SVM),Logistic Regression(LR),Random Forest(RF),Extreme Gradient Boosting(XGBoost),and Linear Discriminant Analysis(LDA)]were used to construct the spatial distribution model of landslide susceptibility.The models were evaluated by the receiver operating characteristic curve and statistical index.After analysis and comparison,the XGBoost model(AUC 0.8759)performed the best and was suitable for dealing with regression problems.The model had a high adaptability to landslide data.According to the landslide susceptibility map of the five models,the overall distribution can be observed.The extremely high and high susceptibility areas are distributed in the Funiu Mountain range in the southwest,the Xiaoshan Mountain range in the west,and the Yellow River Basin in the north.These areas have large terrain fluctuations,complicated geological structural environments and frequent human engineering activities.The extremely high and highly prone areas were 12043.3 km^(2)and 3087.45 km^(2),accounting for 47.61%and 12.20%of the total area of the study area,respectively.Our study reflects the distribution of landslide susceptibility in western Henan Province,which provides a scientific basis for regional disaster warning,prediction,and resource protection.The study has important practical significance for subsequent landslide disaster management.
基金Supported by Winfast Charity Foundation for Financial Support,No.YL20220525.
文摘BACKGROUND Geriatric hip fractures are one of the most common fractures in elderly individuals,and prolonged hospital stays increase the risk of death and complications.Machine learning(ML)has become prevalent in clinical data processing and predictive models.This study aims to develop ML models for predicting extended length of stay(eLOS)among geriatric patients with hip fractures and to identify the associated risk factors.AIM To develop ML models for predicting the eLOS among geriatric patients with hip fractures,identify associated risk factors,and compare the performance of each model.METHODS A retrospective study was conducted at a single orthopaedic trauma centre,enrolling all patients who underwent hip fracture surgery between January 2018 and December 2022.The study collected various patient characteristics,encompassing demographic data,general health status,injury-related data,laboratory examinations,surgery-related data,and length of stay.Features that exhibited significant differences in univariate analysis were integrated into the ML model establishment and subsequently cross-verified.The study compared the performance of the ML models and determined the risk factors for eLOS.RESULTS The study included 763 patients,with 380 experiencing eLOS.Among the models,the decision tree,random forest,and extreme Gradient Boosting models demonstrated the most robust performance.Notably,the artificial neural network model also exhibited impressive results.After cross-validation,the support vector machine and logistic regression models demonstrated superior performance.Predictors for eLOS included delayed surgery,D-dimer level,American Society of Anaesthesiologists(ASA)classification,type of surgery,and sex.CONCLUSION ML proved to be highly accurate in predicting the eLOS for geriatric patients with hip fractures.The identified key risk factors were delayed surgery,D-dimer level,ASA classification,type of surgery,and sex.This valuable information can aid clinicians in allocating resources more efficiently to meet patient demand effectively.
基金The Shanghai Committee of Science and Technology of China under contract No. 10510502800the Graduate Student Education Innovation Program Foundation of Shanghai Municipal Education Commission of Chinathe National Key Science Foundation Research "973" Project of the Ministry of Science and Technology of China under contract No. 2012CB316200
文摘Harmonic analysis, the traditional tidal forecasting method, cannot take into account the impact of noncyclical factors, and is also based on the BP neural network tidal prediction model which is easily limited by the amount of data. According to the movement of celestial bodies, and considering the insufficient tidal characteristics of historical data which are impacted by the nonperiodic weather, a tidal prediction method is designed based on support vector machine (SVM) to carry out the simulation experiment by using tidal data from Xiamen Tide Gauge, Luchaogang Tide Gauge and Weifang Tide Gauge individually. And the results show that the model satisfactorily carries out the tide prediction which is influenced by noncyclical factors. At the same time, it also proves that the proposed prediction method, which when compared with harmonic analysis method and the BP neural network method, has faster modeling speed, higher prediction precision and stronger generalization ability.
基金supported by the College of Agriculture,Shiraz University(Grant No.97GRC1M271143)funding from the UK Biotechnology and Biological Sciences Research Council(BBSRC)funded by BBSRC grant award BBS/E/C/000I0330–Soil to Nutrition project 3–Sustainable intensification:optimisation at multiple scales。
文摘This investigation assessed the efficacy of 10 widely used machine learning algorithms(MLA)comprising the least absolute shrinkage and selection operator(LASSO),generalized linear model(GLM),stepwise generalized linear model(SGLM),elastic net(ENET),partial least square(PLS),ridge regression,support vector machine(SVM),classification and regression trees(CART),bagged CART,and random forest(RF)for gully erosion susceptibility mapping(GESM)in Iran.The location of 462 previously existing gully erosion sites were mapped through widespread field investigations,of which 70%(323)and 30%(139)of observations were arbitrarily divided for algorithm calibration and validation.Twelve controlling factors for gully erosion,namely,soil texture,annual mean rainfall,digital elevation model(DEM),drainage density,slope,lithology,topographic wetness index(TWI),distance from rivers,aspect,distance from roads,plan curvature,and profile curvature were ranked in terms of their importance using each MLA.The MLA were compared using a training dataset for gully erosion and statistical measures such as RMSE(root mean square error),MAE(mean absolute error),and R-squared.Based on the comparisons among MLA,the RF algorithm exhibited the minimum RMSE and MAE and the maximum value of R-squared,and was therefore selected as the best model.The variable importance evaluation using the RF model revealed that distance from rivers had the highest significance in influencing the occurrence of gully erosion whereas plan curvature had the least importance.According to the GESM generated using RF,most of the study area is predicted to have a low(53.72%)or moderate(29.65%)susceptibility to gully erosion,whereas only a small area is identified to have a high(12.56%)or very high(4.07%)susceptibility.The outcome generated by RF model is validated using the ROC(Receiver Operating Characteristics)curve approach,which returned an area under the curve(AUC)of 0.985,proving the excellent forecasting ability of the model.The GESM prepared using the RF algorithm can aid decision-makers in targeting remedial actions for minimizing the damage caused by gully erosion.
基金the Declaration of Helsinki and was approved by Clinical Research Ethics Committee of China-Japan Friendship Hospital(2018-110-K79-1).
文摘Background: Nonalcoholic fatty liver disease(NAFLD) had become the most prevalent liver disease worldwide. Early diagnosis could effectively reduce NAFLD-related morbidity and mortality. This study aimed to combine the risk factors to develop and validate a novel model for predicting NAFLD. Methods: We enrolled 578 participants completing abdominal ultrasound into the training set. The least absolute shrinkage and selection operator(LASSO) regression combined with random forest(RF) was conducted to screen significant predictors for NAFLD risk. Five machine learning models including logistic regression(LR), RF, extreme gradient boosting(XGBoost), gradient boosting machine(GBM), and support vector machine(SVM) were developed. To further improve model performance, we conducted hyperparameter tuning with train function in Python package ‘sklearn’. We included 131 participants completing magnetic resonance imaging into the testing set for external validation. Results: There were 329 participants with NAFLD and 249 without in the training set, while 96 with NAFLD and 35 without were in the testing set. Visceral adiposity index, abdominal circumference, body mass index, alanine aminotransferase(ALT), ALT/AST(aspartate aminotransferase), age, high-density lipoprotein cholesterol(HDL-C) and elevated triglyceride(TG) were important predictors for NAFLD risk. The area under curve(AUC) of LR, RF, XGBoost, GBM, SVM were 0.915 [95% confidence interval(CI): 0.886–0.937], 0.907(95% CI: 0.856–0.938), 0.928(95% CI: 0.873–0.944), 0.924(95% CI: 0.875–0.939), and 0.900(95% CI: 0.883–0.913), respectively. XGBoost model presented the best predictive performance, and its AUC was enhanced to 0.938(95% CI: 0.870–0.950) with further parameter tuning. Conclusions: This study developed and validated five novel machine learning models for NAFLD prediction, among which XGBoost presented the best performance and was considered a reliable reference for early identification of high-risk patients with NAFLD in clinical practice.
基金This work was supported by the Ministry of Science and Technology,Taiwan,under Grants MOST 110-2221-E-194-037,NSTC 111-2823-8-194-002,111-2221-E-194-052 and 11-2218-E-194-007。
文摘The power factor is the ratio between the active and apparent power,and it is available to determine the operational capability of the intended circuit or the parts.The excitation current of the synchronous motor is an essential parameter required for adjusting the power factor because it determines whether the motor is under the optimal operating status.Although the excitation current should predict with the experimental devices,such a method is unsuitable for online real-time prediction.The artificial intelligence algorithm can compensate for the defect of conventional measurement methods requiring the measuring devices and the model optimization is compared during the research process.In this article,the load current,power factor,and power factor errors available in the existing dataset are used as the input parameters for training the proposed artificial intelligence algorithms to select the optimal algorithm according to the training result,for this algorithm to have higher accuracy.The SMOGN(Synthetic Minority Over-Sampling Technique for Regression with Gaussian Noise)is selected for the research by which the data and the MFO(Moth-flame optimization algorithm)are created for the model to adjust and optimize the parameters automatically.In addition to enhancing the prediction accuracy for the excitation current,the automatic parameter adjusting method also allows the researchers not specializing in the professional algorithm to apply such application method more efficiently.The final result indicated that the prediction accuracy has reached“Mean Absolute Error(MAE)=0.0057,Root Mean Square Error(RMSE)=0.0093 andR2 score=0.9973”.Applying this method to themotor control would be much easier for the power factor adjustment in the future because it allows the motor to operate under the optimal power status to reduce energy consumption while enhancing working efficiency.
文摘In the live broadcast process,eye movement characteristics can reflect people’s attention to the product.However,the existing interest degree predictive model research does not consider the eye movement characteristics.In order to obtain the users’interest in the product more effectively,we will consider the key eye movement indicators.We first collect eye movement characteristics based on the self-developed data processing algorithm fast discriminative model prediction for tracking(FDIMP),and then we add data dimensions to the original data set through information filling.In addition,we apply the deep factorization machine(DeepFM)architecture to simultaneously learn the combination of low-level and high-level features.In order to effectively learn important features and emphasize relatively important features,the multi-head attention mechanism is applied in the interest model.The experimental results on the public data set Criteo show that,compared with the original DeepFM algorithm,the area under curve(AUC)value was improved by up to 9.32%.
文摘In this article, research was conducted to improve Linter machines that remove short fibers remaining in ginned cotton seeds at cotton ginneries. The study examined the effect of changing the dimensions of the brush drum, guide and mesh surface in the cleaning device proposed for the linting machine on the movement of the peg and the cleaning efficiency, and the highest level of efficiency in separating impurities from the peg was determined. During the study, the main factors influencing the effective operation of the improved linting machine were identified, the limits of their values were determined, and studies were carried out using the mathematical modeling method. As a result, at the values of the given coefficients, efficient operation of the improved linting machine was observed, that is, the lint cleaning efficiency reached 55.1%.
基金This research work has been conducted in cooperation with members of DETSI project supported by BPI France and Pays de Loire and Auvergne Rhone Alpes.
文摘The proliferation of internet communication channels has increased telecom fraud,causing billions of euros in losses for customers and the industry each year.Fraudsters constantly find new ways to engage in illegal activity on the network.To reduce these losses,a new fraud detection approach is required.Telecom fraud detection involves identifying a small number of fraudulent calls from a vast amount of call traffic.Developing an effective strategy to combat fraud has become challenging.Although much effort has been made to detect fraud,most existing methods are designed for batch processing,not real-time detection.To solve this problem,we propose an online fraud detection model using a Neural Factorization Autoencoder(NFA),which analyzes customer calling patterns to detect fraudulent calls.The model employs Neural Factorization Machines(NFM)and an Autoencoder(AE)to model calling patterns and a memory module to adapt to changing customer behaviour.We evaluate our approach on a large dataset of real-world call detail records and compare it with several state-of-the-art methods.Our results show that our approach outperforms the baselines,with an AUC of 91.06%,a TPR of 91.89%,an FPR of 14.76%,and an F1-score of 95.45%.These results demonstrate the effectiveness of our approach in detecting fraud in real-time and suggest that it can be a valuable tool for preventing fraud in telecommunications networks.