期刊文献+
共找到7,158篇文章
< 1 2 250 >
每页显示 20 50 100
基于Random Forest和UHPLC-QTOF-MS^(E)对不同来源龟甲基原的鉴定
1
作者 王献瑞 张佳婷 +5 位作者 张宇 李明华 郭晓晗 荆文光 程显隆 魏锋 《中国药事》 CAS 2024年第9期1008-1019,共12页
目的:基于超高效液相色谱串联四极杆飞行时间质谱(UHPLC-QTOF-MS^(E))分析并经数字量化处理,结合随机森林(Random Forest,RF)算法构建数据辨识模型,以实现中华草龟、巴西龟、台湾龟、鳄鱼龟、鳖甲基原的数字化鉴定。方法:经样品预处理后... 目的:基于超高效液相色谱串联四极杆飞行时间质谱(UHPLC-QTOF-MS^(E))分析并经数字量化处理,结合随机森林(Random Forest,RF)算法构建数据辨识模型,以实现中华草龟、巴西龟、台湾龟、鳄鱼龟、鳖甲基原的数字化鉴定。方法:经样品预处理后,对不同来源、不同批次的龟甲进行UPLC-QTOF-MS^(E)分析,并以混合样品为基准进行峰位校正、提取并经量化处理,获取反映多肽离子信息的精确质量数-保留时间数据对(Exact Mass Retention Time,EMRT)。然后基于信息增益率的特征筛选获取重要多肽离子信息,结合随机森林(RF)算法进行数据建模,同时基于内部交叉验证中的准确率(Acc)、精确率(P)、曲线下面积(AUC)等参数进行模型评价。最后基于最优模型进行龟甲基原的鉴定验证分析。结果:基于信息增益率的特征筛选,得到71个特征多肽信息,建立的RF模型具有优秀的辨识效果,准确率、精确率以及AUC均大于0.950且外部鉴定验证的正确率为100.0%。结论:基于UHPLC-QTOF-MS^(E)分析,并结合RF算法能够高效准确地实现不同来源龟甲基原的数字化鉴定,可为龟甲的质量控制及基原考证提供参考和帮助。 展开更多
关键词 龟甲 基原鉴定 机器学习 随机森林 超高效液相色谱串联四极杆飞行时间质谱
下载PDF
一种基于KMeans与Random Forest的异常温升捕捉方法
2
作者 汪海良 《现代建筑电气》 2024年第6期21-26,49,共7页
针对线路老化、线路过载的火灾频发问题,分析了线路老化、线路过载与异常温升之间的关联性,以电流值、线缆温度作为输入,利用KMeans聚类算法划分可能存在异常温升的区间,通过Random Forest算法识别线路过载问题,可以提前通知用户整改线... 针对线路老化、线路过载的火灾频发问题,分析了线路老化、线路过载与异常温升之间的关联性,以电流值、线缆温度作为输入,利用KMeans聚类算法划分可能存在异常温升的区间,通过Random Forest算法识别线路过载问题,可以提前通知用户整改线路,预防火灾的发生。 展开更多
关键词 线路过载 异常温升 random forest KMeans
下载PDF
A real-time intelligent lithology identification method based on a dynamic felling strategy weighted random forest algorithm
3
作者 Tie Yan Rui Xu +2 位作者 Shi-Hui Sun Zhao-Kai Hou Jin-Yu Feng 《Petroleum Science》 SCIE EI CAS CSCD 2024年第2期1135-1148,共14页
Real-time intelligent lithology identification while drilling is vital to realizing downhole closed-loop drilling. The complex and changeable geological environment in the drilling makes lithology identification face ... Real-time intelligent lithology identification while drilling is vital to realizing downhole closed-loop drilling. The complex and changeable geological environment in the drilling makes lithology identification face many challenges. This paper studies the problems of difficult feature information extraction,low precision of thin-layer identification and limited applicability of the model in intelligent lithologic identification. The author tries to improve the comprehensive performance of the lithology identification model from three aspects: data feature extraction, class balance, and model design. A new real-time intelligent lithology identification model of dynamic felling strategy weighted random forest algorithm(DFW-RF) is proposed. According to the feature selection results, gamma ray and 2 MHz phase resistivity are the logging while drilling(LWD) parameters that significantly influence lithology identification. The comprehensive performance of the DFW-RF lithology identification model has been verified in the application of 3 wells in different areas. By comparing the prediction results of five typical lithology identification algorithms, the DFW-RF model has a higher lithology identification accuracy rate and F1 score. This model improves the identification accuracy of thin-layer lithology and is effective and feasible in different geological environments. The DFW-RF model plays a truly efficient role in the realtime intelligent identification of lithologic information in closed-loop drilling and has greater applicability, which is worthy of being widely used in logging interpretation. 展开更多
关键词 Intelligent drilling Closed-loop drilling Lithology identification random forest algorithm Feature extraction
下载PDF
Detecting XSS with Random Forest and Multi-Channel Feature Extraction
4
作者 Qiurong Qin Yueqin Li +3 位作者 Yajie Mi Jinhui Shen Kexin Wu Zhenzhao Wang 《Computers, Materials & Continua》 SCIE EI 2024年第7期843-874,共32页
In the era of the Internet,widely used web applications have become the target of hacker attacks because they contain a large amount of personal information.Among these vulnerabilities,stealing private data through cr... In the era of the Internet,widely used web applications have become the target of hacker attacks because they contain a large amount of personal information.Among these vulnerabilities,stealing private data through crosssite scripting(XSS)attacks is one of the most commonly used attacks by hackers.Currently,deep learning-based XSS attack detection methods have good application prospects;however,they suffer from problems such as being prone to overfitting,a high false alarm rate,and low accuracy.To address these issues,we propose a multi-stage feature extraction and fusion model for XSS detection based on Random Forest feature enhancement.The model utilizes RandomForests to capture the intrinsic structure and patterns of the data by extracting leaf node indices as features,which are subsequentlymergedwith the original data features to forma feature setwith richer information content.Further feature extraction is conducted through three parallel channels.Channel I utilizes parallel onedimensional convolutional layers(1Dconvolutional layers)with different convolutional kernel sizes to extract local features at different scales and performmulti-scale feature fusion;Channel II employsmaximum one-dimensional pooling layers(max 1D pooling layers)of various sizes to extract key features from the data;and Channel III extracts global information bi-directionally using a Bi-Directional Long-Short TermMemory Network(Bi-LSTM)and incorporates a multi-head attention mechanism to enhance global features.Finally,effective classification and prediction of XSS are performed by fusing the features of the three channels.To test the effectiveness of the model,we conduct experiments on six datasets.We achieve an accuracy of 100%on the UNSW-NB15 dataset and 99.99%on the CICIDS2017 dataset,which is higher than that of the existing models. 展开更多
关键词 random forest feature enhancement three-channel parallelism XSS detection
下载PDF
Random Forest-Based Fatigue Reliability-Based Design Optimization for Aeroengine Structures
5
作者 Xue-Qin Li Lu-Kai Song 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第7期665-684,共20页
Fatigue reliability-based design optimization of aeroengine structures involves multiple repeated calculations of reliability degree and large-scale calls of implicit high-nonlinearity limit state function,leading to ... Fatigue reliability-based design optimization of aeroengine structures involves multiple repeated calculations of reliability degree and large-scale calls of implicit high-nonlinearity limit state function,leading to the traditional direct Monte Claro and surrogate methods prone to unacceptable computing efficiency and accuracy.In this case,by fusing the random subspace strategy and weight allocation technology into bagging ensemble theory,a random forest(RF)model is presented to enhance the computing efficiency of reliability degree;moreover,by embedding the RF model into multilevel optimization model,an efficient RF-assisted fatigue reliability-based design optimization framework is developed.Regarding the low-cycle fatigue reliability-based design optimization of aeroengine turbine disc as a case,the effectiveness of the presented framework is validated.The reliabilitybased design optimization results exhibit that the proposed framework holds high computing accuracy and computing efficiency.The current efforts shed a light on the theory/method development of reliability-based design optimization of complex engineering structures. 展开更多
关键词 random forest reliability-based design optimization ensemble learning machine learning
下载PDF
Determination of the Pile Drivability Using Random Forest Optimized by Particle Swarm Optimization and Bayesian Optimizer
6
作者 Shengdong Cheng Juncheng Gao Hongning Qi 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第10期871-892,共22页
Driven piles are used in many geological environments as a practical and convenient structural component.Hence,the determination of the drivability of piles is actually of great importance in complex geotechnical appl... Driven piles are used in many geological environments as a practical and convenient structural component.Hence,the determination of the drivability of piles is actually of great importance in complex geotechnical applications.Conventional methods of predicting pile drivability often rely on simplified physicalmodels or empirical formulas,whichmay lack accuracy or applicability in complex geological conditions.Therefore,this study presents a practical machine learning approach,namely a Random Forest(RF)optimized by Bayesian Optimization(BO)and Particle Swarm Optimization(PSO),which not only enhances prediction accuracy but also better adapts to varying geological environments to predict the drivability parameters of piles(i.e.,maximumcompressive stress,maximum tensile stress,and blow per foot).In addition,support vector regression,extreme gradient boosting,k nearest neighbor,and decision tree are also used and applied for comparison purposes.In order to train and test these models,among the 4072 datasets collected with 17model inputs,3258 datasets were randomly selected for training,and the remaining 814 datasets were used for model testing.Lastly,the results of these models were compared and evaluated using two performance indices,i.e.,the root mean square error(RMSE)and the coefficient of determination(R2).The results indicate that the optimized RF model achieved lower RMSE than other prediction models in predicting the three parameters,specifically 0.044,0.438,and 0.146;and higher R^(2) values than other implemented techniques,specifically 0.966,0.884,and 0.977.In addition,the sensitivity and uncertainty of the optimized RF model were analyzed using Sobol sensitivity analysis and Monte Carlo(MC)simulation.It can be concluded that the optimized RF model could be used to predict the performance of the pile,and it may provide a useful reference for solving some problems under similar engineering conditions. 展开更多
关键词 random forest regression model pile drivability Bayesian optimization particle swarm optimization
下载PDF
An Optimized System of Random Forest Model by Global Harmony Search with Generalized Opposition-Based Learning for Forecasting TBM Advance Rate
7
作者 Yingui Qiu Shuai Huang +3 位作者 Danial Jahed Armaghani Biswajeet Pradhan Annan Zhou Jian Zhou 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第3期2873-2897,共25页
As massive underground projects have become popular in dense urban cities,a problem has arisen:which model predicts the best for Tunnel Boring Machine(TBM)performance in these tunneling projects?However,performance le... As massive underground projects have become popular in dense urban cities,a problem has arisen:which model predicts the best for Tunnel Boring Machine(TBM)performance in these tunneling projects?However,performance level of TBMs in complex geological conditions is still a great challenge for practitioners and researchers.On the other hand,a reliable and accurate prediction of TBM performance is essential to planning an applicable tunnel construction schedule.The performance of TBM is very difficult to estimate due to various geotechnical and geological factors and machine specifications.The previously-proposed intelligent techniques in this field are mostly based on a single or base model with a low level of accuracy.Hence,this study aims to introduce a hybrid randomforest(RF)technique optimized by global harmony search with generalized oppositionbased learning(GOGHS)for forecasting TBM advance rate(AR).Optimizing the RF hyper-parameters in terms of,e.g.,tree number and maximum tree depth is the main objective of using the GOGHS-RF model.In the modelling of this study,a comprehensive databasewith themost influential parameters onTBMtogetherwithTBM AR were used as input and output variables,respectively.To examine the capability and power of the GOGHSRF model,three more hybrid models of particle swarm optimization-RF,genetic algorithm-RF and artificial bee colony-RF were also constructed to forecast TBM AR.Evaluation of the developed models was performed by calculating several performance indices,including determination coefficient(R2),root-mean-square-error(RMSE),and mean-absolute-percentage-error(MAPE).The results showed that theGOGHS-RF is a more accurate technique for estimatingTBMAR compared to the other applied models.The newly-developedGOGHS-RFmodel enjoyed R2=0.9937 and 0.9844,respectively,for train and test stages,which are higher than a pre-developed RF.Also,the importance of the input parameters was interpreted through the SHapley Additive exPlanations(SHAP)method,and it was found that thrust force per cutter is the most important variable on TBMAR.The GOGHS-RF model can be used in mechanized tunnel projects for predicting and checking performance. 展开更多
关键词 Tunnel boring machine random forest GOGHS optimization PSO optimization GA optimization ABC optimization SHAP
下载PDF
Winter Wheat Yield Estimation Based on Sparrow Search Algorithm Combined with Random Forest:A Case Study in Henan Province,China
8
作者 SHI Xiaoliang CHEN Jiajun +2 位作者 DING Hao YANG Yuanqi ZHANG Yan 《Chinese Geographical Science》 SCIE CSCD 2024年第2期342-356,共15页
Precise and timely prediction of crop yields is crucial for food security and the development of agricultural policies.However,crop yield is influenced by multiple factors within complex growth environments.Previous r... Precise and timely prediction of crop yields is crucial for food security and the development of agricultural policies.However,crop yield is influenced by multiple factors within complex growth environments.Previous research has paid relatively little attention to the interference of environmental factors and drought on the growth of winter wheat.Therefore,there is an urgent need for more effective methods to explore the inherent relationship between these factors and crop yield,making precise yield prediction increasingly important.This study was based on four type of indicators including meteorological,crop growth status,environmental,and drought index,from October 2003 to June 2019 in Henan Province as the basic data for predicting winter wheat yield.Using the sparrow search al-gorithm combined with random forest(SSA-RF)under different input indicators,accuracy of winter wheat yield estimation was calcu-lated.The estimation accuracy of SSA-RF was compared with partial least squares regression(PLSR),extreme gradient boosting(XG-Boost),and random forest(RF)models.Finally,the determined optimal yield estimation method was used to predict winter wheat yield in three typical years.Following are the findings:1)the SSA-RF demonstrates superior performance in estimating winter wheat yield compared to other algorithms.The best yield estimation method is achieved by four types indicators’composition with SSA-RF)(R^(2)=0.805,RRMSE=9.9%.2)Crops growth status and environmental indicators play significant roles in wheat yield estimation,accounting for 46%and 22%of the yield importance among all indicators,respectively.3)Selecting indicators from October to April of the follow-ing year yielded the highest accuracy in winter wheat yield estimation,with an R^(2)of 0.826 and an RMSE of 9.0%.Yield estimates can be completed two months before the winter wheat harvest in June.4)The predicted performance will be slightly affected by severe drought.Compared with severe drought year(2011)(R^(2)=0.680)and normal year(2017)(R^(2)=0.790),the SSA-RF model has higher prediction accuracy for wet year(2018)(R^(2)=0.820).This study could provide an innovative approach for remote sensing estimation of winter wheat yield.yield. 展开更多
关键词 winter wheat yield estimation sparrow search algorithm combined with random forest(SSA-RF) machine learning multi-source indicator optimal lead time Henan Province China
下载PDF
A HybridManufacturing ProcessMonitoringMethod Using Stacked Gated Recurrent Unit and Random Forest
9
作者 Chao-Lung Yang Atinkut Atinafu Yilma +2 位作者 Bereket Haile Woldegiorgis Hendrik Tampubolon Hendri Sutrisno 《Intelligent Automation & Soft Computing》 2024年第2期233-254,共22页
This study proposed a new real-time manufacturing process monitoring method to monitor and detect process shifts in manufacturing operations.Since real-time production process monitoring is critical in today’s smart ... This study proposed a new real-time manufacturing process monitoring method to monitor and detect process shifts in manufacturing operations.Since real-time production process monitoring is critical in today’s smart manufacturing.The more robust the monitoring model,the more reliable a process is to be under control.In the past,many researchers have developed real-time monitoring methods to detect process shifts early.However,thesemethods have limitations in detecting process shifts as quickly as possible and handling various data volumes and varieties.In this paper,a robust monitoring model combining Gated Recurrent Unit(GRU)and Random Forest(RF)with Real-Time Contrast(RTC)called GRU-RF-RTC was proposed to detect process shifts rapidly.The effectiveness of the proposed GRU-RF-RTC model is first evaluated using multivariate normal and nonnormal distribution datasets.Then,to prove the applicability of the proposed model in a realmanufacturing setting,the model was evaluated using real-world normal and non-normal problems.The results demonstrate that the proposed GRU-RF-RTC outperforms other methods in detecting process shifts quickly with the lowest average out-of-control run length(ARL1)in all synthesis and real-world problems under normal and non-normal cases.The experiment results on real-world problems highlight the significance of the proposed GRU-RF-RTC model in modern manufacturing process monitoring applications.The result reveals that the proposed method improves the shift detection capability by 42.14%in normal and 43.64%in gamma distribution problems. 展开更多
关键词 Smart manufacturing process monitoring quality control gated recurrent unit neural network random forest
下载PDF
基于Random Forest和层次分析法的混凝土连续梁桥耐久性评估
10
作者 王璐瑶 常兴科 张海君 《沈阳大学学报(自然科学版)》 CAS 2024年第3期255-261,共7页
为了准确快速地评估混凝土连续梁桥的耐久性,避免造成结构耐久性评估结果受桥梁技术人员因对规范不熟悉的主观因素影响,基于层次分析法建立适用于混凝土连续梁桥的耐久性评估指标体系,构建随机森林耐久性评估模型。经过参数调优获得随... 为了准确快速地评估混凝土连续梁桥的耐久性,避免造成结构耐久性评估结果受桥梁技术人员因对规范不熟悉的主观因素影响,基于层次分析法建立适用于混凝土连续梁桥的耐久性评估指标体系,构建随机森林耐久性评估模型。经过参数调优获得随机森林模型最优参数组合为105、10、2、2。结果表明:使用随机森林耐久性评估模型的精确率、召回率、F1值均大于87%;主梁裂缝、重载率、下部结构保护层厚度安全系数等对混凝土连续梁桥耐久性的影响依次递减。将评估结果与桥检报告技术状况等级、课题软件结果对比,验证了模型的可靠性。 展开更多
关键词 混凝土连续梁桥 耐久性 层次分析法 随机森林 评估指标体系
下载PDF
Random forest-based prediction of decay modes and half-lives of superheavy nuclei 被引量:4
11
作者 Bo‑Shuai Cai Cen‑Xi Yuan 《Nuclear Science and Techniques》 SCIE EI CAS CSCD 2023年第12期271-280,共10页
Information on the decay process of nuclides in the superheavy region is critical in investigating new elements beyond oganesson and the island of stability.This paper presents the application of a random forest algor... Information on the decay process of nuclides in the superheavy region is critical in investigating new elements beyond oganesson and the island of stability.This paper presents the application of a random forest algorithm to examine the competition among different decay modes in the superheavy region,includingα decay,β^(-)decay,β^(+)decay,electron capture and spontaneous fission.The observed half-lives and dominant decay mode are well reproduced.The dominant decay mode of 96.9%of the nuclei beyond ^(212) Po is correctly obtained.Further,α decay is predicted to be the dominant decay mode for isotopes in new elements Z=119-122,except for spontaneous fission in certain even–even elements owing to the increased Coulomb repulsion and odd–even effect.The predicted half-lives demonstrate the existence of a long-lived spontaneous fission island southwest of ^(298) Fl caused by the competition between the fission barrier and Coulomb repulsion.A better understanding of spontaneous fission,particularly beyond ^(286)Fl,is crucial in the search for new elements and the island of stability. 展开更多
关键词 Decay mode Superheavy nuclide random forest
下载PDF
Data cleaning method for the process of acid production with flue gas based on improved random forest 被引量:2
12
作者 Xiaoli Li Minghua Liu +2 位作者 Kang Wang Zhiqiang Liu Guihai Li 《Chinese Journal of Chemical Engineering》 SCIE EI CAS CSCD 2023年第7期72-84,共13页
Acid production with flue gas is a complex nonlinear process with multiple variables and strong coupling.The operation data is an important basis for state monitoring,optimal control,and fault diagnosis.However,the op... Acid production with flue gas is a complex nonlinear process with multiple variables and strong coupling.The operation data is an important basis for state monitoring,optimal control,and fault diagnosis.However,the operating environment of acid production with flue gas is complex and there is much equipment.The data obtained by the detection equipment is seriously polluted and prone to abnormal phenomena such as data loss and outliers.Therefore,to solve the problem of abnormal data in the process of acid production with flue gas,a data cleaning method based on improved random forest is proposed.Firstly,an outlier data recognition model based on isolation forest is designed to identify and eliminate the outliers in the dataset.Secondly,an improved random forest regression model is established.Genetic algorithm is used to optimize the hyperparameters of the random forest regression model.Then the optimal parameter combination is found in the search space and the trend of data is predicted.Finally,the improved random forest data cleaning method is used to compensate for the missing data after eliminating abnormal data and the data cleaning is realized.Results show that the proposed method can accurately eliminate and compensate for the abnormal data in the process of acid production with flue gas.The method improves the accuracy of compensation for missing data.With the data after cleaning,a more accurate model can be established,which is significant to the subsequent temperature control.The conversion rate of SO_(2) can be further improved,thereby improving the yield of sulfuric acid and economic benefits. 展开更多
关键词 Acid production Data cleaning Isolation forest random forest Data compensation
下载PDF
High-Accuracy NLOS Identification Based on Random Forest and High-Precision Positioning on 60 GHz Millimeter Wave 被引量:2
13
作者 Qiuna Niu Wei Shi +1 位作者 Yongdao Xu Weijun Wen 《China Communications》 SCIE CSCD 2023年第12期96-110,共15页
60 GHz millimeter wave(mmWave)system provides extremely high time resolution and multipath components(MPC)separation and has great potential to achieve high precision in the indoor positioning.However,the ranging data... 60 GHz millimeter wave(mmWave)system provides extremely high time resolution and multipath components(MPC)separation and has great potential to achieve high precision in the indoor positioning.However,the ranging data is often contaminated by non-line-of-sight(NLOS)transmission.First,six features of 60GHz mm Wave signal under LOS and NLOS conditions are evaluated.Next,a classifier constructed by random forest(RF)algorithm is used to identify line-of-sight(LOS)or NLOS channel.The identification mechanism has excellent generalization performance and the classification accuracy is over 97%.Finally,based on the identification results,a residual weighted least squares positioning method is proposed.All ranging information including that under NLOS channels is fully utilized,positioning failure caused by insufficient LOS links can be avoided.Compared with the conventional least squares approach,the positioning error of the proposed algorithm is reduced by 49%. 展开更多
关键词 60 GHz millimeter wave indoor positioning NLOS identification random forest
下载PDF
Vault predicting after implantable collamer lens implantation using random forest network based on different features in ultrasound biomicroscopy images 被引量:2
14
作者 Bin Fang Qiu-Jian Zhu +1 位作者 Hui Yang Li-Cheng Fan 《International Journal of Ophthalmology(English edition)》 SCIE CAS 2023年第10期1561-1567,共7页
AIM:To analyze ultrasound biomicroscopy(UBM)images using random forest network to find new features to make predictions about vault after implantable collamer lens(ICL)implantation.METHODS:A total of 450 UBM images we... AIM:To analyze ultrasound biomicroscopy(UBM)images using random forest network to find new features to make predictions about vault after implantable collamer lens(ICL)implantation.METHODS:A total of 450 UBM images were collected from the Lixiang Eye Hospital to provide the patient’s preoperative parameters as well as the vault of the ICL after implantation.The vault was set as the prediction target,and the input elements were mainly ciliary sulcus shape parameters,which included 6 angular parameters,2 area parameters,and 2 parameters,distance between ciliary sulci,and anterior chamber height.A random forest regression model was applied to predict the vault,with the number of base estimators(n_estimators)of 2000,the maximum tree depth(max_depth)of 17,the number of tree features(max_features)of Auto,and the random state(random_state)of 40.0.RESULTS:Among the parameters selected in this study,the distance between ciliary sulci had a greater importance proportion,reaching 52%before parameter optimization is performed,and other features had less influence,with an importance proportion of about 5%.The importance of the distance between the ciliary sulci increased to 53% after parameter optimization,and the importance of angle 3 and area 1 increased to 5% and 8%respectively,while the importance of the other parameters remained unchanged,and the distance between the ciliary sulci was considered the most important feature.Other features,although they accounted for a relatively small proportion,also had an impact on the vault prediction.After parameter optimization,the best prediction results were obtained,with a predicted mean value of 763.688μm and an actual mean value of 776.9304μm.The R²was 0.4456 and the root mean square error was 201.5166.CONCLUSION:A study based on UBM images using random forest network can be performed for prediction of the vault after ICL implantation and can provide some reference for ICL size selection. 展开更多
关键词 random forest network ultrasound biomicroscopy images vault prediction implantable collamer lens
下载PDF
Research on stock trend prediction method based on optimized random forest 被引量:1
15
作者 Lili Yin Benling Li +1 位作者 Peng Li Rubo Zhang 《CAAI Transactions on Intelligence Technology》 SCIE EI 2023年第1期274-284,共11页
As a complex hot problem in the financial field,stock trend forecasting uses a large amount of data and many related indicators;hence it is difficult to obtain sustainable and effective results only by relying on empi... As a complex hot problem in the financial field,stock trend forecasting uses a large amount of data and many related indicators;hence it is difficult to obtain sustainable and effective results only by relying on empirical analysis.Researchers in the field of machine learning have proved that random forest can form better judgements on this kind of problem,and it has an auxiliary role in the prediction of stock trend.This study uses historical trading data of four listed companies in the USA stock market,and the purpose of this study is to improve the performance of random forest model in medium-and long-term stock trend prediction.This study applies the exponential smoothing method to process the initial data,calculates the relevant technical indicators as the characteristics to be selected,and proposes the D-RF-RS method to optimize random forest.As the random forest is an ensemble learning model and is closely related to decision tree,D-RF-RS method uses a decision tree to screen the importance of features,and obtains the effective strong feature set of the model as input.Then,the parameter combination of the model is optimized through random parameter search.The experimental results show that the average accuracy of random forest is increased by 0.17 after the above process optimization,which is 0.18 higher than the average accuracy of light gradient boosting machine model.Combined with the performance of the ROC curve and Precision–Recall curve,the stability of the model is also guaranteed,which further demonstrates the advantages of random forest in medium-and long-term trend prediction of the stock market. 展开更多
关键词 ensemble learning FINANCE random forest random search technical indicator
下载PDF
Structural Damage Identification System Suitable for Old Arch Bridge in Rural Regions: Random Forest Approach 被引量:1
16
作者 Yu Zhang Zhihua Xiong +2 位作者 Zhuoxi Liang Jiachen She Chicheng Ma 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第7期447-469,共23页
A huge number of old arch bridges located in rural regions are at the peak of maintenance.The health monitoring technology of the long-span bridge is hardly applicable to the small-span bridge,owing to the absence of ... A huge number of old arch bridges located in rural regions are at the peak of maintenance.The health monitoring technology of the long-span bridge is hardly applicable to the small-span bridge,owing to the absence of technical resources and sufficient funds in rural regions.There is an urgent need for an economical,fast,and accurate damage identification solution.The authors proposed a damage identification system of an old arch bridge implemented with amachine learning algorithm,which took the vehicle-induced response as the excitation.A damage index was defined based on wavelet packet theory,and a machine learning sample database collecting the denoised response was constructed.Through comparing three machine learning algorithms:Back-Propagation Neural Network(BPNN),Support Vector Machine(SVM),and Random Forest(R.F.),the R.F.damage identification model were found to have a better recognition ability.Finally,the Particle Swarm Optimization(PSO)algorithm was used to optimize the number of subtrees and split features of the R.F.model.The PSO optimized R.F.model was capable of the identification of different damage levels of old arch bridges with sensitive damage index.The proposed framework is practical and promising for the old bridge’s structural damage identification in rural regions. 展开更多
关键词 Old arch bridge damage identification machine learning random forest particle swarm optimization
下载PDF
Fast prediction of the mechanical response for layered pavement under instantaneous large impact based on random forest regression 被引量:1
17
作者 励明君 杨哩娜 +4 位作者 王登 王斯艺 唐静楠 姜毅 陈杰 《Chinese Physics B》 SCIE EI CAS CSCD 2023年第4期1-10,共10页
The layered pavements usually exhibit complicated mechanical properties with the effect of complex material properties under external environment.In some cases,such as launching missiles or rockets,layered pavements a... The layered pavements usually exhibit complicated mechanical properties with the effect of complex material properties under external environment.In some cases,such as launching missiles or rockets,layered pavements are required to bear large impulse load.However,traditional methods cannot non-destructively and quickly detect the internal structural of pavements.Thus,accurate and fast prediction of the mechanical properties of layered pavements is of great importance and necessity.In recent years,machine learning has shown great superiority in solving nonlinear problems.In this work,we present a method of predicting the maximum deflection and damage factor of layered pavements under instantaneous large impact based on random forest regression with the deflection basin parameters obtained from falling weight deflection testing.The regression coefficient R^(2)of testing datasets are above 0.94 in the process of predicting the elastic moduli of structural layers and mechanical responses,which indicates that the prediction results have great consistency with finite element simulation results.This paper provides a novel method for fast and accurate prediction of pavement mechanical responses under instantaneous large impact load using partial structural parameters of pavements,and has application potential in non-destructive evaluation of pavement structure. 展开更多
关键词 deflection basin parameters pavement condition assessment instantaneous large impact random forest regression
下载PDF
Power Transformer Fault Diagnosis Using Random Forest and Optimized Kernel Extreme Learning Machine 被引量:1
18
作者 Tusongjiang Kari Zhiyang He +3 位作者 Aisikaer Rouzi Ziwei Zhang Xiaojing Ma Lin Du 《Intelligent Automation & Soft Computing》 SCIE 2023年第7期691-705,共15页
Power transformer is one of the most crucial devices in power grid.It is significant to determine incipient faults of power transformers fast and accurately.Input features play critical roles in fault diagnosis accura... Power transformer is one of the most crucial devices in power grid.It is significant to determine incipient faults of power transformers fast and accurately.Input features play critical roles in fault diagnosis accuracy.In order to further improve the fault diagnosis performance of power trans-formers,a random forest feature selection method coupled with optimized kernel extreme learning machine is presented in this study.Firstly,the random forest feature selection approach is adopted to rank 42 related input features derived from gas concentration,gas ratio and energy-weighted dissolved gas analysis.Afterwards,a kernel extreme learning machine tuned by the Aquila optimization algorithm is implemented to adjust crucial parameters and select the optimal feature subsets.The diagnosis accuracy is used to assess the fault diagnosis capability of concerned feature subsets.Finally,the optimal feature subsets are applied to establish fault diagnosis model.According to the experimental results based on two public datasets and comparison with 5 conventional approaches,it can be seen that the average accuracy of the pro-posed method is up to 94.5%,which is superior to that of other conventional approaches.Fault diagnosis performances verify that the optimum feature subset obtained by the presented method can dramatically improve power transformers fault diagnosis accuracy. 展开更多
关键词 Power transformer fault diagnosis kernel extreme learning machine aquila optimization random forest
下载PDF
A Robust Tuned Random Forest Classifier Using Randomized Grid Search to Predict Coronary Artery Diseases
19
作者 Sameh Abd El-Ghany A.A.Abd El-Aziz 《Computers, Materials & Continua》 SCIE EI 2023年第5期4633-4648,共16页
Coronary artery disease(CAD)is one of themost authentic cardiovascular afflictions because it is an uncommonly overwhelming heart issue.The breakdown of coronary cardiovascular disease is one of the principal sources ... Coronary artery disease(CAD)is one of themost authentic cardiovascular afflictions because it is an uncommonly overwhelming heart issue.The breakdown of coronary cardiovascular disease is one of the principal sources of death all over theworld.Cardiovascular deterioration is a challenge,especially in youthful and rural countries where there is an absence of humantrained professionals.Since heart diseases happen without apparent signs,high-level detection is desirable.This paper proposed a robust and tuned random forest model using the randomized grid search technique to predictCAD.The proposed framework increases the ability of CADpredictions by tracking down risk pointers and learning the confusing joint efforts between them.Nowadays,the healthcare industry has a lot of data but needs to gain more knowledge.Our proposed framework is used for extracting knowledge from data stores and using that knowledge to help doctors accurately and effectively diagnose heart disease(HD).We evaluated the proposed framework over two public databases,Cleveland and Framingham datasets.The datasets were preprocessed by using a cleaning technique,a normalization technique,and an outlier detection technique.Secondly,the principal component analysis(PCA)algorithm was utilized to lessen the feature dimensionality of the two datasets.Finally,we used a hyperparameter tuning technique,randomized grid search,to tune a random forest(RF)machine learning(ML)model.The randomized grid search selected the best parameters and got the ideal CAD analysis.The proposed framework was evaluated and compared with traditional classifiers.Our proposed framework’s accuracy,sensitivity,precision,specificity,and f1-score were 100%.The evaluation of the proposed framework showed that it is an unrivaled perceptive outcome with tuning as opposed to other ongoing existing frameworks. 展开更多
关键词 Coronary artery disease tuned random forest randomized grid search CLASSIFIER
下载PDF
Liver Ailment Prediction Using Random Forest Model
20
作者 Fazal Muhammad Bilal Khan +7 位作者 Rashid Naseem Abdullah A Asiri Hassan A Alshamrani Khalaf A Alshamrani Samar M Alqhtani Muhammad Irfan Khlood M Mehdar Hanan Talal Halawani 《Computers, Materials & Continua》 SCIE EI 2023年第1期1049-1067,共19页
Today,liver disease,or any deterioration in one’s ability to survive,is extremely common all around the world.Previous research has indicated that liver disease is more frequent in younger people than in older ones.W... Today,liver disease,or any deterioration in one’s ability to survive,is extremely common all around the world.Previous research has indicated that liver disease is more frequent in younger people than in older ones.When the liver’s capability begins to deteriorate,life can be shortened to one or two days,and early prediction of such diseases is difficult.Using several machine learning(ML)approaches,researchers analyzed a variety of models for predicting liver disorders in their early stages.As a result,this research looks at using the Random Forest(RF)classifier to diagnose the liver disease early on.The dataset was picked from the University of California,Irvine repository.RF’s accomplishments are contrasted to those of Multi-Layer Perceptron(MLP),Average One Dependency Estimator(A1DE),Support Vector Machine(SVM),Credal Decision Tree(CDT),Composite Hypercube on Iterated Random Projection(CHIRP),K-nearest neighbor(KNN),Naïve Bayes(NB),J48-Decision Tree(J48),and Forest by Penalizing Attributes(Forest-PA).Some of the assessment measures used to evaluate each classifier include Root Relative Squared Error(RRSE),Root Mean Squared Error(RMSE),accuracy,recall,precision,specificity,Matthew’s Correlation Coefficient(MCC),F-measure,and G-measure.RF has an RRSE performance of 87.6766 and an RMSE performance of 0.4328,however,its percentage accuracy is 72.1739.The widely acknowledged result of this work can be used as a starting point for subsequent research.As a result,every claim that a new model,framework,or method enhances forecastingmay be benchmarked and demonstrated. 展开更多
关键词 Liver ailment random forest machine learning
下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部