As some recent information security legislation endowed users with unconditional rights to be forgotten by any trained machine learning model,personalised IoT service pro-viders have to put unlearning functionality in...As some recent information security legislation endowed users with unconditional rights to be forgotten by any trained machine learning model,personalised IoT service pro-viders have to put unlearning functionality into their consideration.The most straight-forward method to unlearn users'contribution is to retrain the model from the initial state,which is not realistic in high throughput applications with frequent unlearning requests.Though some machine unlearning frameworks have been proposed to speed up the retraining process,they fail to match decentralised learning scenarios.A decentralised unlearning framework called heterogeneous decentralised unlearning framework with seed(HDUS)is designed,which uses distilled seed models to construct erasable en-sembles for all clients.Moreover,the framework is compatible with heterogeneous on-device models,representing stronger scalability in real-world applications.Extensive experiments on three real-world datasets show that our HDUS achieves state-of-the-art performance.展开更多
Ore production is usually affected by multiple influencing inputs at open-pit mines.Nevertheless,the complex nonlinear relationships between these inputs and ore production remain unclear.This becomes even more challe...Ore production is usually affected by multiple influencing inputs at open-pit mines.Nevertheless,the complex nonlinear relationships between these inputs and ore production remain unclear.This becomes even more challenging when training data(e.g.truck haulage information and weather conditions)are massive.In machine learning(ML)algorithms,deep neural network(DNN)is a superior method for processing nonlinear and massive data by adjusting the amount of neurons and hidden layers.This study adopted DNN to forecast ore production using truck haulage information and weather conditions at open-pit mines as training data.Before the prediction models were built,principal component analysis(PCA)was employed to reduce the data dimensionality and eliminate the multicollinearity among highly correlated input variables.To verify the superiority of DNN,three ANNs containing only one hidden layer and six traditional ML models were established as benchmark models.The DNN model with multiple hidden layers performed better than the ANN models with a single hidden layer.The DNN model outperformed the extensively applied benchmark models in predicting ore production.This can provide engineers and researchers with an accurate method to forecast ore production,which helps make sound budgetary decisions and mine planning at open-pit mines.展开更多
In order to reduce the horizontal crossing transportation problems between coal trucks and stripping trucks,large and small vehicles,and transport trucks and belt conveyors at key points of open pit mine in production...In order to reduce the horizontal crossing transportation problems between coal trucks and stripping trucks,large and small vehicles,and transport trucks and belt conveyors at key points of open pit mine in production,the separate transportation mode of underpass bridge and overpass steel trestle is proposed to optimize the open pit development transportation system,so as to solve the practical problems that the horizontal cross of transport vehicles causes vehicle blockage,affects production schedule and production safety.The results show that the horizontal crossing road can be changed into a separate type of overpass steel trestle,which can realize the classified transportation of large and small vehicles,reduce the traffic density,make vehicles with different functions go their own way,eliminate the hidden danger of traffic accidents,and improve the production efficiency.展开更多
A procedure to recognize individual discontinuities in rock mass from measurement while drilling(MWD)technology is developed,using the binary pattern of structural rock characteristics obtained from in-hole images for...A procedure to recognize individual discontinuities in rock mass from measurement while drilling(MWD)technology is developed,using the binary pattern of structural rock characteristics obtained from in-hole images for calibration.Data from two underground operations with different drilling technology and different rock mass characteristics are considered,which generalizes the application of the methodology to different sites and ensures the full operational integration of MWD data analysis.Two approaches are followed for site-specific structural model building:a discontinuity index(DI)built from variations in MWD parameters,and a machine learning(ML)classifier as function of the drilling parameters and their variability.The prediction ability of the models is quantitatively assessed as the rate of recognition of discontinuities observed in borehole logs.Differences between the parameters involved in the models for each site,and differences in their weights,highlight the site-dependence of the resulting models.The ML approach offers better performance than the classical DI,with recognition rates in the range 89%to 96%.However,the simpler DI still yields fairly accurate results,with recognition rates 70%to 90%.These results validate the adaptive MWD-based methodology as an engineering solution to predict rock structural condition in underground mining operations.展开更多
The outbreak of the pandemic,caused by Coronavirus Disease 2019(COVID-19),has affected the daily activities of people across the globe.During COVID-19 outbreak and the successive lockdowns,Twitter was heavily used and...The outbreak of the pandemic,caused by Coronavirus Disease 2019(COVID-19),has affected the daily activities of people across the globe.During COVID-19 outbreak and the successive lockdowns,Twitter was heavily used and the number of tweets regarding COVID-19 increased tremendously.Several studies used Sentiment Analysis(SA)to analyze the emotions expressed through tweets upon COVID-19.Therefore,in current study,a new Artificial Bee Colony(ABC)with Machine Learning-driven SA(ABCMLSA)model is developed for conducting Sentiment Analysis of COVID-19 Twitter data.The prime focus of the presented ABCML-SA model is to recognize the sentiments expressed in tweets made uponCOVID-19.It involves data pre-processing at the initial stage followed by n-gram based feature extraction to derive the feature vectors.For identification and classification of the sentiments,the Support Vector Machine(SVM)model is exploited.At last,the ABC algorithm is applied to fine tune the parameters involved in SVM.To demonstrate the improved performance of the proposed ABCML-SA model,a sequence of simulations was conducted.The comparative assessment results confirmed the effectual performance of the proposed ABCML-SA model over other approaches.展开更多
The traction characteristics of the grouser, cutting the simulative soil of deepsea sediment, with different tooth widths, tooth heights, and ground pressures are studied with traction characteristic test apparatus. A...The traction characteristics of the grouser, cutting the simulative soil of deepsea sediment, with different tooth widths, tooth heights, and ground pressures are studied with traction characteristic test apparatus. A traction-displacement model is obtained by combining the analysis of the cutting mechanism. The results show that the tractiondisplacement curves of grousers with different tooth widths, tooth heights, and ground pressures have the same changing trend, which matches the Wong traction model. Their sensitivity coefficient and shear modulus are slightly fluctuated. Therefore, the average values can be used as the traction model parameters. The maximum traction of the grouser with a two-side edge and a 10 mm tooth width increment changing with the tooth height and ground pressure can be determined according to the grousers with different tooth widths. By combining the traction model parameters, the traction-displacement curve of the grouser with a certain group values of tooth width, tooth height, and ground pressure can be predicted. Therefore, the slip of the mining machine can be prevented to improve the mining efficiency.展开更多
The risk of rockbursts is one of the main threats in hard coal mines. Compared to other underground mines, the number of factors contributing to the rockburst at underground coal mines is much greater.Factors such as ...The risk of rockbursts is one of the main threats in hard coal mines. Compared to other underground mines, the number of factors contributing to the rockburst at underground coal mines is much greater.Factors such as the coal seam tendency to rockbursts, the thickness of the coal seam, and the stress level in the seam have to be considered, but also the entire coal seam-surrounding rock system has to be evaluated when trying to predict the rockbursts. However, in hard coal mines, there are stroke or stress-stroke rockbursts in which the fracture of a thick layer of sandstone plays an essential role in predicting rockbursts. The occurrence of rockbursts in coal mines is complex, and their prediction is even more difficult than in other mines. In recent years, the interest in machine learning algorithms for solving complex nonlinear problems has increased, which also applies to geosciences. This study attempts to use machine learning algorithms, i.e. neural network, decision tree, random forest, gradient boosting, and extreme gradient boosting(XGB), to assess the rockburst hazard of an active hard coal mine in the Upper Silesian Coal Basin. The rock mass bursting tendency index WTGthat describes the tendency of the seam-surrounding rock system to rockbursts and the anomaly of the vertical stress component were applied for this purpose. Especially, the decision tree and neural network models were proved to be effective in correctly distinguishing rockbursts from tremors, after which the excavation was not damaged. On average, these models correctly classified about 80% of the rockbursts in the testing datasets.展开更多
According to the chaotic and non-linear characters of power load data,the time series matrix is established with the theory of phase-space reconstruction,and then Lyapunov exponents with chaotic time series are comput...According to the chaotic and non-linear characters of power load data,the time series matrix is established with the theory of phase-space reconstruction,and then Lyapunov exponents with chaotic time series are computed to determine the time delay and the embedding dimension.Due to different features of the data,data mining algorithm is conducted to classify the data into different groups.Redundant information is eliminated by the advantage of data mining technology,and the historical loads that have highly similar features with the forecasting day are searched by the system.As a result,the training data can be decreased and the computing speed can also be improved when constructing support vector machine(SVM) model.Then,SVM algorithm is used to predict power load with parameters that get in pretreatment.In order to prove the effectiveness of the new model,the calculation with data mining SVM algorithm is compared with that of single SVM and back propagation network.It can be seen that the new DSVM algorithm effectively improves the forecast accuracy by 0.75%,1.10% and 1.73% compared with SVM for two random dimensions of 11-dimension,14-dimension and BP network,respectively.This indicates that the DSVM gains perfect improvement effect in the short-term power load forecasting.展开更多
Discrimination of seismicity distributed in different areas is essential for reliable seismic risk assessment in mines.Although machine learning has been widely applied in seismic data processing,feasibility and relia...Discrimination of seismicity distributed in different areas is essential for reliable seismic risk assessment in mines.Although machine learning has been widely applied in seismic data processing,feasibility and reliability of applying this technique to classify spatially clustered seismic events in underground mines are yet to be investigated.In this research,two groups of seismic events with a minimum local magnitude(ML) of-3 were observed in an underground coal mine.They were respectively located around a dyke and the longwall face.Additionally,two types of undesired signals were also recorded.Four machine learning methods,i.e.random forest(RF),support vector machine(SVM),deep convolutional neural network(DCNN),and residual neural network(ResNN),were used for classifying these signals.The results obtained based on a primary dataset showed that these seismic events could be classified with at least 91% accuracy.The DCNN using seismogram images as the inputs reached the best performance with more than 94% accuracy.As mining is a dynamic progress which could change the characteristics of seismic signals,the temporal variance in the prediction performance of DCNN was also investigated to assess the reliability of this classifier during mining.A cascaded workflow consisting of database update,model training,signal prediction,and results review was established.By progressively calibrating the DCNN model,it achieved up to 99% prediction accuracy.The results demonstrated that machine learning is a reliable tool for the automatic discrimination of spatially clustered seismicity in underground mining.展开更多
Text classification or categorization is the procedure of automatically tagging a textual document with most related labels or classes.When the number of labels is limited to one,the task becomes single-label text cat...Text classification or categorization is the procedure of automatically tagging a textual document with most related labels or classes.When the number of labels is limited to one,the task becomes single-label text categorization.The Arabic texts include unstructured information also like English texts,and that is understandable for machine learning(ML)techniques,the text is changed and demonstrated by numerical value.In recent times,the dominant method for natural language processing(NLP)tasks is recurrent neural network(RNN),in general,long short termmemory(LSTM)and convolutional neural network(CNN).Deep learning(DL)models are currently presented for deriving a massive amount of text deep features to an optimum performance from distinct domains such as text detection,medical image analysis,and so on.This paper introduces aModified Dragonfly Optimization with Extreme Learning Machine for Text Representation and Recognition(MDFO-EMTRR)model onArabicCorpus.The presentedMDFO-EMTRR technique mainly concentrates on the recognition and classification of the Arabic text.To achieve this,theMDFO-EMTRRtechnique encompasses data pre-processing to transform the input data into compatible format.Next,the ELM model is utilized for the representation and recognition of the Arabic text.At last,the MDFO algorithm was exploited for optimal tuning of the parameters related to the ELM method and thereby accomplish enhanced classifier results.The experimental result analysis of the MDFO-EMTRR system was performed on benchmark datasets and attained maximum accuracy of 99.74%.展开更多
In 2021,the abnormal short-term price fluctuations of GameStop,which were triggered by internet stock discussions,drew the attention of academics,financial analysts,and stock trading commissions alike,prompting calls ...In 2021,the abnormal short-term price fluctuations of GameStop,which were triggered by internet stock discussions,drew the attention of academics,financial analysts,and stock trading commissions alike,prompting calls to address such events and maintain market stability.However,the impact of stock discussions on volatile trading behavior has received comparatively less attention than traditional fundamentals.Furthermore,data mining methods are less often used to predict stock trading despite their higher accuracy.This study adopts an innovative approach using social media data to obtain stock rumors,and then trains three decision trees to demonstrate the impact of rumor propagation on stock trading behavior.Our findings show that rumor propagation outperforms traditional fundamentals in predicting abnormal trading behavior.The study serves as an impetus for further research using data mining as a method of inquiry.展开更多
Urgent care clinics and emergency departments around the world periodically suffer from extended wait times beyond patient expectations due to surges in patient flows.The delays arising from inadequate staffing levels...Urgent care clinics and emergency departments around the world periodically suffer from extended wait times beyond patient expectations due to surges in patient flows.The delays arising from inadequate staffing levels during these periods have been linked with adverse clinical outcomes.Previous research into forecasting patient flows has mostly used statistical techniques.These studies have also predominately focussed on short‐term forecasts,which have limited practicality for the resourcing of medical personnel.This study joins an emerging body of work which seeks to explore the potential of machine learning algorithms to generate accurate forecasts of patient presentations.Our research uses datasets covering 10 years from two large urgent care clinics to develop long‐term patient flow forecasts up to one quarter ahead using a range of state‐of‐the‐art algo-rithms.A distinctive feature of this study is the use of eXplainable Artificial Intelligence(XAI)tools like Shapely and LIME that enable an in‐depth analysis of the behaviour of the models,which would otherwise be uninterpretable.These analysis tools enabled us to explore the ability of the models to adapt to the volatility in patient demand during the COVID‐19 pandemic lockdowns and to identify the most impactful variables,resulting in valuable insights into their performance.The results showed that a novel combination of advanced univariate models like Prophet as well as gradient boosting,into an ensemble,delivered the most accurate and consistent solutions on average.This approach generated improvements in the range of 16%-30%over the existing in‐house methods for esti-mating the daily patient flows 90 days ahead.展开更多
Learning Management System(LMS)is an application software that is used in automation,delivery,administration,tracking,and reporting of courses and programs in educational sector.The LMS which exploits machine learning...Learning Management System(LMS)is an application software that is used in automation,delivery,administration,tracking,and reporting of courses and programs in educational sector.The LMS which exploits machine learning(ML)has the ability of accessing user data and exploit it for improving the learning experience.The recently developed artificial intelligence(AI)and ML models helps to accomplish effective performance monitoring for LMS.Among the different processes involved in ML based LMS,feature selection and classification processesfind beneficial.In this motivation,this study introduces Glowworm-based Feature Selection with Machine Learning Enabled Performance Monitoring(GSO-MFWELM)technique for LMS.The key objective of the proposed GSO-MFWELM technique is to effectually monitor the performance in LMS.The pro-posed GSO-MFWELM technique involves GSO-based feature selection techni-que to select the optimal features.Besides,Weighted Extreme Learning Machine(WELM)model is applied for classification process whereas the parameters involved in WELM model are optimallyfine-tuned with the help of May-fly Optimization(MFO)algorithm.The design of GSO and MFO techniques result in reduced computation complexity and improved classification performance.The presented GSO-MFWELM technique was validated for its performance against benchmark dataset and the results were inspected under several aspects.The simulation results established the supremacy of GSO-MFWELM technique over recent approaches with the maximum classification accuracy of 0.9589.展开更多
Datamining plays a crucial role in extractingmeaningful knowledge fromlarge-scale data repositories,such as data warehouses and databases.Association rule mining,a fundamental process in data mining,involves discoveri...Datamining plays a crucial role in extractingmeaningful knowledge fromlarge-scale data repositories,such as data warehouses and databases.Association rule mining,a fundamental process in data mining,involves discovering correlations,patterns,and causal structures within datasets.In the healthcare domain,association rules offer valuable opportunities for building knowledge bases,enabling intelligent diagnoses,and extracting invaluable information rapidly.This paper presents a novel approach called the Machine Learning based Association Rule Mining and Classification for Healthcare Data Management System(MLARMC-HDMS).The MLARMC-HDMS technique integrates classification and association rule mining(ARM)processes.Initially,the chimp optimization algorithm-based feature selection(COAFS)technique is employed within MLARMC-HDMS to select relevant attributes.Inspired by the foraging behavior of chimpanzees,the COA algorithm mimics their search strategy for food.Subsequently,the classification process utilizes stochastic gradient descent with a multilayer perceptron(SGD-MLP)model,while the Apriori algorithm determines attribute relationships.We propose a COA-based feature selection approach for medical data classification using machine learning techniques.This approach involves selecting pertinent features from medical datasets through COA and training machine learning models using the reduced feature set.We evaluate the performance of our approach on various medical datasets employing diverse machine learning classifiers.Experimental results demonstrate that our proposed approach surpasses alternative feature selection methods,achieving higher accuracy and precision rates in medical data classification tasks.The study showcases the effectiveness and efficiency of the COA-based feature selection approach in identifying relevant features,thereby enhancing the diagnosis and treatment of various diseases.To provide further validation,we conduct detailed experiments on a benchmark medical dataset,revealing the superiority of the MLARMCHDMS model over other methods,with a maximum accuracy of 99.75%.Therefore,this research contributes to the advancement of feature selection techniques in medical data classification and highlights the potential for improving healthcare outcomes through accurate and efficient data analysis.The presented MLARMC-HDMS framework and COA-based feature selection approach offer valuable insights for researchers and practitioners working in the field of healthcare data mining and machine learning.展开更多
Open-source and free tools are readily available to the public to process data and assist producers in making management decisions related to agricultural landscapes. On-the-go soil sensors are being used as a proxy t...Open-source and free tools are readily available to the public to process data and assist producers in making management decisions related to agricultural landscapes. On-the-go soil sensors are being used as a proxy to develop digital soil maps because of the data they can collect and their ability to cover a large area quickly. Machine learning, a subcomponent of artificial intelligence, makes predictions from data. Intermixing open-source tools, on-the-go sensor technologies, and machine learning may improve Mississippi soil mapping and crop production. This study aimed to evaluate machine learning for mapping apparent soil electrical conductivity (EC<sub>a</sub>) collected with an on-the-go sensor system at two sites (i.e., MF2, MF9) on a research farm in Mississippi. Machine learning tools (support vector machine) incorporated in Smart-Map, an open-source application, were used to evaluate the sites and derive the apparent electrical conductivity maps. Autocorrelation of the shallow (EC<sub>as</sub>) and deep (EC<sub>ad</sub>) readings was statistically significant at both locations (Moran’s I, p 0.001);however, the spatial correlation was greater at MF2. According to the leave-one-out cross-validation results, the best models were developed for EC<sub>as</sub> versus EC<sub>ad</sub>. Spatial patterns were observed for the EC<sub>as</sub> and EC<sub>ad</sub> readings in both fields. The patterns observed for the EC<sub>ad</sub> readings were more distinct than the EC<sub>as</sub> measurements. The research results indicated that machine learning was valuable for deriving apparent electrical conductivity maps in two Mississippi fields. Location and depth played a role in the machine learner’s ability to develop maps.展开更多
This study was conducted to establish a Support Vector Machines(SVM)-Markov Chain prediction model for prediction of mining water inflow. According to the raw data sequence, the Support Vector Machines(SVM) model was ...This study was conducted to establish a Support Vector Machines(SVM)-Markov Chain prediction model for prediction of mining water inflow. According to the raw data sequence, the Support Vector Machines(SVM) model was built, and then revised by means of a Markov state change probability matrix. Through dividing the state and analyzing absolute errors and relative errors and other indexes of the measured value and the fitted value of SVM, the prediction results were improved. Finally,the model was used to calculate relative errors. Through predicting and analyzing mining water inflow, the prediction results of the model were satisfactory. The results of this study enlarge the application scope of the Support Vector Machines(SVM) prediction model and provide a new method for scientific forecasting water inflow in coal mining.展开更多
Movies are the better source of entertainment.Every year,a great percentage of movies are released.People comment on movies in the form of reviews after watching them.Since it is difficult to read all of the reviews f...Movies are the better source of entertainment.Every year,a great percentage of movies are released.People comment on movies in the form of reviews after watching them.Since it is difficult to read all of the reviews for a movie,summarizing all of the reviews will help make this decision without wasting time in reading all of the reviews.Opinion mining also known as sentiment analysis is the process of extracting subjective information from textual data.Opinion mining involves identifying and extracting the opinions of individuals,which can be positive,neutral,or negative.The task of opinion mining also called sentiment analysis is performed to understand people’s emotions and attitudes in movie reviews.Movie reviews are an important source of opinion data because they provide insight into the general public’s opinions about a particular movie.The summary of all reviews can give a general idea about the movie.This study compares baseline techniques,Logistic Regression,Random Forest Classifier,Decision Tree,K-Nearest Neighbor,Gradient Boosting Classifier,and Passive Aggressive Classifier with Linear Support Vector Machines and Multinomial Naïve Bayes on the IMDB Dataset of 50K reviews and Sentiment Polarity Dataset Version 2.0.Before applying these classifiers,in pre-processing both datasets are cleaned,duplicate data is dropped and chat words are treated for better results.On the IMDB Dataset of 50K reviews,Linear Support Vector Machines achieve the highest accuracy of 89.48%,and after hyperparameter tuning,the Passive Aggressive Classifier achieves the highest accuracy of 90.27%,while Multinomial Nave Bayes achieves the highest accuracy of 70.69%and 71.04%after hyperparameter tuning on the Sentiment Polarity Dataset Version 2.0.This study highlights the importance of sentiment analysis as a tool for understanding the emotions and attitudes in movie reviews and predicts the performance of a movie based on the average sentiment of all the reviews.展开更多
An approach which combines particle swarm optimization and support vector machine(PSO–SVM)is proposed to forecast large-scale goaf instability(LSGI).Firstly,influencing factors of goaf safety are analyzed,and followi...An approach which combines particle swarm optimization and support vector machine(PSO–SVM)is proposed to forecast large-scale goaf instability(LSGI).Firstly,influencing factors of goaf safety are analyzed,and following parameters were selected as evaluation indexes in the LSGI:uniaxial compressive strength(UCS)of rock,elastic modulus(E)of rock,rock quality designation(RQD),area ration of pillar(Sp),the ratio of width to height of the pillar(w/h),depth of ore body(H),volume of goaf(V),dip of ore body(a)and area of goaf(Sg).Then LSGI forecasting model by PSO-SVM was established according to the influencing factors.The performance of hybrid model(PSO+SVM=PSO–SVM)has been compared with the grid search method of support vector machine(GSM–SVM)model.The actual data of 40 goafs are applied to research the forecasting ability of the proposed method,and two cases of underground mine are also validated by the proposed model.The results indicated that the heuristic algorithm of PSO can speed up the SVM parameter optimization search,and the predictive ability of the PSO–SVM model with the RBF kernel function is acceptable and robust,which might hold a high potential to become a useful tool in goaf risky prediction research.展开更多
Aiming to identify policy topics and their evolutionary logic that enhance the digital and green development(dual development)of traditional manufacturing enterprises,address weaknesses in current policies,and provide...Aiming to identify policy topics and their evolutionary logic that enhance the digital and green development(dual development)of traditional manufacturing enterprises,address weaknesses in current policies,and provide resources for refining dual development policies,a total of 15954 dual development-related policies issued by national and various departmental authorities in China from January 2000 to August 2023 were analyzed.Based on topic modeling techniques and the policy modeling consistency(PMC)framework,the evolution of policy topics was visualized,and a dynamic assessment of the policies was conducted.The results show that the digital and green development policy framework is progressively refined,and the governance philosophy shifts from a“regulatory government”paradigm to a“service-oriented government”.The support pattern evolves from“dispersed matching”to“integrated symbiosis”.However,there are still significant deficiencies in departmental cooperation,balanced measures,coordinated links,and multi-stakeholder participation.Future policy improvements should,therefore,focus on guiding multi-stakeholder participation,enhancing public demand orientation,and addressing the entire value chain.These steps aim to create an open and shared digital industry ecosystem to promote the coordinated dual development of traditional manufacturing enterprises.展开更多
Laser cladding of 316 L steel powders on pick substrate of coal mining machine was conducted, and microstructure of laser cladding coating was analyzed. The micro-hardness of laser cladding coating was examined. The r...Laser cladding of 316 L steel powders on pick substrate of coal mining machine was conducted, and microstructure of laser cladding coating was analyzed. The micro-hardness of laser cladding coating was examined. The results show that microstructure of laser cladding zone is exiguous dentrite, and there are hard spots dispersible distribution in the laser cladding zone. Performances of erode-resistant, surface micro-hardness and wear-resistant are improved obviously.展开更多
基金Australian Research Council,Grant/Award Numbers:FT210100624,DP190101985,DE230101033。
文摘As some recent information security legislation endowed users with unconditional rights to be forgotten by any trained machine learning model,personalised IoT service pro-viders have to put unlearning functionality into their consideration.The most straight-forward method to unlearn users'contribution is to retrain the model from the initial state,which is not realistic in high throughput applications with frequent unlearning requests.Though some machine unlearning frameworks have been proposed to speed up the retraining process,they fail to match decentralised learning scenarios.A decentralised unlearning framework called heterogeneous decentralised unlearning framework with seed(HDUS)is designed,which uses distilled seed models to construct erasable en-sembles for all clients.Moreover,the framework is compatible with heterogeneous on-device models,representing stronger scalability in real-world applications.Extensive experiments on three real-world datasets show that our HDUS achieves state-of-the-art performance.
基金This work was supported by the Pilot Seed Grant(Grant No.RES0049944)the Collaborative Research Project(Grant No.RES0043251)from the University of Alberta.
文摘Ore production is usually affected by multiple influencing inputs at open-pit mines.Nevertheless,the complex nonlinear relationships between these inputs and ore production remain unclear.This becomes even more challenging when training data(e.g.truck haulage information and weather conditions)are massive.In machine learning(ML)algorithms,deep neural network(DNN)is a superior method for processing nonlinear and massive data by adjusting the amount of neurons and hidden layers.This study adopted DNN to forecast ore production using truck haulage information and weather conditions at open-pit mines as training data.Before the prediction models were built,principal component analysis(PCA)was employed to reduce the data dimensionality and eliminate the multicollinearity among highly correlated input variables.To verify the superiority of DNN,three ANNs containing only one hidden layer and six traditional ML models were established as benchmark models.The DNN model with multiple hidden layers performed better than the ANN models with a single hidden layer.The DNN model outperformed the extensively applied benchmark models in predicting ore production.This can provide engineers and researchers with an accurate method to forecast ore production,which helps make sound budgetary decisions and mine planning at open-pit mines.
文摘In order to reduce the horizontal crossing transportation problems between coal trucks and stripping trucks,large and small vehicles,and transport trucks and belt conveyors at key points of open pit mine in production,the separate transportation mode of underpass bridge and overpass steel trestle is proposed to optimize the open pit development transportation system,so as to solve the practical problems that the horizontal cross of transport vehicles causes vehicle blockage,affects production schedule and production safety.The results show that the horizontal crossing road can be changed into a separate type of overpass steel trestle,which can realize the classified transportation of large and small vehicles,reduce the traffic density,make vehicles with different functions go their own way,eliminate the hidden danger of traffic accidents,and improve the production efficiency.
基金conducted under the illu MINEation project, funded by the European Union’s Horizon 2020 research and innovation program under grant agreement (No. 869379)supported by the China Scholarship Council (No. 202006370006)
文摘A procedure to recognize individual discontinuities in rock mass from measurement while drilling(MWD)technology is developed,using the binary pattern of structural rock characteristics obtained from in-hole images for calibration.Data from two underground operations with different drilling technology and different rock mass characteristics are considered,which generalizes the application of the methodology to different sites and ensures the full operational integration of MWD data analysis.Two approaches are followed for site-specific structural model building:a discontinuity index(DI)built from variations in MWD parameters,and a machine learning(ML)classifier as function of the drilling parameters and their variability.The prediction ability of the models is quantitatively assessed as the rate of recognition of discontinuities observed in borehole logs.Differences between the parameters involved in the models for each site,and differences in their weights,highlight the site-dependence of the resulting models.The ML approach offers better performance than the classical DI,with recognition rates in the range 89%to 96%.However,the simpler DI still yields fairly accurate results,with recognition rates 70%to 90%.These results validate the adaptive MWD-based methodology as an engineering solution to predict rock structural condition in underground mining operations.
基金The Deanship of ScientificResearch (DSR)at King Abdulaziz University,Jeddah,Saudi Arabia has funded this project,under Grant No. (FP-205-43).
文摘The outbreak of the pandemic,caused by Coronavirus Disease 2019(COVID-19),has affected the daily activities of people across the globe.During COVID-19 outbreak and the successive lockdowns,Twitter was heavily used and the number of tweets regarding COVID-19 increased tremendously.Several studies used Sentiment Analysis(SA)to analyze the emotions expressed through tweets upon COVID-19.Therefore,in current study,a new Artificial Bee Colony(ABC)with Machine Learning-driven SA(ABCMLSA)model is developed for conducting Sentiment Analysis of COVID-19 Twitter data.The prime focus of the presented ABCML-SA model is to recognize the sentiments expressed in tweets made uponCOVID-19.It involves data pre-processing at the initial stage followed by n-gram based feature extraction to derive the feature vectors.For identification and classification of the sentiments,the Support Vector Machine(SVM)model is exploited.At last,the ABC algorithm is applied to fine tune the parameters involved in SVM.To demonstrate the improved performance of the proposed ABCML-SA model,a sequence of simulations was conducted.The comparative assessment results confirmed the effectual performance of the proposed ABCML-SA model over other approaches.
基金Project supported by the National Natural Science Foundation of China(No.51274251)
文摘The traction characteristics of the grouser, cutting the simulative soil of deepsea sediment, with different tooth widths, tooth heights, and ground pressures are studied with traction characteristic test apparatus. A traction-displacement model is obtained by combining the analysis of the cutting mechanism. The results show that the tractiondisplacement curves of grousers with different tooth widths, tooth heights, and ground pressures have the same changing trend, which matches the Wong traction model. Their sensitivity coefficient and shear modulus are slightly fluctuated. Therefore, the average values can be used as the traction model parameters. The maximum traction of the grouser with a two-side edge and a 10 mm tooth width increment changing with the tooth height and ground pressure can be determined according to the grousers with different tooth widths. By combining the traction model parameters, the traction-displacement curve of the grouser with a certain group values of tooth width, tooth height, and ground pressure can be predicted. Therefore, the slip of the mining machine can be prevented to improve the mining efficiency.
基金supported by the Ministry of Science and Higher Education, Republic of Poland (Statutory Activity of the Central Mining Institute, Grant No. 11133010)
文摘The risk of rockbursts is one of the main threats in hard coal mines. Compared to other underground mines, the number of factors contributing to the rockburst at underground coal mines is much greater.Factors such as the coal seam tendency to rockbursts, the thickness of the coal seam, and the stress level in the seam have to be considered, but also the entire coal seam-surrounding rock system has to be evaluated when trying to predict the rockbursts. However, in hard coal mines, there are stroke or stress-stroke rockbursts in which the fracture of a thick layer of sandstone plays an essential role in predicting rockbursts. The occurrence of rockbursts in coal mines is complex, and their prediction is even more difficult than in other mines. In recent years, the interest in machine learning algorithms for solving complex nonlinear problems has increased, which also applies to geosciences. This study attempts to use machine learning algorithms, i.e. neural network, decision tree, random forest, gradient boosting, and extreme gradient boosting(XGB), to assess the rockburst hazard of an active hard coal mine in the Upper Silesian Coal Basin. The rock mass bursting tendency index WTGthat describes the tendency of the seam-surrounding rock system to rockbursts and the anomaly of the vertical stress component were applied for this purpose. Especially, the decision tree and neural network models were proved to be effective in correctly distinguishing rockbursts from tremors, after which the excavation was not damaged. On average, these models correctly classified about 80% of the rockbursts in the testing datasets.
基金Project(70671039) supported by the National Natural Science Foundation of China
文摘According to the chaotic and non-linear characters of power load data,the time series matrix is established with the theory of phase-space reconstruction,and then Lyapunov exponents with chaotic time series are computed to determine the time delay and the embedding dimension.Due to different features of the data,data mining algorithm is conducted to classify the data into different groups.Redundant information is eliminated by the advantage of data mining technology,and the historical loads that have highly similar features with the forecasting day are searched by the system.As a result,the training data can be decreased and the computing speed can also be improved when constructing support vector machine(SVM) model.Then,SVM algorithm is used to predict power load with parameters that get in pretreatment.In order to prove the effectiveness of the new model,the calculation with data mining SVM algorithm is compared with that of single SVM and back propagation network.It can be seen that the new DSVM algorithm effectively improves the forecast accuracy by 0.75%,1.10% and 1.73% compared with SVM for two random dimensions of 11-dimension,14-dimension and BP network,respectively.This indicates that the DSVM gains perfect improvement effect in the short-term power load forecasting.
基金the Australia Coal Association Research Program(ACARP)(Grant Nos.C26006 and C26053)Supports from CSIRO。
文摘Discrimination of seismicity distributed in different areas is essential for reliable seismic risk assessment in mines.Although machine learning has been widely applied in seismic data processing,feasibility and reliability of applying this technique to classify spatially clustered seismic events in underground mines are yet to be investigated.In this research,two groups of seismic events with a minimum local magnitude(ML) of-3 were observed in an underground coal mine.They were respectively located around a dyke and the longwall face.Additionally,two types of undesired signals were also recorded.Four machine learning methods,i.e.random forest(RF),support vector machine(SVM),deep convolutional neural network(DCNN),and residual neural network(ResNN),were used for classifying these signals.The results obtained based on a primary dataset showed that these seismic events could be classified with at least 91% accuracy.The DCNN using seismogram images as the inputs reached the best performance with more than 94% accuracy.As mining is a dynamic progress which could change the characteristics of seismic signals,the temporal variance in the prediction performance of DCNN was also investigated to assess the reliability of this classifier during mining.A cascaded workflow consisting of database update,model training,signal prediction,and results review was established.By progressively calibrating the DCNN model,it achieved up to 99% prediction accuracy.The results demonstrated that machine learning is a reliable tool for the automatic discrimination of spatially clustered seismicity in underground mining.
基金Princess Nourah bint Abdulrahman University Researchers Supporting Project number(PNURSP2022R263),Princess Nourah bint Abdulrahman University,Riyadh,Saudi Arabiathe Deanship of Scientific Research at Umm Al-Qura University for supporting this work by Grant Code:22UQU4340237DSR35.
文摘Text classification or categorization is the procedure of automatically tagging a textual document with most related labels or classes.When the number of labels is limited to one,the task becomes single-label text categorization.The Arabic texts include unstructured information also like English texts,and that is understandable for machine learning(ML)techniques,the text is changed and demonstrated by numerical value.In recent times,the dominant method for natural language processing(NLP)tasks is recurrent neural network(RNN),in general,long short termmemory(LSTM)and convolutional neural network(CNN).Deep learning(DL)models are currently presented for deriving a massive amount of text deep features to an optimum performance from distinct domains such as text detection,medical image analysis,and so on.This paper introduces aModified Dragonfly Optimization with Extreme Learning Machine for Text Representation and Recognition(MDFO-EMTRR)model onArabicCorpus.The presentedMDFO-EMTRR technique mainly concentrates on the recognition and classification of the Arabic text.To achieve this,theMDFO-EMTRRtechnique encompasses data pre-processing to transform the input data into compatible format.Next,the ELM model is utilized for the representation and recognition of the Arabic text.At last,the MDFO algorithm was exploited for optimal tuning of the parameters related to the ELM method and thereby accomplish enhanced classifier results.The experimental result analysis of the MDFO-EMTRR system was performed on benchmark datasets and attained maximum accuracy of 99.74%.
基金supported by the National Science and Technology Council,Taiwan,under grants MOST 108-2410-H-027-020,MOST 109-2410-H-027-009-MY2 and MOST 111-2410-H-027-011-MY3.
文摘In 2021,the abnormal short-term price fluctuations of GameStop,which were triggered by internet stock discussions,drew the attention of academics,financial analysts,and stock trading commissions alike,prompting calls to address such events and maintain market stability.However,the impact of stock discussions on volatile trading behavior has received comparatively less attention than traditional fundamentals.Furthermore,data mining methods are less often used to predict stock trading despite their higher accuracy.This study adopts an innovative approach using social media data to obtain stock rumors,and then trains three decision trees to demonstrate the impact of rumor propagation on stock trading behavior.Our findings show that rumor propagation outperforms traditional fundamentals in predicting abnormal trading behavior.The study serves as an impetus for further research using data mining as a method of inquiry.
文摘Urgent care clinics and emergency departments around the world periodically suffer from extended wait times beyond patient expectations due to surges in patient flows.The delays arising from inadequate staffing levels during these periods have been linked with adverse clinical outcomes.Previous research into forecasting patient flows has mostly used statistical techniques.These studies have also predominately focussed on short‐term forecasts,which have limited practicality for the resourcing of medical personnel.This study joins an emerging body of work which seeks to explore the potential of machine learning algorithms to generate accurate forecasts of patient presentations.Our research uses datasets covering 10 years from two large urgent care clinics to develop long‐term patient flow forecasts up to one quarter ahead using a range of state‐of‐the‐art algo-rithms.A distinctive feature of this study is the use of eXplainable Artificial Intelligence(XAI)tools like Shapely and LIME that enable an in‐depth analysis of the behaviour of the models,which would otherwise be uninterpretable.These analysis tools enabled us to explore the ability of the models to adapt to the volatility in patient demand during the COVID‐19 pandemic lockdowns and to identify the most impactful variables,resulting in valuable insights into their performance.The results showed that a novel combination of advanced univariate models like Prophet as well as gradient boosting,into an ensemble,delivered the most accurate and consistent solutions on average.This approach generated improvements in the range of 16%-30%over the existing in‐house methods for esti-mating the daily patient flows 90 days ahead.
基金supported by the Researchers Supporting Program(TUMA-Project2021-27)Almaarefa University,RiyadhSaudi Arabia.Taif University Researchers Supporting Project number(TURSP-2020/161)Taif University,Taif,Saudi Arabia.
文摘Learning Management System(LMS)is an application software that is used in automation,delivery,administration,tracking,and reporting of courses and programs in educational sector.The LMS which exploits machine learning(ML)has the ability of accessing user data and exploit it for improving the learning experience.The recently developed artificial intelligence(AI)and ML models helps to accomplish effective performance monitoring for LMS.Among the different processes involved in ML based LMS,feature selection and classification processesfind beneficial.In this motivation,this study introduces Glowworm-based Feature Selection with Machine Learning Enabled Performance Monitoring(GSO-MFWELM)technique for LMS.The key objective of the proposed GSO-MFWELM technique is to effectually monitor the performance in LMS.The pro-posed GSO-MFWELM technique involves GSO-based feature selection techni-que to select the optimal features.Besides,Weighted Extreme Learning Machine(WELM)model is applied for classification process whereas the parameters involved in WELM model are optimallyfine-tuned with the help of May-fly Optimization(MFO)algorithm.The design of GSO and MFO techniques result in reduced computation complexity and improved classification performance.The presented GSO-MFWELM technique was validated for its performance against benchmark dataset and the results were inspected under several aspects.The simulation results established the supremacy of GSO-MFWELM technique over recent approaches with the maximum classification accuracy of 0.9589.
基金Deputyship for Research&Innovation,Ministry of Education in Saudi Arabia for funding this research work through the Project Number RI-44-0444.
文摘Datamining plays a crucial role in extractingmeaningful knowledge fromlarge-scale data repositories,such as data warehouses and databases.Association rule mining,a fundamental process in data mining,involves discovering correlations,patterns,and causal structures within datasets.In the healthcare domain,association rules offer valuable opportunities for building knowledge bases,enabling intelligent diagnoses,and extracting invaluable information rapidly.This paper presents a novel approach called the Machine Learning based Association Rule Mining and Classification for Healthcare Data Management System(MLARMC-HDMS).The MLARMC-HDMS technique integrates classification and association rule mining(ARM)processes.Initially,the chimp optimization algorithm-based feature selection(COAFS)technique is employed within MLARMC-HDMS to select relevant attributes.Inspired by the foraging behavior of chimpanzees,the COA algorithm mimics their search strategy for food.Subsequently,the classification process utilizes stochastic gradient descent with a multilayer perceptron(SGD-MLP)model,while the Apriori algorithm determines attribute relationships.We propose a COA-based feature selection approach for medical data classification using machine learning techniques.This approach involves selecting pertinent features from medical datasets through COA and training machine learning models using the reduced feature set.We evaluate the performance of our approach on various medical datasets employing diverse machine learning classifiers.Experimental results demonstrate that our proposed approach surpasses alternative feature selection methods,achieving higher accuracy and precision rates in medical data classification tasks.The study showcases the effectiveness and efficiency of the COA-based feature selection approach in identifying relevant features,thereby enhancing the diagnosis and treatment of various diseases.To provide further validation,we conduct detailed experiments on a benchmark medical dataset,revealing the superiority of the MLARMCHDMS model over other methods,with a maximum accuracy of 99.75%.Therefore,this research contributes to the advancement of feature selection techniques in medical data classification and highlights the potential for improving healthcare outcomes through accurate and efficient data analysis.The presented MLARMC-HDMS framework and COA-based feature selection approach offer valuable insights for researchers and practitioners working in the field of healthcare data mining and machine learning.
文摘Open-source and free tools are readily available to the public to process data and assist producers in making management decisions related to agricultural landscapes. On-the-go soil sensors are being used as a proxy to develop digital soil maps because of the data they can collect and their ability to cover a large area quickly. Machine learning, a subcomponent of artificial intelligence, makes predictions from data. Intermixing open-source tools, on-the-go sensor technologies, and machine learning may improve Mississippi soil mapping and crop production. This study aimed to evaluate machine learning for mapping apparent soil electrical conductivity (EC<sub>a</sub>) collected with an on-the-go sensor system at two sites (i.e., MF2, MF9) on a research farm in Mississippi. Machine learning tools (support vector machine) incorporated in Smart-Map, an open-source application, were used to evaluate the sites and derive the apparent electrical conductivity maps. Autocorrelation of the shallow (EC<sub>as</sub>) and deep (EC<sub>ad</sub>) readings was statistically significant at both locations (Moran’s I, p 0.001);however, the spatial correlation was greater at MF2. According to the leave-one-out cross-validation results, the best models were developed for EC<sub>as</sub> versus EC<sub>ad</sub>. Spatial patterns were observed for the EC<sub>as</sub> and EC<sub>ad</sub> readings in both fields. The patterns observed for the EC<sub>ad</sub> readings were more distinct than the EC<sub>as</sub> measurements. The research results indicated that machine learning was valuable for deriving apparent electrical conductivity maps in two Mississippi fields. Location and depth played a role in the machine learner’s ability to develop maps.
文摘This study was conducted to establish a Support Vector Machines(SVM)-Markov Chain prediction model for prediction of mining water inflow. According to the raw data sequence, the Support Vector Machines(SVM) model was built, and then revised by means of a Markov state change probability matrix. Through dividing the state and analyzing absolute errors and relative errors and other indexes of the measured value and the fitted value of SVM, the prediction results were improved. Finally,the model was used to calculate relative errors. Through predicting and analyzing mining water inflow, the prediction results of the model were satisfactory. The results of this study enlarge the application scope of the Support Vector Machines(SVM) prediction model and provide a new method for scientific forecasting water inflow in coal mining.
文摘Movies are the better source of entertainment.Every year,a great percentage of movies are released.People comment on movies in the form of reviews after watching them.Since it is difficult to read all of the reviews for a movie,summarizing all of the reviews will help make this decision without wasting time in reading all of the reviews.Opinion mining also known as sentiment analysis is the process of extracting subjective information from textual data.Opinion mining involves identifying and extracting the opinions of individuals,which can be positive,neutral,or negative.The task of opinion mining also called sentiment analysis is performed to understand people’s emotions and attitudes in movie reviews.Movie reviews are an important source of opinion data because they provide insight into the general public’s opinions about a particular movie.The summary of all reviews can give a general idea about the movie.This study compares baseline techniques,Logistic Regression,Random Forest Classifier,Decision Tree,K-Nearest Neighbor,Gradient Boosting Classifier,and Passive Aggressive Classifier with Linear Support Vector Machines and Multinomial Naïve Bayes on the IMDB Dataset of 50K reviews and Sentiment Polarity Dataset Version 2.0.Before applying these classifiers,in pre-processing both datasets are cleaned,duplicate data is dropped and chat words are treated for better results.On the IMDB Dataset of 50K reviews,Linear Support Vector Machines achieve the highest accuracy of 89.48%,and after hyperparameter tuning,the Passive Aggressive Classifier achieves the highest accuracy of 90.27%,while Multinomial Nave Bayes achieves the highest accuracy of 70.69%and 71.04%after hyperparameter tuning on the Sentiment Polarity Dataset Version 2.0.This study highlights the importance of sentiment analysis as a tool for understanding the emotions and attitudes in movie reviews and predicts the performance of a movie based on the average sentiment of all the reviews.
基金supported by the National Basic Research Program Project of China(No.2010CB732004)the National Natural Science Foundation Project of China(Nos.50934006 and41272304)+2 种基金the Graduated Students’ResearchInnovation Fund Project of Hunan Province of China(No.CX2011B119)the Scholarship Award for Excellent Doctoral Student of Ministry of Education of China and the Valuable Equipment Open Sharing Fund of Central South University(No.1343-76140000022)
文摘An approach which combines particle swarm optimization and support vector machine(PSO–SVM)is proposed to forecast large-scale goaf instability(LSGI).Firstly,influencing factors of goaf safety are analyzed,and following parameters were selected as evaluation indexes in the LSGI:uniaxial compressive strength(UCS)of rock,elastic modulus(E)of rock,rock quality designation(RQD),area ration of pillar(Sp),the ratio of width to height of the pillar(w/h),depth of ore body(H),volume of goaf(V),dip of ore body(a)and area of goaf(Sg).Then LSGI forecasting model by PSO-SVM was established according to the influencing factors.The performance of hybrid model(PSO+SVM=PSO–SVM)has been compared with the grid search method of support vector machine(GSM–SVM)model.The actual data of 40 goafs are applied to research the forecasting ability of the proposed method,and two cases of underground mine are also validated by the proposed model.The results indicated that the heuristic algorithm of PSO can speed up the SVM parameter optimization search,and the predictive ability of the PSO–SVM model with the RBF kernel function is acceptable and robust,which might hold a high potential to become a useful tool in goaf risky prediction research.
基金The National Natural Science Foundation of China(No.71973023,42277493).
文摘Aiming to identify policy topics and their evolutionary logic that enhance the digital and green development(dual development)of traditional manufacturing enterprises,address weaknesses in current policies,and provide resources for refining dual development policies,a total of 15954 dual development-related policies issued by national and various departmental authorities in China from January 2000 to August 2023 were analyzed.Based on topic modeling techniques and the policy modeling consistency(PMC)framework,the evolution of policy topics was visualized,and a dynamic assessment of the policies was conducted.The results show that the digital and green development policy framework is progressively refined,and the governance philosophy shifts from a“regulatory government”paradigm to a“service-oriented government”.The support pattern evolves from“dispersed matching”to“integrated symbiosis”.However,there are still significant deficiencies in departmental cooperation,balanced measures,coordinated links,and multi-stakeholder participation.Future policy improvements should,therefore,focus on guiding multi-stakeholder participation,enhancing public demand orientation,and addressing the entire value chain.These steps aim to create an open and shared digital industry ecosystem to promote the coordinated dual development of traditional manufacturing enterprises.
文摘Laser cladding of 316 L steel powders on pick substrate of coal mining machine was conducted, and microstructure of laser cladding coating was analyzed. The micro-hardness of laser cladding coating was examined. The results show that microstructure of laser cladding zone is exiguous dentrite, and there are hard spots dispersible distribution in the laser cladding zone. Performances of erode-resistant, surface micro-hardness and wear-resistant are improved obviously.