The mesoscale eddy(ME)has a significant influence on the convergence effect in deep-sea acoustic propagation.This paper use statistical approaches to express quantitative relationships between the ME conditions and co...The mesoscale eddy(ME)has a significant influence on the convergence effect in deep-sea acoustic propagation.This paper use statistical approaches to express quantitative relationships between the ME conditions and convergence zone(CZ)characteristics.Based on the Gaussian vortex model,we construct various sound propagation scenarios under different eddy conditions,and carry out sound propagation experiments to obtain simulation samples.With a large number of samples,we first adopt the unified regression to set up analytic relationships between eddy conditions and CZ parameters.The sensitivity of eddy indicators to the CZ is quantitatively analyzed.Then,we adopt the machine learning(ML)algorithms to establish prediction models of CZ parameters by exploring the nonlinear relationships between multiple ME indicators and CZ parameters.Through the research,we can express the influence of ME on the CZ quantitatively,and achieve the rapid prediction of CZ parameters in ocean eddies.The prediction accuracy(R)of the CZ distance(mean R:0.9815)is obviously better than that of the CZ width(mean R:0.8728).Among the three ML algorithms,Gradient Boosting Decision Tree has the best prediction ability(root mean square error(RMSE):0.136),followed by Random Forest(RMSE:0.441)and Extreme Learning Machine(RMSE:0.518).展开更多
The rapid proliferation of Internet of Things(IoT)technology has facilitated automation across various sectors.Nevertheless,this advancement has also resulted in a notable surge in cyberattacks,notably botnets.As a re...The rapid proliferation of Internet of Things(IoT)technology has facilitated automation across various sectors.Nevertheless,this advancement has also resulted in a notable surge in cyberattacks,notably botnets.As a result,research on network analysis has become vital.Machine learning-based techniques for network analysis provide a more extensive and adaptable approach in comparison to traditional rule-based methods.In this paper,we propose a framework for analyzing communications between IoT devices using supervised learning and ensemble techniques and present experimental results that validate the efficacy of the proposed framework.The results indicate that using the proposed ensemble techniques improves accuracy by up to 1.7%compared to singlealgorithm approaches.These results also suggest that the proposed framework can flexibly adapt to general IoT network analysis scenarios.Unlike existing frameworks,which only exhibit high performance in specific situations,the proposed framework can serve as a fundamental approach for addressing a wide range of issues.展开更多
To enhance the safety of road traffic operations,this paper proposed a model based on stacking integrated learning utilizing American road traffic accident statistics.Initially,the process involved data cleaning,trans...To enhance the safety of road traffic operations,this paper proposed a model based on stacking integrated learning utilizing American road traffic accident statistics.Initially,the process involved data cleaning,transformation,and normalization.Subsequently,various classification models were constructed,including logistic regression,k-nearest neighbors,gradient boosting,decision trees,AdaBoost,and extra trees models.Evaluation metrics such as accuracy,precision,recall,F1 score,and Hamming loss were employed.Upon analysis,the passive-aggressive classifier model exhibited superior comprehensive indices compared to other models.Based on the model’s output results,an in-depth examination of the factors influencing traffic accidents was conducted.Additionally,measures and suggestions aimed at reducing the incidence of severe traffic accidents were presented.These findings served as a valuable reference for mitigating the occurrence of traffic accidents.展开更多
To address the challenges of current college student employment management,this study designed and implemented a machine learning-based decision support system for college student employment management.The system coll...To address the challenges of current college student employment management,this study designed and implemented a machine learning-based decision support system for college student employment management.The system collects and analyzes multidimensional data,uses machine learning algorithms for prediction and matching,provides personalized employment guidance for students,and provides decision support for universities and enterprises.The research results indicate that the system can effectively improve the efficiency and accuracy of employment guidance,promote school-enterprise cooperation,and achieve a win-win situation for all parties.展开更多
Check dams are widely used on the Loess Plateau in China to control soil and water losses,develop agricultural land,and improve watershed ecology.Detailed information on the number and spatial distribution of check da...Check dams are widely used on the Loess Plateau in China to control soil and water losses,develop agricultural land,and improve watershed ecology.Detailed information on the number and spatial distribution of check dams is critical for quantitatively evaluating hydrological and ecological effects and planning the construction of new dams.Thus,this study developed a check dam detection framework for broad areas from high-resolution remote sensing images using an ensemble approach of deep learning and geospatial analysis.First,we made a sample dataset of check dams using GaoFen-2(GF-2)and Google Earth images.Next,we evaluated five popular deep-learning-based object detectors,including Faster R-CNN,You Only Look Once(version 3)(YOLOv3),Cascade R-CNN,YOLOX,and VarifocalNet(VFNet),to identify the best one for check dam detection.Finally,we analyzed the location characteristics of the check dams and used geographical constraints to optimize the detection results.Precision,recall,average precision at intersection over union(IoU)threshold of 0.50(AP_(50)),IoU threshold of 0.75(AP_(75)),and average value for 10 IoU thresholds ranging from 0.50-0.95 with a 0.05 step(AP_(50-95)),and inference time were used to evaluate model performance.All the five deep learning networks could identify check dams quickly and accurately,with AP_(50-95),AP_(50),and AP_(75)values higher than 60.0%,90.0%,and 70.0%,respectively,except for YOLOv3.The VFNet had the best performance,followed by YOLOX.The proposed framework was tested in the Yanhe River Basin and yielded promising results,with a recall rate of 87.0%for 521 check dams.Furthermore,the geographic analysis deleted about 50%of the false detection boxes,increasing the identification accuracy of check dams from 78.6%to 87.6%.Simultaneously,this framework recognized 568 recently constructed check dams and small check dams not recorded in the known check dam survey datasets.The extraction results will support efficient watershed management and guide future studies on soil erosion in the Loess Plateau.展开更多
Language teaching is not a one-way process.It interacts with language learning in an extremely intricate way.To improve language teaching,we need to take the process of language learning into account.This paper tries ...Language teaching is not a one-way process.It interacts with language learning in an extremely intricate way.To improve language teaching,we need to take the process of language learning into account.This paper tries to explore and understand what strategies the second language learners consciously or subconsciously adopt during their language learning process through the analyses of the linguistic errors they commit,so as to provide some insights into language teaching practice.展开更多
Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for rep...Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for reporting site-specific air pollution levels. Accurately predicting air quality, as measured by the AQI, is essential for effective air pollution management. In this study, we aim to identify the most reliable regression model among linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), logistic regression, and K-nearest neighbors (KNN). We conducted four different regression analyses using a machine learning approach to determine the model with the best performance. By employing the confusion matrix and error percentages, we selected the best-performing model, which yielded prediction error rates of 22%, 23%, 20%, and 27%, respectively, for LDA, QDA, logistic regression, and KNN models. The logistic regression model outperformed the other three statistical models in predicting AQI. Understanding these models' performance can help address an existing gap in air quality research and contribute to the integration of regression techniques in AQI studies, ultimately benefiting stakeholders like environmental regulators, healthcare professionals, urban planners, and researchers.展开更多
The compaction quality of subgrade filler strongly affects subgrade settlement.The main objective of this research is to analyze the macro-and micro-mechanical compaction characteristics of subgrade filler based on th...The compaction quality of subgrade filler strongly affects subgrade settlement.The main objective of this research is to analyze the macro-and micro-mechanical compaction characteristics of subgrade filler based on the real shape of coarse particles.First,an improved Viola-Jones algorithm is employed to establish a digitalized 2D particle database for coarse particle shape evaluation and discrete modeling purposes of subgrade filler.Shape indexes of 2D subgrade filler are then computed and statistically analyzed.Finally,numerical simulations are performed to quantitatively investigate the effects of the aspect ratio(AR)and interparticle friction coefficient(μ)on the macro-and micro-mechanical compaction characteristics of subgrade filler based on the discrete element method(DEM).The results show that with the increasing AR,the coarse particles are narrower,leading to the increasing movement of fine particles during compaction,which indicates that it is difficult for slender coarse particles to inhibit the migration of fine particles.Moreover,the average displacement of particles is strongly influenced by the AR,indicating that their occlusion under power relies on particle shapes.The dis-placement and velocity of fine particles are much greater than those of the coarse particles,which shows that compaction is primarily a migration of fine particles.Under the cyclic load,the interparticle friction coefficientμhas little effect on the internal structure of the sample;under the quasi-static loads,however,the increase inμwill lead to a significant increase in the porosity of the sample.This study could not only provide a novel approach to investigate the compaction mechanism but also establish a new theoretical basis for the evaluation of intelligent subgrade compaction.展开更多
In recent years,the rapid development of computer software has led to numerous security problems,particularly software vulnerabilities.These flaws can cause significant harm to users’privacy and property.Current secu...In recent years,the rapid development of computer software has led to numerous security problems,particularly software vulnerabilities.These flaws can cause significant harm to users’privacy and property.Current security defect detection technology relies on manual or professional reasoning,leading to missed detection and high false detection rates.Artificial intelligence technology has led to the development of neural network models based on machine learning or deep learning to intelligently mine holes,reducing missed alarms and false alarms.So,this project aims to study Java source code defect detection methods for defects like null pointer reference exception,XSS(Transform),and Structured Query Language(SQL)injection.Also,the project uses open-source Javalang to translate the Java source code,conducts a deep search on the AST to obtain the empty syntax feature library,and converts the Java source code into a dependency graph.The feature vector is then used as the learning target for the neural network.Four types of Convolutional Neural Networks(CNN),Long Short-Term Memory(LSTM),Bi-directional Long Short-Term Memory(BiLSTM),and Attention Mechanism+Bidirectional LSTM,are used to investigate various code defects,including blank pointer reference exception,XSS,and SQL injection defects.Experimental results show that the attention mechanism in two-dimensional BLSTM is the most effective for object recognition,verifying the correctness of the method.展开更多
The safety assessment of high-level radioactive waste repositories requires a high predictive accuracy for radionuclide diffusion and a comprehensive understanding of the diffusion mechanism.In this study,a through-di...The safety assessment of high-level radioactive waste repositories requires a high predictive accuracy for radionuclide diffusion and a comprehensive understanding of the diffusion mechanism.In this study,a through-diffusion method and six machine-learning methods were employed to investigate the diffusion of ReO_(4)^(−),HCrO_(4)^(−),and I−in saturated compacted bentonite under different salinities and compacted dry densities.The machine-learning models were trained using two datasets.One dataset contained six input features and 293 instances obtained from the diffusion database system of the Japan Atomic Energy Agency(JAEA-DDB)and 15 publications.The other dataset,comprising 15,000 pseudo-instances,was produced using a multi-porosity model and contained eight input features.The results indicate that the former dataset yielded a higher predictive accuracy than the latter.Light gradient-boosting exhibited a higher prediction accuracy(R2=0.92)and lower error(MSE=0.01)than the other machine-learning algorithms.In addition,Shapley Additive Explanations,Feature Importance,and Partial Dependence Plot analysis results indicate that the rock capacity factor and compacted dry density had the two most significant effects on predicting the effective diffusion coefficient,thereby offering valuable insights.展开更多
With the advancement of retinal imaging,hyperreflective foci(HRF)on optical coherence tomography(OCT)images have gained significant attention as potential biological biomarkers for retinal neuroinflammation.However,th...With the advancement of retinal imaging,hyperreflective foci(HRF)on optical coherence tomography(OCT)images have gained significant attention as potential biological biomarkers for retinal neuroinflammation.However,these biomarkers,represented by HRF,present pose challenges in terms of localization,quantification,and require substantial time and resources.In recent years,the progress and utilization of artificial intelligence(AI)have provided powerful tools for the analysis of biological markers.AI technology enables use machine learning(ML),deep learning(DL)and other technologies to precise characterization of changes in biological biomarkers during disease progression and facilitates quantitative assessments.Based on ophthalmic images,AI has significant implications for early screening,diagnostic grading,treatment efficacy evaluation,treatment recommendations,and prognosis development in common ophthalmic diseases.Moreover,it will help reduce the reliance of the healthcare system on human labor,which has the potential to simplify and expedite clinical trials,enhance the reliability and professionalism of disease management,and improve the prediction of adverse events.This article offers a comprehensive review of the application of AI in combination with HRF on OCT images in ophthalmic diseases including age-related macular degeneration(AMD),diabetic macular edema(DME),retinal vein occlusion(RVO)and other retinal diseases and presents prospects for their utilization.展开更多
Existing researches on cyber attackdefense analysis have typically adopted stochastic game theory to model the problem for solutions,but the assumption of complete rationality is used in modeling,ignoring the informat...Existing researches on cyber attackdefense analysis have typically adopted stochastic game theory to model the problem for solutions,but the assumption of complete rationality is used in modeling,ignoring the information opacity in practical attack and defense scenarios,and the model and method lack accuracy.To such problem,we investigate network defense policy methods under finite rationality constraints and propose network defense policy selection algorithm based on deep reinforcement learning.Based on graph theoretical methods,we transform the decision-making problem into a path optimization problem,and use a compression method based on service node to map the network state.On this basis,we improve the A3C algorithm and design the DefenseA3C defense policy selection algorithm with online learning capability.The experimental results show that the model and method proposed in this paper can stably converge to a better network state after training,which is faster and more stable than the original A3C algorithm.Compared with the existing typical approaches,Defense-A3C is verified its advancement.展开更多
The prediction of slope stability is considered as one of the critical concerns in geotechnical engineering.Conventional stochastic analysis with spatially variable slopes is time-consuming and highly computation-dema...The prediction of slope stability is considered as one of the critical concerns in geotechnical engineering.Conventional stochastic analysis with spatially variable slopes is time-consuming and highly computation-demanding.To assess the slope stability problems with a more desirable computational effort,many machine learning(ML)algorithms have been proposed.However,most ML-based techniques require that the training data must be in the same feature space and have the same distribution,and the model may need to be rebuilt when the spatial distribution changes.This paper presents a new ML-based algorithm,which combines the principal component analysis(PCA)-based neural network(NN)and transfer learning(TL)techniques(i.e.PCAeNNeTL)to conduct the stability analysis of slopes with different spatial distributions.The Monte Carlo coupled with finite element simulation is first conducted for data acquisition considering the spatial variability of cohesive strength or friction angle of soils from eight slopes with the same geometry.The PCA method is incorporated into the neural network algorithm(i.e.PCA-NN)to increase the computational efficiency by reducing the input variables.It is found that the PCA-NN algorithm performs well in improving the prediction of slope stability for a given slope in terms of the computational accuracy and computational effort when compared with the other two algorithms(i.e.NN and decision trees,DT).Furthermore,the PCAeNNeTL algorithm shows great potential in assessing the stability of slope even with fewer training data.展开更多
Sentiment analysis, a crucial task in discerning emotional tones within the text, plays a pivotal role in understandingpublic opinion and user sentiment across diverse languages.While numerous scholars conduct sentime...Sentiment analysis, a crucial task in discerning emotional tones within the text, plays a pivotal role in understandingpublic opinion and user sentiment across diverse languages.While numerous scholars conduct sentiment analysisin widely spoken languages such as English, Chinese, Arabic, Roman Arabic, and more, we come to grapplingwith resource-poor languages like Urdu literature which becomes a challenge. Urdu is a uniquely crafted language,characterized by a script that amalgamates elements from diverse languages, including Arabic, Parsi, Pashtu,Turkish, Punjabi, Saraiki, and more. As Urdu literature, characterized by distinct character sets and linguisticfeatures, presents an additional hurdle due to the lack of accessible datasets, rendering sentiment analysis aformidable undertaking. The limited availability of resources has fueled increased interest among researchers,prompting a deeper exploration into Urdu sentiment analysis. This research is dedicated to Urdu languagesentiment analysis, employing sophisticated deep learning models on an extensive dataset categorized into fivelabels: Positive, Negative, Neutral, Mixed, and Ambiguous. The primary objective is to discern sentiments andemotions within the Urdu language, despite the absence of well-curated datasets. To tackle this challenge, theinitial step involves the creation of a comprehensive Urdu dataset by aggregating data from various sources such asnewspapers, articles, and socialmedia comments. Subsequent to this data collection, a thorough process of cleaningand preprocessing is implemented to ensure the quality of the data. The study leverages two well-known deeplearningmodels, namely Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN), for bothtraining and evaluating sentiment analysis performance. Additionally, the study explores hyperparameter tuning tooptimize the models’ efficacy. Evaluation metrics such as precision, recall, and the F1-score are employed to assessthe effectiveness of the models. The research findings reveal that RNN surpasses CNN in Urdu sentiment analysis,gaining a significantly higher accuracy rate of 91%. This result accentuates the exceptional performance of RNN,solidifying its status as a compelling option for conducting sentiment analysis tasks in the Urdu language.展开更多
Heart monitoring improves life quality.Electrocardiograms(ECGs or EKGs)detect heart irregularities.Machine learning algorithms can create a few ECG diagnosis processing methods.The first method uses raw ECG and time-s...Heart monitoring improves life quality.Electrocardiograms(ECGs or EKGs)detect heart irregularities.Machine learning algorithms can create a few ECG diagnosis processing methods.The first method uses raw ECG and time-series data.The second method classifies the ECG by patient experience.The third technique translates ECG impulses into Q waves,R waves and S waves(QRS)features using richer information.Because ECG signals vary naturally between humans and activities,we will combine the three feature selection methods to improve classification accuracy and diagnosis.Classifications using all three approaches have not been examined till now.Several researchers found that Machine Learning(ML)techniques can improve ECG classification.This study will compare popular machine learning techniques to evaluate ECG features.Four algorithms—Support Vector Machine(SVM),Decision Tree,Naive Bayes,and Neural Network—compare categorization results.SVM plus prior knowledge has the highest accuracy(99%)of the four ML methods.QRS characteristics failed to identify signals without chaos theory.With 99.8%classification accuracy,the Decision Tree technique outperformed all previous experiments.展开更多
Recent medical literature shows that the application of artificial intelligence(AI)models in gastrointestinal pathology is an exponentially growing field,with pro-mising models that show very high performances.Regardi...Recent medical literature shows that the application of artificial intelligence(AI)models in gastrointestinal pathology is an exponentially growing field,with pro-mising models that show very high performances.Regarding inflammatory bowel disease(IBD),recent reviews demonstrate promising diagnostic and prognostic AI models.However,studies are generally at high risk of bias(especially in AI models that are image-based).The creation of specific AI models that improve diagnostic performance and allow the establishment of a general prognostic fo-recast in IBD is of great interest,as it may allow the stratification of patients into subgroups and,in turn,allow the creation of different diagnostic and therapeutic protocols for these patients.Regarding surgical models,predictive models of post-operative complications have shown great potential in large-scale studies.In this work,the authors present the development of a predictive algorithm for early post-surgical complications in Crohn's disease based on a Random Forest model with exceptional predictive ability for complications within the cohort.The pre-sent work,based on logical and reasoned,clinical,and applicable aspects,lays a solid foundation for future prospective work to further develop post-surgical prognostic tools for IBD.The next step is to develop in a prospective and mul-ticenter way,a collaborative path to optimize this line of research and make it applicable to our patients.展开更多
In this study, a hybrid machine learning (HML)-based approach, incorporating Genetic data analysis (GDA), is proposed to accurately identify the presence of adenomatous colorectal polyps (ACRP) which is a crucial earl...In this study, a hybrid machine learning (HML)-based approach, incorporating Genetic data analysis (GDA), is proposed to accurately identify the presence of adenomatous colorectal polyps (ACRP) which is a crucial early detector of colorectal cancer (CRC). The present study develops a classification ensemble model based on tuned hyperparameters. Surpassing accuracy percentages of early detection approaches used in previous studies, the current method exhibits exceptional performance in identifying ACRP and diagnosing CRC, overcoming limitations of CRC traditional methods that are based on error-prone manual examination. Particularly, the method demonstrates the following CRP identification accuracy data: 97.7 ± 1.1, precision: 94.3 ± 5, recall: 96.0 ± 3, F1-score: 95.7 ± 4, specificity: 97.3 ± 1.2, average AUC: 0.97.3 ± 0.02, and average p-value: 0.0425 ± 0.07. The findings underscore the potential of this method for early detection of ACRP as well as clinical use in the development of CRC treatment planning strategies. The advantages of this approach are highly expected to contribute to the prevention and reduction of CRC mortality.展开更多
This study aims to explore the application of Bayesian analysis based on neural networks and deep learning in data visualization.The research background is that with the increasing amount and complexity of data,tradit...This study aims to explore the application of Bayesian analysis based on neural networks and deep learning in data visualization.The research background is that with the increasing amount and complexity of data,traditional data analysis methods have been unable to meet the needs.Research methods include building neural networks and deep learning models,optimizing and improving them through Bayesian analysis,and applying them to the visualization of large-scale data sets.The results show that the neural network combined with Bayesian analysis and deep learning method can effectively improve the accuracy and efficiency of data visualization,and enhance the intuitiveness and depth of data interpretation.The significance of the research is that it provides a new solution for data visualization in the big data environment and helps to further promote the development and application of data science.展开更多
This study investigates university English teachers’acceptance and willingness to use learning management system(LMS)data analysis tools in their teaching practices.The research employs a mixed-method approach,combin...This study investigates university English teachers’acceptance and willingness to use learning management system(LMS)data analysis tools in their teaching practices.The research employs a mixed-method approach,combining quantitative surveys and qualitative interviews to understand teachers’perceptions and attitudes,and the factors influencing their adoption of LMS data analysis tools.The findings reveal that perceived usefulness,perceived ease of use,technical literacy,organizational support,and data privacy concerns significantly impact teachers’willingness to use these tools.Based on these insights,the study offers practical recommendations for educational institutions to enhance the effective adoption of LMS data analysis tools in English language teaching.展开更多
This study investigated the Chinese learning motivation,learning goals and learning strategies of 26 international students majoring in MBA and MPA at a university with The belt and road college,mainly by questionnair...This study investigated the Chinese learning motivation,learning goals and learning strategies of 26 international students majoring in MBA and MPA at a university with The belt and road college,mainly by questionnaire and interview method,supplemented by classroom observation method.The survey found that 20 of the 24 international students were zero-start Chinese learners,and their learning motivation was mainly"instrumental"and"intrinsic",and they had high enthusiasm for Chinese language and Chinese culture.They have a high enthusiasm for Chinese language and culture,and will actively solve the difficulties they encounter in learning Chinese.At the same time,this study conducted a questionnaire survey on the needs of international students in terms of curriculum and content,teaching materials,teaching assessment and extracurricular activities,combined with the results of individual and group interviews and classroom observations,to summarize the real needs of international students in various aspects of Chinese language learning,so as to provide teaching reference for teachers teaching international students,and to provide a reference for colleges and universities to develop Chinese teaching programs.The survey will provide a basis for the colleges and universities to formulate Chinese teaching programs and coordinate teaching activities,so as to help international students learn Chinese better.展开更多
基金The National Natural Science Foundation of China under contract Nos 41875061 and 41775165.
文摘The mesoscale eddy(ME)has a significant influence on the convergence effect in deep-sea acoustic propagation.This paper use statistical approaches to express quantitative relationships between the ME conditions and convergence zone(CZ)characteristics.Based on the Gaussian vortex model,we construct various sound propagation scenarios under different eddy conditions,and carry out sound propagation experiments to obtain simulation samples.With a large number of samples,we first adopt the unified regression to set up analytic relationships between eddy conditions and CZ parameters.The sensitivity of eddy indicators to the CZ is quantitatively analyzed.Then,we adopt the machine learning(ML)algorithms to establish prediction models of CZ parameters by exploring the nonlinear relationships between multiple ME indicators and CZ parameters.Through the research,we can express the influence of ME on the CZ quantitatively,and achieve the rapid prediction of CZ parameters in ocean eddies.The prediction accuracy(R)of the CZ distance(mean R:0.9815)is obviously better than that of the CZ width(mean R:0.8728).Among the three ML algorithms,Gradient Boosting Decision Tree has the best prediction ability(root mean square error(RMSE):0.136),followed by Random Forest(RMSE:0.441)and Extreme Learning Machine(RMSE:0.518).
基金supported by Innovative Human Resource Development for Local Intellectualization program through the Institute of Information&Communications Technology Planning&Evaluation(IITP)grant funded by the Korea government(MSIT)(IITP2024-00156287,50%)funded by the Institute for Information&Communications Technology Planning&Evaluation(IITP)grant funded by the Korea government(MSIT)(No.2022-0-01203,Regional Strategic Industry Convergence Security Core Talent Training Business,50%).
文摘The rapid proliferation of Internet of Things(IoT)technology has facilitated automation across various sectors.Nevertheless,this advancement has also resulted in a notable surge in cyberattacks,notably botnets.As a result,research on network analysis has become vital.Machine learning-based techniques for network analysis provide a more extensive and adaptable approach in comparison to traditional rule-based methods.In this paper,we propose a framework for analyzing communications between IoT devices using supervised learning and ensemble techniques and present experimental results that validate the efficacy of the proposed framework.The results indicate that using the proposed ensemble techniques improves accuracy by up to 1.7%compared to singlealgorithm approaches.These results also suggest that the proposed framework can flexibly adapt to general IoT network analysis scenarios.Unlike existing frameworks,which only exhibit high performance in specific situations,the proposed framework can serve as a fundamental approach for addressing a wide range of issues.
文摘To enhance the safety of road traffic operations,this paper proposed a model based on stacking integrated learning utilizing American road traffic accident statistics.Initially,the process involved data cleaning,transformation,and normalization.Subsequently,various classification models were constructed,including logistic regression,k-nearest neighbors,gradient boosting,decision trees,AdaBoost,and extra trees models.Evaluation metrics such as accuracy,precision,recall,F1 score,and Hamming loss were employed.Upon analysis,the passive-aggressive classifier model exhibited superior comprehensive indices compared to other models.Based on the model’s output results,an in-depth examination of the factors influencing traffic accidents was conducted.Additionally,measures and suggestions aimed at reducing the incidence of severe traffic accidents were presented.These findings served as a valuable reference for mitigating the occurrence of traffic accidents.
文摘To address the challenges of current college student employment management,this study designed and implemented a machine learning-based decision support system for college student employment management.The system collects and analyzes multidimensional data,uses machine learning algorithms for prediction and matching,provides personalized employment guidance for students,and provides decision support for universities and enterprises.The research results indicate that the system can effectively improve the efficiency and accuracy of employment guidance,promote school-enterprise cooperation,and achieve a win-win situation for all parties.
基金This research was supported by the National Natural Science Foundation of China(41977064)the National Key R&D Program of China(2021YFD1900700).
文摘Check dams are widely used on the Loess Plateau in China to control soil and water losses,develop agricultural land,and improve watershed ecology.Detailed information on the number and spatial distribution of check dams is critical for quantitatively evaluating hydrological and ecological effects and planning the construction of new dams.Thus,this study developed a check dam detection framework for broad areas from high-resolution remote sensing images using an ensemble approach of deep learning and geospatial analysis.First,we made a sample dataset of check dams using GaoFen-2(GF-2)and Google Earth images.Next,we evaluated five popular deep-learning-based object detectors,including Faster R-CNN,You Only Look Once(version 3)(YOLOv3),Cascade R-CNN,YOLOX,and VarifocalNet(VFNet),to identify the best one for check dam detection.Finally,we analyzed the location characteristics of the check dams and used geographical constraints to optimize the detection results.Precision,recall,average precision at intersection over union(IoU)threshold of 0.50(AP_(50)),IoU threshold of 0.75(AP_(75)),and average value for 10 IoU thresholds ranging from 0.50-0.95 with a 0.05 step(AP_(50-95)),and inference time were used to evaluate model performance.All the five deep learning networks could identify check dams quickly and accurately,with AP_(50-95),AP_(50),and AP_(75)values higher than 60.0%,90.0%,and 70.0%,respectively,except for YOLOv3.The VFNet had the best performance,followed by YOLOX.The proposed framework was tested in the Yanhe River Basin and yielded promising results,with a recall rate of 87.0%for 521 check dams.Furthermore,the geographic analysis deleted about 50%of the false detection boxes,increasing the identification accuracy of check dams from 78.6%to 87.6%.Simultaneously,this framework recognized 568 recently constructed check dams and small check dams not recorded in the known check dam survey datasets.The extraction results will support efficient watershed management and guide future studies on soil erosion in the Loess Plateau.
文摘Language teaching is not a one-way process.It interacts with language learning in an extremely intricate way.To improve language teaching,we need to take the process of language learning into account.This paper tries to explore and understand what strategies the second language learners consciously or subconsciously adopt during their language learning process through the analyses of the linguistic errors they commit,so as to provide some insights into language teaching practice.
文摘Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for reporting site-specific air pollution levels. Accurately predicting air quality, as measured by the AQI, is essential for effective air pollution management. In this study, we aim to identify the most reliable regression model among linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), logistic regression, and K-nearest neighbors (KNN). We conducted four different regression analyses using a machine learning approach to determine the model with the best performance. By employing the confusion matrix and error percentages, we selected the best-performing model, which yielded prediction error rates of 22%, 23%, 20%, and 27%, respectively, for LDA, QDA, logistic regression, and KNN models. The logistic regression model outperformed the other three statistical models in predicting AQI. Understanding these models' performance can help address an existing gap in air quality research and contribute to the integration of regression techniques in AQI studies, ultimately benefiting stakeholders like environmental regulators, healthcare professionals, urban planners, and researchers.
基金This work was supported by the National Key R&D Program‘Transportation Infrastructure’project(No.2022YFB2603400).
文摘The compaction quality of subgrade filler strongly affects subgrade settlement.The main objective of this research is to analyze the macro-and micro-mechanical compaction characteristics of subgrade filler based on the real shape of coarse particles.First,an improved Viola-Jones algorithm is employed to establish a digitalized 2D particle database for coarse particle shape evaluation and discrete modeling purposes of subgrade filler.Shape indexes of 2D subgrade filler are then computed and statistically analyzed.Finally,numerical simulations are performed to quantitatively investigate the effects of the aspect ratio(AR)and interparticle friction coefficient(μ)on the macro-and micro-mechanical compaction characteristics of subgrade filler based on the discrete element method(DEM).The results show that with the increasing AR,the coarse particles are narrower,leading to the increasing movement of fine particles during compaction,which indicates that it is difficult for slender coarse particles to inhibit the migration of fine particles.Moreover,the average displacement of particles is strongly influenced by the AR,indicating that their occlusion under power relies on particle shapes.The dis-placement and velocity of fine particles are much greater than those of the coarse particles,which shows that compaction is primarily a migration of fine particles.Under the cyclic load,the interparticle friction coefficientμhas little effect on the internal structure of the sample;under the quasi-static loads,however,the increase inμwill lead to a significant increase in the porosity of the sample.This study could not only provide a novel approach to investigate the compaction mechanism but also establish a new theoretical basis for the evaluation of intelligent subgrade compaction.
基金This work is supported by the Provincial Key Science and Technology Special Project of Henan(No.221100240100)。
文摘In recent years,the rapid development of computer software has led to numerous security problems,particularly software vulnerabilities.These flaws can cause significant harm to users’privacy and property.Current security defect detection technology relies on manual or professional reasoning,leading to missed detection and high false detection rates.Artificial intelligence technology has led to the development of neural network models based on machine learning or deep learning to intelligently mine holes,reducing missed alarms and false alarms.So,this project aims to study Java source code defect detection methods for defects like null pointer reference exception,XSS(Transform),and Structured Query Language(SQL)injection.Also,the project uses open-source Javalang to translate the Java source code,conducts a deep search on the AST to obtain the empty syntax feature library,and converts the Java source code into a dependency graph.The feature vector is then used as the learning target for the neural network.Four types of Convolutional Neural Networks(CNN),Long Short-Term Memory(LSTM),Bi-directional Long Short-Term Memory(BiLSTM),and Attention Mechanism+Bidirectional LSTM,are used to investigate various code defects,including blank pointer reference exception,XSS,and SQL injection defects.Experimental results show that the attention mechanism in two-dimensional BLSTM is the most effective for object recognition,verifying the correctness of the method.
基金the Key Program of National Natural Science Foundation of China(No.12335008),the Postgraduate Research and Innovation Project of Huzhou University(No.2023KYCX62)the Scientific Research Fund of Zhejiang Provincial Education Department(No.Y202352712)the Huzhou science and technology planning project(No.2021GZ60)。
文摘The safety assessment of high-level radioactive waste repositories requires a high predictive accuracy for radionuclide diffusion and a comprehensive understanding of the diffusion mechanism.In this study,a through-diffusion method and six machine-learning methods were employed to investigate the diffusion of ReO_(4)^(−),HCrO_(4)^(−),and I−in saturated compacted bentonite under different salinities and compacted dry densities.The machine-learning models were trained using two datasets.One dataset contained six input features and 293 instances obtained from the diffusion database system of the Japan Atomic Energy Agency(JAEA-DDB)and 15 publications.The other dataset,comprising 15,000 pseudo-instances,was produced using a multi-porosity model and contained eight input features.The results indicate that the former dataset yielded a higher predictive accuracy than the latter.Light gradient-boosting exhibited a higher prediction accuracy(R2=0.92)and lower error(MSE=0.01)than the other machine-learning algorithms.In addition,Shapley Additive Explanations,Feature Importance,and Partial Dependence Plot analysis results indicate that the rock capacity factor and compacted dry density had the two most significant effects on predicting the effective diffusion coefficient,thereby offering valuable insights.
基金Supported by Zhejiang Provincial Natural Science Foundation of China(No.LGF22H120013)the Ningbo Natural Science Foundation(No.2023J209,No.2021J023)+2 种基金Ningbo Medical Science and Technology Project(No.2021Y57)Ningbo Yinzhou District Agricultural Community Development Science and Technology Project(No.2022AS022)Ningbo Eye Hospital Scientific Technology Plan Project and Talent Introduction Start Subject(No.2022RC001).
文摘With the advancement of retinal imaging,hyperreflective foci(HRF)on optical coherence tomography(OCT)images have gained significant attention as potential biological biomarkers for retinal neuroinflammation.However,these biomarkers,represented by HRF,present pose challenges in terms of localization,quantification,and require substantial time and resources.In recent years,the progress and utilization of artificial intelligence(AI)have provided powerful tools for the analysis of biological markers.AI technology enables use machine learning(ML),deep learning(DL)and other technologies to precise characterization of changes in biological biomarkers during disease progression and facilitates quantitative assessments.Based on ophthalmic images,AI has significant implications for early screening,diagnostic grading,treatment efficacy evaluation,treatment recommendations,and prognosis development in common ophthalmic diseases.Moreover,it will help reduce the reliance of the healthcare system on human labor,which has the potential to simplify and expedite clinical trials,enhance the reliability and professionalism of disease management,and improve the prediction of adverse events.This article offers a comprehensive review of the application of AI in combination with HRF on OCT images in ophthalmic diseases including age-related macular degeneration(AMD),diabetic macular edema(DME),retinal vein occlusion(RVO)and other retinal diseases and presents prospects for their utilization.
基金supported by the Major Science and Technology Programs in Henan Province(No.241100210100)The Project of Science and Technology in Henan Province(No.242102211068,No.232102210078)+2 种基金The Key Field Special Project of Guangdong Province(No.2021ZDZX1098)The China University Research Innovation Fund(No.2021FNB3001,No.2022IT020)Shenzhen Science and Technology Innovation Commission Stable Support Plan(No.20231128083944001)。
文摘Existing researches on cyber attackdefense analysis have typically adopted stochastic game theory to model the problem for solutions,but the assumption of complete rationality is used in modeling,ignoring the information opacity in practical attack and defense scenarios,and the model and method lack accuracy.To such problem,we investigate network defense policy methods under finite rationality constraints and propose network defense policy selection algorithm based on deep reinforcement learning.Based on graph theoretical methods,we transform the decision-making problem into a path optimization problem,and use a compression method based on service node to map the network state.On this basis,we improve the A3C algorithm and design the DefenseA3C defense policy selection algorithm with online learning capability.The experimental results show that the model and method proposed in this paper can stably converge to a better network state after training,which is faster and more stable than the original A3C algorithm.Compared with the existing typical approaches,Defense-A3C is verified its advancement.
基金supported by the National Natural Science Foundation of China(Grant No.52008402)the Central South University autonomous exploration project(Grant No.2021zzts0790).
文摘The prediction of slope stability is considered as one of the critical concerns in geotechnical engineering.Conventional stochastic analysis with spatially variable slopes is time-consuming and highly computation-demanding.To assess the slope stability problems with a more desirable computational effort,many machine learning(ML)algorithms have been proposed.However,most ML-based techniques require that the training data must be in the same feature space and have the same distribution,and the model may need to be rebuilt when the spatial distribution changes.This paper presents a new ML-based algorithm,which combines the principal component analysis(PCA)-based neural network(NN)and transfer learning(TL)techniques(i.e.PCAeNNeTL)to conduct the stability analysis of slopes with different spatial distributions.The Monte Carlo coupled with finite element simulation is first conducted for data acquisition considering the spatial variability of cohesive strength or friction angle of soils from eight slopes with the same geometry.The PCA method is incorporated into the neural network algorithm(i.e.PCA-NN)to increase the computational efficiency by reducing the input variables.It is found that the PCA-NN algorithm performs well in improving the prediction of slope stability for a given slope in terms of the computational accuracy and computational effort when compared with the other two algorithms(i.e.NN and decision trees,DT).Furthermore,the PCAeNNeTL algorithm shows great potential in assessing the stability of slope even with fewer training data.
文摘Sentiment analysis, a crucial task in discerning emotional tones within the text, plays a pivotal role in understandingpublic opinion and user sentiment across diverse languages.While numerous scholars conduct sentiment analysisin widely spoken languages such as English, Chinese, Arabic, Roman Arabic, and more, we come to grapplingwith resource-poor languages like Urdu literature which becomes a challenge. Urdu is a uniquely crafted language,characterized by a script that amalgamates elements from diverse languages, including Arabic, Parsi, Pashtu,Turkish, Punjabi, Saraiki, and more. As Urdu literature, characterized by distinct character sets and linguisticfeatures, presents an additional hurdle due to the lack of accessible datasets, rendering sentiment analysis aformidable undertaking. The limited availability of resources has fueled increased interest among researchers,prompting a deeper exploration into Urdu sentiment analysis. This research is dedicated to Urdu languagesentiment analysis, employing sophisticated deep learning models on an extensive dataset categorized into fivelabels: Positive, Negative, Neutral, Mixed, and Ambiguous. The primary objective is to discern sentiments andemotions within the Urdu language, despite the absence of well-curated datasets. To tackle this challenge, theinitial step involves the creation of a comprehensive Urdu dataset by aggregating data from various sources such asnewspapers, articles, and socialmedia comments. Subsequent to this data collection, a thorough process of cleaningand preprocessing is implemented to ensure the quality of the data. The study leverages two well-known deeplearningmodels, namely Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN), for bothtraining and evaluating sentiment analysis performance. Additionally, the study explores hyperparameter tuning tooptimize the models’ efficacy. Evaluation metrics such as precision, recall, and the F1-score are employed to assessthe effectiveness of the models. The research findings reveal that RNN surpasses CNN in Urdu sentiment analysis,gaining a significantly higher accuracy rate of 91%. This result accentuates the exceptional performance of RNN,solidifying its status as a compelling option for conducting sentiment analysis tasks in the Urdu language.
基金The authors extend their appreciation to the Deanship of Scientific Research at King Khalid University for funding this work through Large Groups(Grant Number RGP.2/246/44),B.B.,and https://www.kku.edu.sa/en.
文摘Heart monitoring improves life quality.Electrocardiograms(ECGs or EKGs)detect heart irregularities.Machine learning algorithms can create a few ECG diagnosis processing methods.The first method uses raw ECG and time-series data.The second method classifies the ECG by patient experience.The third technique translates ECG impulses into Q waves,R waves and S waves(QRS)features using richer information.Because ECG signals vary naturally between humans and activities,we will combine the three feature selection methods to improve classification accuracy and diagnosis.Classifications using all three approaches have not been examined till now.Several researchers found that Machine Learning(ML)techniques can improve ECG classification.This study will compare popular machine learning techniques to evaluate ECG features.Four algorithms—Support Vector Machine(SVM),Decision Tree,Naive Bayes,and Neural Network—compare categorization results.SVM plus prior knowledge has the highest accuracy(99%)of the four ML methods.QRS characteristics failed to identify signals without chaos theory.With 99.8%classification accuracy,the Decision Tree technique outperformed all previous experiments.
文摘Recent medical literature shows that the application of artificial intelligence(AI)models in gastrointestinal pathology is an exponentially growing field,with pro-mising models that show very high performances.Regarding inflammatory bowel disease(IBD),recent reviews demonstrate promising diagnostic and prognostic AI models.However,studies are generally at high risk of bias(especially in AI models that are image-based).The creation of specific AI models that improve diagnostic performance and allow the establishment of a general prognostic fo-recast in IBD is of great interest,as it may allow the stratification of patients into subgroups and,in turn,allow the creation of different diagnostic and therapeutic protocols for these patients.Regarding surgical models,predictive models of post-operative complications have shown great potential in large-scale studies.In this work,the authors present the development of a predictive algorithm for early post-surgical complications in Crohn's disease based on a Random Forest model with exceptional predictive ability for complications within the cohort.The pre-sent work,based on logical and reasoned,clinical,and applicable aspects,lays a solid foundation for future prospective work to further develop post-surgical prognostic tools for IBD.The next step is to develop in a prospective and mul-ticenter way,a collaborative path to optimize this line of research and make it applicable to our patients.
文摘In this study, a hybrid machine learning (HML)-based approach, incorporating Genetic data analysis (GDA), is proposed to accurately identify the presence of adenomatous colorectal polyps (ACRP) which is a crucial early detector of colorectal cancer (CRC). The present study develops a classification ensemble model based on tuned hyperparameters. Surpassing accuracy percentages of early detection approaches used in previous studies, the current method exhibits exceptional performance in identifying ACRP and diagnosing CRC, overcoming limitations of CRC traditional methods that are based on error-prone manual examination. Particularly, the method demonstrates the following CRP identification accuracy data: 97.7 ± 1.1, precision: 94.3 ± 5, recall: 96.0 ± 3, F1-score: 95.7 ± 4, specificity: 97.3 ± 1.2, average AUC: 0.97.3 ± 0.02, and average p-value: 0.0425 ± 0.07. The findings underscore the potential of this method for early detection of ACRP as well as clinical use in the development of CRC treatment planning strategies. The advantages of this approach are highly expected to contribute to the prevention and reduction of CRC mortality.
文摘This study aims to explore the application of Bayesian analysis based on neural networks and deep learning in data visualization.The research background is that with the increasing amount and complexity of data,traditional data analysis methods have been unable to meet the needs.Research methods include building neural networks and deep learning models,optimizing and improving them through Bayesian analysis,and applying them to the visualization of large-scale data sets.The results show that the neural network combined with Bayesian analysis and deep learning method can effectively improve the accuracy and efficiency of data visualization,and enhance the intuitiveness and depth of data interpretation.The significance of the research is that it provides a new solution for data visualization in the big data environment and helps to further promote the development and application of data science.
文摘This study investigates university English teachers’acceptance and willingness to use learning management system(LMS)data analysis tools in their teaching practices.The research employs a mixed-method approach,combining quantitative surveys and qualitative interviews to understand teachers’perceptions and attitudes,and the factors influencing their adoption of LMS data analysis tools.The findings reveal that perceived usefulness,perceived ease of use,technical literacy,organizational support,and data privacy concerns significantly impact teachers’willingness to use these tools.Based on these insights,the study offers practical recommendations for educational institutions to enhance the effective adoption of LMS data analysis tools in English language teaching.
文摘This study investigated the Chinese learning motivation,learning goals and learning strategies of 26 international students majoring in MBA and MPA at a university with The belt and road college,mainly by questionnaire and interview method,supplemented by classroom observation method.The survey found that 20 of the 24 international students were zero-start Chinese learners,and their learning motivation was mainly"instrumental"and"intrinsic",and they had high enthusiasm for Chinese language and Chinese culture.They have a high enthusiasm for Chinese language and culture,and will actively solve the difficulties they encounter in learning Chinese.At the same time,this study conducted a questionnaire survey on the needs of international students in terms of curriculum and content,teaching materials,teaching assessment and extracurricular activities,combined with the results of individual and group interviews and classroom observations,to summarize the real needs of international students in various aspects of Chinese language learning,so as to provide teaching reference for teachers teaching international students,and to provide a reference for colleges and universities to develop Chinese teaching programs.The survey will provide a basis for the colleges and universities to formulate Chinese teaching programs and coordinate teaching activities,so as to help international students learn Chinese better.