期刊文献+
共找到28篇文章
< 1 2 >
每页显示 20 50 100
Employee Attrition Classification Model Based on Stacking Algorithm
1
作者 CHEN Yanming LIN Xinyu ZHAN Kunye 《Psychology Research》 2023年第6期279-285,共7页
This paper aims to build an employee attrition classification model based on the Stacking algorithm.Oversampling algorithm is applied to address the issue of data imbalance and the Randomforest feature importance rank... This paper aims to build an employee attrition classification model based on the Stacking algorithm.Oversampling algorithm is applied to address the issue of data imbalance and the Randomforest feature importance ranking method is used to resolve the overfitting problem after data cleaning and preprocessing.Then,different algorithms are used to establish classification models as control experiments,and R-squared indicators are used to compare.Finally,the Stacking algorithm is used to establish the final classification model.This model has practical and significant implications for both human resource management and employee attrition analysis. 展开更多
关键词 employee attrition classification model machine learning ensemble learning oversampling algorithm Randomforest stacking algorithm
下载PDF
STOCHASTIC SIMULATION OF TROPICAL CYCLONE TRACKS IN THE NORTHWEST PACIFIC REGION WITH CLASSIFICATION MODEL 被引量:2
2
作者 黄文锋 刘大伟 邵梦凯 《Journal of Tropical Meteorology》 SCIE 2020年第1期47-56,共10页
Accurate simulation of tropical cyclone tracks is a prerequisite for tropical cyclone risk assessment.Against the spatial characteristics of tropical cyclone tracks in the Northwest Pacific region,stochastic simulatio... Accurate simulation of tropical cyclone tracks is a prerequisite for tropical cyclone risk assessment.Against the spatial characteristics of tropical cyclone tracks in the Northwest Pacific region,stochastic simulation method based on classification model is used to simulate tropical cyclone tracks in this region.Such simulation includes the classification method,the genesis model,the traveling model,and the lysis model.Tropical cyclone tracks in the Northwest Pacific region are classified into five categories on the basis of its movement characteristics and steering positions.In the genesis model,Gaussian kernel probability density functions with the biased cross validation method are used to simulate the annual occurrence number and genesis positions.The traveling model is established on the basis of the mean and mean square error of the historical 6 h latitude and longitude displacements.The termination probability is used as the discrimination standard in the lysis model.Then,this stochastic simulation method of tropical cyclone tracks is applied and qualitatively evaluated with different diagnostics.Results show that the tropical cyclone tracks in Northwest Pacific can be satisfactorily simulated with this classification model. 展开更多
关键词 classification model genesis model lysis model traveling model tropical cyclone track
下载PDF
Data-driven casting defect prediction model for sand casting based on random forest classification algorithm
3
作者 Bang Guan Dong-hong Wang +3 位作者 Da Shu Shou-qin Zhu Xiao-yuan Ji Bao-de Sun 《China Foundry》 SCIE EI CAS CSCD 2024年第2期137-146,共10页
The complex sand-casting process combined with the interactions between process parameters makes it difficult to control the casting quality,resulting in a high scrap rate.A strategy based on a data-driven model was p... The complex sand-casting process combined with the interactions between process parameters makes it difficult to control the casting quality,resulting in a high scrap rate.A strategy based on a data-driven model was proposed to reduce casting defects and improve production efficiency,which includes the random forest(RF)classification model,the feature importance analysis,and the process parameters optimization with Monte Carlo simulation.The collected data includes four types of defects and corresponding process parameters were used to construct the RF model.Classification results show a recall rate above 90% for all categories.The Gini Index was used to assess the importance of the process parameters in the formation of various defects in the RF model.Finally,the classification model was applied to different production conditions for quality prediction.In the case of process parameters optimization for gas porosity defects,this model serves as an experimental process in the Monte Carlo method to estimate a better temperature distribution.The prediction model,when applied to the factory,greatly improved the efficiency of defect detection.Results show that the scrap rate decreased from 10.16% to 6.68%. 展开更多
关键词 sand casting process data-driven method classification model quality prediction feature importance
下载PDF
Artificial Intelligence-Based Fusion Model for Paddy Leaf Disease Detection and Classification
4
作者 Ahmed SAlmasoud Abdelzahir Abdelmaboud +5 位作者 Taiseer Abdalla Elfadil Eisa Mesfer Al Duhayyim Asma Abbas Hassan Elnour Manar Ahmed Hamza Abdelwahed Motwakel Abu Sarwar Zamani 《Computers, Materials & Continua》 SCIE EI 2022年第7期1391-1407,共17页
In agriculture,rice plant disease diagnosis has become a challenging issue,and early identification of this disease can avoid huge loss incurred from less crop productivity.Some of the recently-developed computer visi... In agriculture,rice plant disease diagnosis has become a challenging issue,and early identification of this disease can avoid huge loss incurred from less crop productivity.Some of the recently-developed computer vision and Deep Learning(DL)approaches can be commonly employed in designing effective models for rice plant disease detection and classification processes.With this motivation,the current research work devises an Efficient Deep Learning based FusionModel for Rice Plant Disease(EDLFM-RPD)detection and classification.The aim of the proposed EDLFM-RPD technique is to detect and classify different kinds of rice plant diseases in a proficient manner.In addition,EDLFM-RPD technique involves median filtering-based preprocessing and K-means segmentation to determine the infected portions.The study also used a fusion of handcrafted Gray Level Co-occurrence Matrix(GLCM)and Inception-based deep features to derive the features.Finally,Salp Swarm Optimization with Fuzzy Support Vector Machine(FSVM)model is utilized for classification.In order to validate the enhanced outcomes of EDLFM-RPD technique,a series of simulations was conducted.The results were assessed under different measures.The obtained values infer the improved performance of EDLFM-RPD technique over recent approaches and achieved a maximum accuracy of 96.170%. 展开更多
关键词 Rice plant disease classification model artificial intelligence deep learning fusion model parameter optimization
下载PDF
Classification of Acupuncture Points Based on the Bert Model*
5
作者 Xi Zhong Yangli Jia +1 位作者 Dekui Li Xiangliang Zhang 《Journal of Data Analysis and Information Processing》 2021年第3期123-135,共13页
In this paper, we explore the multi-classification problem of acupuncture acupoints bas</span><span><span style="font-family:Verdana;">ed on </span><span style="font-family:Ve... In this paper, we explore the multi-classification problem of acupuncture acupoints bas</span><span><span style="font-family:Verdana;">ed on </span><span style="font-family:Verdana;">Bert</span><span style="font-family:Verdana;"> model, </span><i><span style="font-family:Verdana;">i.e.</span></i><span style="font-family:Verdana;">, we try to recommend the best main acupuncture point for treating the disease by classifying and predicting the main acupuncture point for the disease, and further explore its acupuncture point grouping to provide the medical practitioner with the optimal solution for treating the disease and improv</span></span></span><span style="font-family:Verdana;">ing</span><span style="font-family:""><span style="font-family:Verdana;"> the clinical decision-making ability. The Bert-Chinese-Acupoint model was constructed by retraining </span><span style="font-family:Verdana;">on the basis of</span><span style="font-family:Verdana;"> the Bert model, and the semantic features in terms of acupuncture points were added to the acupunctu</span></span><span style="font-family:""><span style="font-family:Verdana;">re point corpus in the fine-tuning process to increase the semantic features in terms of acupuncture </span><span style="font-family:Verdana;">points,</span><span style="font-family:Verdana;"> and compared with the machine learning method. The results show that the Bert-Chinese Acupoint model proposed in this paper has a 3% improvement in accuracy compared to the </span><span style="font-family:Verdana;">best performing</span><span style="font-family:Verdana;"> model in the machine learning approach. 展开更多
关键词 Bert model Machine Learning classification model Comparison
下载PDF
Cardiac Arrhythmia Disease Classifier Model Based on a Fuzzy Fusion Approach
6
作者 Fatma Taher Hamoud Alshammari +3 位作者 Lobna Osman Mohamed Elhoseny Abdulaziz Shehab Eman Elayat 《Computers, Materials & Continua》 SCIE EI 2023年第5期4485-4499,共15页
Cardiac diseases are one of the greatest global health challenges.Due to the high annual mortality rates,cardiac diseases have attracted the attention of numerous researchers in recent years.This article proposes a hy... Cardiac diseases are one of the greatest global health challenges.Due to the high annual mortality rates,cardiac diseases have attracted the attention of numerous researchers in recent years.This article proposes a hybrid fuzzy fusion classification model for cardiac arrhythmia diseases.The fusion model is utilized to optimally select the highest-ranked features generated by a variety of well-known feature-selection algorithms.An ensemble of classifiers is then applied to the fusion’s results.The proposed model classifies the arrhythmia dataset from the University of California,Irvine into normal/abnormal classes as well as 16 classes of arrhythmia.Initially,at the preprocessing steps,for the miss-valued attributes,we used the average value in the linear attributes group by the same class and the most frequent value for nominal attributes.However,in order to ensure the model optimality,we eliminated all attributes which have zero or constant values that might bias the results of utilized classifiers.The preprocessing step led to 161 out of 279 attributes(features).Thereafter,a fuzzy-based feature-selection fusion method is applied to fuse high-ranked features obtained from different heuristic feature-selection algorithms.In short,our study comprises three main blocks:(1)sensing data and preprocessing;(2)feature queuing,selection,and extraction;and(3)the predictive model.Our proposed method improves classification performance in terms of accuracy,F1measure,recall,and precision when compared to state-of-the-art techniques.It achieves 98.5%accuracy for binary class mode and 98.9%accuracy for categorized class mode. 展开更多
关键词 CARDIAC ARRHYTHMIA PREPROCESSING missing values classification model FUSION
下载PDF
A Novel Multi-Stage Bispectral Deep Learning Method for Protein Family Classification
7
作者 Amjed Al Fahoum Ala’a Zyout +1 位作者 Hiam Alquran Isam Abu-Qasmieh 《Computers, Materials & Continua》 SCIE EI 2023年第7期1173-1193,共21页
Complex proteins are needed for many biological activities.Folding amino acid chains reveals their properties and functions.They support healthy tissue structure,physiology,and homeostasis.Precision medicine and treat... Complex proteins are needed for many biological activities.Folding amino acid chains reveals their properties and functions.They support healthy tissue structure,physiology,and homeostasis.Precision medicine and treatments require quantitative protein identification and function.Despite technical advances and protein sequence data exploration,bioinformatics’“basic structure”problem—the automatic deduction of a protein’s properties from its amino acid sequence—remains unsolved.Protein function inference from amino acid sequences is the main biological data challenge.This study analyzes whether raw sequencing can characterize biological facts.A massive corpus of protein sequences and the Globin-like superfamily’s related protein families generate a solid vector representation.A coding technique for each sequence in each family was devised using two representations to identify each amino acid precisely.A bispectral analysis converts encoded protein numerical sequences into images for better protein sequence and family discrimination.Training and validation employed 70%of the dataset,while 30%was used for testing.This paper examined the performance of multistage deep learning models for differentiating between sixteen protein families after encoding and representing each encoded sequence by a higher spectral representation image(Bispectrum).Cascading minimized false positive and negative cases in all phases.The initial stage focused on two classes(six groups and ten groups).The subsequent stages focused on the few classes almost accurately separated in the first stage and decreased the overlapping cases between families that appeared in single-stage deep learning classification.The single-stage technique had 64.2%+/-22.8%accuracy,63.3%+/-17.1%precision,and a 63.2%+/19.4%F1-score.The two-stage technique yielded 92.2%+/-4.9%accuracy,92.7%+/-7.0%precision,and a 92.3%+/-5.0%F1-score.This work provides balanced,reliable,and precise forecasts for all families in all measures.It ensured that the new model was resilient to family variances and provided high-scoring results. 展开更多
关键词 Globin-like superfamily numerical encoding bispectral analysis classification model deep convolutional neural network
下载PDF
An Innovative Bispectral Deep Learning Method for Protein Family Classification
8
作者 Isam Abu-Qasmieh Amjed Al Fahoum +1 位作者 Hiam Alquran Ala’a Zyout 《Computers, Materials & Continua》 SCIE EI 2023年第5期3971-3991,共21页
Proteins are essential for many biological functions.For example,folding amino acid chains reveals their functionalities by maintaining tissue structure,physiology,and homeostasis.Note that quantifiable protein charac... Proteins are essential for many biological functions.For example,folding amino acid chains reveals their functionalities by maintaining tissue structure,physiology,and homeostasis.Note that quantifiable protein characteristics are vital for improving therapies and precision medicine.The automatic inference of a protein’s properties from its amino acid sequence is called“basic structure”.Nevertheless,it remains a critical unsolved challenge in bioinformatics,although with recent technological advances and the investigation of protein sequence data.Inferring protein function from amino acid sequences is crucial in biology.This study considers using raw sequencing to explain biological facts using a large corpus of protein sequences and the Globin-like superfamily to generate a vector representation.The power of two representations was used to identify each amino acid,and a coding technique was established for each sequence family.Subsequently,the encoded protein numerical sequences are transformed into an image using bispectral analysis to identify essential characteristics for discriminating between protein sequences and their families.A deep Convolutional Neural Network(CNN)classifies the resulting images and developed non-normalized and normalized encoding techniques.Initially,the dataset was split 70/30 for training and testing.Correspondingly,the dataset was utilized for 70%training,15%validation,and 15%testing.The suggested methods are evaluated using accuracy,precision,and recall.The non-normalized method had 70%accuracy,72%precision,and 71%recall.68%accuracy,67%precision,and 67%recall after validation.Meanwhile,the normalized approach without validation had 92.4%accuracy,94.3%precision,and 91.1%recall.Validation showed 90%accuracy,91.2%precision,and 89.7%recall.Note that both algorithms outperform the rest.The paper presents that bispectrum-based nonlinear analysis using deep learning models outperforms standard machine learning methods and other deep learning methods based on convolutional architecture.They offered the best inference performance as the proposed approach improves categorization and prediction.Several instances show successful multi-class prediction in molecular biology’s massive data. 展开更多
关键词 Globin-like superfamily numerical encoding bispectral analysis classification model deep convolutional neural network(CNN)
下载PDF
Question-Answering Pair Matching Based on Question Classification and Ensemble Sentence Embedding
9
作者 Jae-Seok Jang Hyuk-Yoon Kwon 《Computer Systems Science & Engineering》 SCIE EI 2023年第9期3471-3489,共19页
Question-answering(QA)models find answers to a given question.The necessity of automatically finding answers is increasing because it is very important and challenging from the large-scale QA data sets.In this paper,w... Question-answering(QA)models find answers to a given question.The necessity of automatically finding answers is increasing because it is very important and challenging from the large-scale QA data sets.In this paper,we deal with the QA pair matching approach in QA models,which finds the most relevant question and its recommended answer for a given question.Existing studies for the approach performed on the entire dataset or datasets within a category that the question writer manually specifies.In contrast,we aim to automatically find the category to which the question belongs by employing the text classification model and to find the answer corresponding to the question within the category.Due to the text classification model,we can effectively reduce the search space for finding the answers to a given question.Therefore,the proposed model improves the accuracy of the QA matching model and significantly reduces the model inference time.Furthermore,to improve the performance of finding similar sentences in each category,we present an ensemble embedding model for sentences,improving the performance compared to the individual embedding models.Using real-world QA data sets,we evaluate the performance of the proposed QA matching model.As a result,the accuracy of our final ensemble embedding model based on the text classification model is 81.18%,which outperforms the existing models by 9.81%∼14.16%point.Moreover,in terms of the model inference speed,our model is faster than the existing models by 2.61∼5.07 times due to the effective reduction of search spaces by the text classification model. 展开更多
关键词 Question-answering text classification model data augmentation text embedding
下载PDF
The Establishment of Mathematical Models for the Composition Analysis and Identification of Ancient Glass Products
10
作者 Jenny Zhang Ding Li +1 位作者 Yu Xie Junfeng Xiang 《Open Journal of Applied Sciences》 2023年第11期2149-2171,共23页
Glass is the precious material evidence of the trade of the early Silk Road. The ancient glass was easily affected by the environmental impact and weathering, and the change of composition ratios affected the correct ... Glass is the precious material evidence of the trade of the early Silk Road. The ancient glass was easily affected by the environmental impact and weathering, and the change of composition ratios affected the correct judgment of its category. In this paper, mathematical models and methods such as Chi-square test, weighted average method, principal component analysis, cluster analysis, binary classification model and grey correlation analysis were used comprehensively to analyze the data of sample glass products combined with their categories. The results showed that the weathered high-potassium glass could be divided into 12, 9, 10 and 27, 7, 22 and so on. 展开更多
关键词 Principal Component Analysis System Clustering Sensitivity Analysis Binary classification model Logistic Regression Analysis Grey Correlation Analysis
下载PDF
Classification and Facies Sequence Model of Subaqueous Debris Flows 被引量:6
11
作者 XIAN Benzhong LIU Jianping +3 位作者 DONG Yanlei LU Zhiyong HE Yanxin WANG Junhui 《Acta Geologica Sinica(English Edition)》 SCIE CAS CSCD 2017年第2期751-752,共2页
Objective Debris flows are cohesive sediment gravity flows which occur in both subaerial and subaqueous settings.Compared to subaerial debris flows which have been well studied as a geological hazard,subaqueous debris... Objective Debris flows are cohesive sediment gravity flows which occur in both subaerial and subaqueous settings.Compared to subaerial debris flows which have been well studied as a geological hazard,subaqueous debris flows showing complicated sediment composition and sedimentary processes were poorly understood.The main objective 展开更多
关键词 classification and Facies Sequence model of Subaqueous Debris Flows
下载PDF
Bootstrap inference of the skew-normal two-way classification random effects model with interaction
12
作者 YE Ren-dao AN Na +1 位作者 LUO Kun LIN Ya 《Applied Mathematics(A Journal of Chinese Universities)》 SCIE CSCD 2022年第3期435-452,共18页
In this paper,we consider the statistical inference problems for the fixed effect and variance component functions in the two-way classification random effects model with skewnormal errors.Firstly,the exact test stati... In this paper,we consider the statistical inference problems for the fixed effect and variance component functions in the two-way classification random effects model with skewnormal errors.Firstly,the exact test statistic for the fixed effect is constructed.Secondly,using the Bootstrap approach and generalized approach,the one-sided hypothesis testing and interval estimation problems for the single variance component,the sum and ratio of variance components are discussed respectively.Further,the Monte Carlo simulation results indicate that the exact test statistic performs well in the one-sided hypothesis testing problem for the fixed effect.And the Bootstrap approach is better than the generalized approach in the one-sided hypothesis testing problems for variance component functions in most cases.Finally,the above approaches are applied to the real data examples of the consumer price index and value-added index of three industries to verify their rationality and effectiveness. 展开更多
关键词 skew-normal two-way classification random effects model with interaction fixed effect variance component functions BOOTSTRAP generalized approach
下载PDF
Swarm-Based Extreme Learning Machine Models for Global Optimization
13
作者 Mustafa Abdul Salam Ahmad Taher Azar Rana Hussien 《Computers, Materials & Continua》 SCIE EI 2022年第3期6339-6363,共25页
Extreme Learning Machine(ELM)is popular in batch learning,sequential learning,and progressive learning,due to its speed,easy integration,and generalization ability.While,Traditional ELM cannot train massive data rapid... Extreme Learning Machine(ELM)is popular in batch learning,sequential learning,and progressive learning,due to its speed,easy integration,and generalization ability.While,Traditional ELM cannot train massive data rapidly and efficiently due to its memory residence,high time and space complexity.In ELM,the hidden layer typically necessitates a huge number of nodes.Furthermore,there is no certainty that the arrangement of weights and biases within the hidden layer is optimal.To solve this problem,the traditional ELM has been hybridized with swarm intelligence optimization techniques.This paper displays five proposed hybrid Algorithms“Salp Swarm Algorithm(SSA-ELM),Grasshopper Algorithm(GOA-ELM),Grey Wolf Algorithm(GWO-ELM),Whale optimizationAlgorithm(WOA-ELM)andMoth Flame Optimization(MFO-ELM)”.These five optimizers are hybridized with standard ELM methodology for resolving the tumor type classification using gene expression data.The proposed models applied to the predication of electricity loading data,that describes the energy use of a single residence over a fouryear period.In the hidden layer,Swarm algorithms are used to pick a smaller number of nodes to speed up the execution of ELM.The best weights and preferences were calculated by these algorithms for the hidden layer.Experimental results demonstrated that the proposed MFO-ELM achieved 98.13%accuracy and this is the highest model in accuracy in tumor type classification gene expression data.While in predication,the proposed GOA-ELM achieved 0.397which is least RMSE compared to the other models. 展开更多
关键词 Extreme learning machine salp swarm optimization algorithm grasshopper optimization algorithm grey wolf optimization algorithm moth flame optimization algorithm bio-inspired optimization classification model and whale optimization algorithm
下载PDF
Analysis of the Resolution of Crime Using Predictive Modeling
14
作者 Keshab R. Dahal Jiba N. Dahal +1 位作者 Kenneth R. Goward Oluremi Abayami 《Open Journal of Statistics》 2020年第3期600-610,共11页
There has been evidence of crime in the US since colonization. In this article, we analyze the crime statistics of San Francisco and its resolution of crime recorded from January to September of the year 2018. We defi... There has been evidence of crime in the US since colonization. In this article, we analyze the crime statistics of San Francisco and its resolution of crime recorded from January to September of the year 2018. We define resolution of crime as a target variable and study its relationship with other variables. We make several classification models to predict resolution of crime using several data mining techniques and suggest the best model for predicting resolution. 展开更多
关键词 Machine Learning classification model Comparison Predictive modeling Resolution of Crime
下载PDF
From beasts to bytes:Revolutionizing zoological research with artificial intelligence
15
作者 Yu-Juan Zhang Zeyu Luo +2 位作者 Yawen Sun Junhao Liu Zongqing Chen 《Zoological Research》 SCIE CSCD 2023年第6期1115-1131,共17页
Since the late 2010s,Artificial Intelligence(AI)including machine learning,boosted through deep learning,has boomed as a vital tool to leverage computer vision,natural language processing and speech recognition in rev... Since the late 2010s,Artificial Intelligence(AI)including machine learning,boosted through deep learning,has boomed as a vital tool to leverage computer vision,natural language processing and speech recognition in revolutionizing zoological research.This review provides an overview of the primary tasks,core models,datasets,and applications of AI in zoological research,including animal classification,resource conservation,behavior,development,genetics and evolution,breeding and health,disease models,and paleontology.Additionally,we explore the challenges and future directions of integrating AI into this field.Based on numerous case studies,this review outlines various avenues for incorporating AI into zoological research and underscores its potential to enhance our understanding of the intricate relationships that exist within the animal kingdom.As we build a bridge between beast and byte realms,this review serves as a resource for envisioning novel AI applications in zoological research that have not yet been explored. 展开更多
关键词 Animal science Data extraction classification model Behavior analysis Biomolecular sequences analysis
下载PDF
Hunter Prey Optimization with Hybrid Deep Learning for Fake News Detection on Arabic Corpus
16
作者 Hala J.Alshahrani Abdulkhaleq Q.A.Hassan +5 位作者 Khaled Tarmissi Amal S.Mehanna Abdelwahed Motwakel Ishfaq Yaseen Amgad Atta Abdelmageed Mohamed I.Eldesouki 《Computers, Materials & Continua》 SCIE EI 2023年第5期4255-4272,共18页
Nowadays,the usage of socialmedia platforms is rapidly increasing,and rumours or false information are also rising,especially among Arab nations.This false information is harmful to society and individuals.Blocking an... Nowadays,the usage of socialmedia platforms is rapidly increasing,and rumours or false information are also rising,especially among Arab nations.This false information is harmful to society and individuals.Blocking and detecting the spread of fake news in Arabic becomes critical.Several artificial intelligence(AI)methods,including contemporary transformer techniques,BERT,were used to detect fake news.Thus,fake news in Arabic is identified by utilizing AI approaches.This article develops a new hunterprey optimization with hybrid deep learning-based fake news detection(HPOHDL-FND)model on the Arabic corpus.The HPOHDL-FND technique undergoes extensive data pre-processing steps to transform the input data into a useful format.Besides,the HPOHDL-FND technique utilizes long-term memory with a recurrent neural network(LSTM-RNN)model for fake news detection and classification.Finally,hunter prey optimization(HPO)algorithm is exploited for optimal modification of the hyperparameters related to the LSTM-RNN model.The performance validation of the HPOHDL-FND technique is tested using two Arabic datasets.The outcomes exemplified better performance over the other existing techniques with maximum accuracy of 96.57%and 93.53%on Covid19Fakes and satirical datasets,respectively. 展开更多
关键词 Arabic corpus fake news detection deep learning hunter prey optimizer classification model
下载PDF
Chained Dual-Generative Adversarial Network:A Generalized Defense Against Adversarial Attacks
17
作者 Amitoj Bir Singh Lalit Kumar Awasthi +3 位作者 Urvashi Mohammad Shorfuzzaman Abdulmajeed Alsufyani Mueen Uddin 《Computers, Materials & Continua》 SCIE EI 2023年第2期2541-2555,共15页
Neural networks play a significant role in the field of image classification.When an input image is modified by adversarial attacks,the changes are imperceptible to the human eye,but it still leads to misclassificatio... Neural networks play a significant role in the field of image classification.When an input image is modified by adversarial attacks,the changes are imperceptible to the human eye,but it still leads to misclassification of the images.Researchers have demonstrated these attacks to make production self-driving cars misclassify StopRoad signs as 45 Miles Per Hour(MPH)road signs and a turtle being misclassified as AK47.Three primary types of defense approaches exist which can safeguard against such attacks i.e.,Gradient Masking,Robust Optimization,and Adversarial Example Detection.Very few approaches use Generative Adversarial Networks(GAN)for Defense against Adversarial Attacks.In this paper,we create a new approach to defend against adversarial attacks,dubbed Chained Dual-Generative Adversarial Network(CD-GAN)that tackles the defense against adversarial attacks by minimizing the perturbations of the adversarial image using iterative oversampling and undersampling using GANs.CD-GAN is created using two GANs,i.e.,CDGAN’s Sub-ResolutionGANandCDGAN’s Super-ResolutionGAN.The first is CDGAN’s Sub-Resolution GAN which takes the original resolution input image and oversamples it to generate a lower resolution neutralized image.The second is CDGAN’s Super-Resolution GAN which takes the output of the CDGAN’s Sub-Resolution and undersamples,it to generate the higher resolution image which removes any remaining perturbations.Chained Dual GAN is formed by chaining these two GANs together.Both of these GANs are trained independently.CDGAN’s Sub-Resolution GAN is trained using higher resolution adversarial images as inputs and lower resolution neutralized images as output image examples.Hence,this GAN downscales the image while removing adversarial attack noise.CDGAN’s Super-Resolution GAN is trained using lower resolution adversarial images as inputs and higher resolution neutralized images as output images.Because of this,it acts as an Upscaling GAN while removing the adversarial attak noise.Furthermore,CD-GAN has a modular design such that it can be prefixed to any existing classifier without any retraining or extra effort,and 2542 CMC,2023,vol.74,no.2 can defend any classifier model against adversarial attack.In this way,it is a Generalized Defense against adversarial attacks,capable of defending any classifier model against any attacks.This enables the user to directly integrate CD-GANwith an existing production deployed classifier smoothly.CD-GAN iteratively removes the adversarial noise using a multi-step approach in a modular approach.It performs comparably to the state of the arts with mean accuracy of 33.67 while using minimal compute resources in training. 展开更多
关键词 Adversarial attacks GAN-based adversarial defense image classification models adversarial defense
下载PDF
A novel convolutional neural network with gated recurrent unit for automated speech emotion recognition and classification 被引量:1
18
作者 P.Ravi Prakash D.Anuradha +3 位作者 Javid Iqbal Mohammad Gouse Galety Ruby Singh S.Neelakandan 《Journal of Control and Decision》 EI 2023年第1期54-63,共10页
Automated Speech Emotion Recognition (SER) becomes more popular and has increased applicability.SER concentrates on the automatic identification of the emotional state of a humanbeing using speech signals. It mainly d... Automated Speech Emotion Recognition (SER) becomes more popular and has increased applicability.SER concentrates on the automatic identification of the emotional state of a humanbeing using speech signals. It mainly depends upon the in-depth analysis of the speech signal,extracts features containing emotional details from the speech signal, and utilises patternrecognition techniques for emotional state identification. The major problem in automatic SERis to extract discriminate, powerful, and emotional salient features from the acoustical content ofspeech signals. The proposed model aims to detect and classify three emotional states of speechsuch as happy, neutral, and sad. The presented model makes use of Convolution neural network– Gated Recurrent unit (CNN-GRU) based feature extraction technique which derives a set offeature vectors. A comprehensive simulation takes place using the Berlin German Database andSJTU Chinese Database which comprises numerous audio files under a collection of differentemotion labels. 展开更多
关键词 Emotion recognition speech recognition deep learning classification model Berlin emotion dataset
原文传递
An Optimal Big Data Analytics with Concept Drift Detection on High-Dimensional Streaming Data
19
作者 Romany F.Mansour Shaha Al-Otaibi +3 位作者 Amal Al-Rasheed Hanan Aljuaid Irina V.Pustokhina Denis A.Pustokhin 《Computers, Materials & Continua》 SCIE EI 2021年第9期2843-2858,共16页
Big data streams started becoming ubiquitous in recent years,thanks to rapid generation of massive volumes of data by different applications.It is challenging to apply existing data mining tools and techniques directl... Big data streams started becoming ubiquitous in recent years,thanks to rapid generation of massive volumes of data by different applications.It is challenging to apply existing data mining tools and techniques directly in these big data streams.At the same time,streaming data from several applications results in two major problems such as class imbalance and concept drift.The current research paper presents a new Multi-Objective Metaheuristic Optimization-based Big Data Analytics with Concept Drift Detection(MOMBD-CDD)method on High-Dimensional Streaming Data.The presented MOMBD-CDD model has different operational stages such as pre-processing,CDD,and classification.MOMBD-CDD model overcomes class imbalance problem by Synthetic Minority Over-sampling Technique(SMOTE).In order to determine the oversampling rates and neighboring point values of SMOTE,Glowworm Swarm Optimization(GSO)algorithm is employed.Besides,Statistical Test of Equal Proportions(STEPD),a CDD technique is also utilized.Finally,Bidirectional Long Short-Term Memory(Bi-LSTM)model is applied for classification.In order to improve classification performance and to compute the optimum parameters for Bi-LSTM model,GSO-based hyperparameter tuning process is carried out.The performance of the presented model was evaluated using high dimensional benchmark streaming datasets namely intrusion detection(NSL KDDCup)dataset and ECUE spam dataset.An extensive experimental validation process confirmed the effective outcome of MOMBD-CDD model.The proposed model attained high accuracy of 97.45%and 94.23%on the applied KDDCup99 Dataset and ECUE Spam datasets respectively. 展开更多
关键词 Streaming data concept drift classification model deep learning class imbalance data
下载PDF
Argumentative Comparative Analysis of Machine Learning on Coronary Artery Disease
20
作者 Keshab R. Dahal Yadu Gautam 《Open Journal of Statistics》 2020年第4期694-705,共12页
Cardiovascular disease (CVD) is a leading cause of death across the globe. Approximately 17.9 million of people die globally each year due to CVD, </span><span style="font-family:Verdana;">which ... Cardiovascular disease (CVD) is a leading cause of death across the globe. Approximately 17.9 million of people die globally each year due to CVD, </span><span style="font-family:Verdana;">which comprises 31% of all death. Coronary Artery Disease (CAD) is a common</span><span style="font-family:Verdana;"> type of CVD and is considered fatal.</span></span><span style="font-family:""> </span><span style="font-family:Verdana;">Predictive models that use machine learning algorithms may assist health workers in timely detection of CAD which ultimately reduce</span><span style="font-family:Verdana;">s</span><span style="font-family:Verdana;"> the mortality.</span><span style="font-family:""> </span><span style="font-family:""><span style="font-family:Verdana;">The main purpose of this study is to build a predictive model that provides doctors and health care providers with personalized information to implement better and more personalized treat</span><span style="font-family:Verdana;">ments for their patients. In</span></span><span style="font-family:""> </span><span style="font-family:Verdana;">this study, we use the publicly available Z-Alizadeh</span><span style="font-family:Verdana;"> Sani dataset which contains random samples of 216 cases with CAD and 87 normal controls with 56 different features. The binary variable “Cath” which represents case-control status, is used the target variable. We study its relationship with other predictors and develop classification models using the five different supervised classification machine learning algorithms: Logistic Regression (LR), Classification Tree</span><span style="font-family:""> </span><span style="font-family:Verdana;">with</span><span style="font-family:""> </span><span style="font-family:""><span style="font-family:Verdana;">Bagging (Bagging CART), </span><span style="font-family:Verdana;">Random </span><span style="font-family:Verdana;">Forest (RF), Support Vector Machine (SVM), and K-Nearest Neighbors (KNN).</span><span style="font-family:Verdana;"> These five classification models are used to investigate the detection of CAD. Finally, the performance of the machine learning algorithms is compared,</span></span><span style="font-family:""> </span><span style="font-family:Verdana;">and the best model is selected. Our results indicate that the SVM model is able to predict the presence of CAD more effectively and accurately than other models with an accuracy of 0.8947, sensitivity of 0.9434, specificity of 0.7826, and AUC of 0.8868. 展开更多
关键词 Machine Learning classification model Comparison Coronary Artery Disease Data Mining
下载PDF
上一页 1 2 下一页 到第
使用帮助 返回顶部