期刊文献+
共找到16篇文章
< 1 >
每页显示 20 50 100
Real-Time Spreading Thickness Monitoring of High-core Rockfill Dam Based on K-nearest Neighbor Algorithm 被引量:4
1
作者 Denghua Zhong Rongxiang Du +2 位作者 Bo Cui Binping Wu Tao Guan 《Transactions of Tianjin University》 EI CAS 2018年第3期282-289,共8页
During the storehouse surface rolling construction of a core rockfilldam, the spreading thickness of dam face is an important factor that affects the construction quality of the dam storehouse' rolling surface and... During the storehouse surface rolling construction of a core rockfilldam, the spreading thickness of dam face is an important factor that affects the construction quality of the dam storehouse' rolling surface and the overallquality of the entire dam. Currently, the method used to monitor and controlspreading thickness during the dam construction process is artificialsampling check after spreading, which makes it difficult to monitor the entire dam storehouse surface. In this paper, we present an in-depth study based on real-time monitoring and controltheory of storehouse surface rolling construction and obtain the rolling compaction thickness by analyzing the construction track of the rolling machine. Comparatively, the traditionalmethod can only analyze the rolling thickness of the dam storehouse surface after it has been compacted and cannot determine the thickness of the dam storehouse surface in realtime. To solve these problems, our system monitors the construction progress of the leveling machine and employs a real-time spreading thickness monitoring modelbased on the K-nearest neighbor algorithm. Taking the LHK core rockfilldam in Southwest China as an example, we performed real-time monitoring for the spreading thickness and conducted real-time interactive queries regarding the spreading thickness. This approach provides a new method for controlling the spreading thickness of the core rockfilldam storehouse surface. 展开更多
关键词 Core rockfill dam Dam storehouse surface construction Spreading thickness k-nearest neighbor algorithm Real-time monitor
下载PDF
Wireless Communication Signal Strength Prediction Method Based on the K-nearest Neighbor Algorithm
2
作者 Zhao Chen Ning Xiong +6 位作者 Yujue Wang Yong Ding Hengkui Xiang Chenjun Tang Lingang Liu Xiuqing Zou Decun Luo 《国际计算机前沿大会会议论文集》 2019年第1期238-240,共3页
Existing interference protection systems lack automatic evaluation methods to provide scientific, objective and accurate assessment results. To address this issue, this paper develops a layout scheme by geometrically ... Existing interference protection systems lack automatic evaluation methods to provide scientific, objective and accurate assessment results. To address this issue, this paper develops a layout scheme by geometrically modeling the actual scene, so that the hand-held full-band spectrum analyzer would be able to collect signal field strength values for indoor complex scenes. An improved prediction algorithm based on the K-nearest neighbor non-parametric kernel regression was proposed to predict the signal field strengths for the whole plane before and after being shield. Then the highest accuracy set of data could be picked out by comparison. The experimental results show that the improved prediction algorithm based on the K-nearest neighbor non-parametric kernel regression can scientifically and objectively predict the indoor complex scenes’ signal strength and evaluate the interference protection with high accuracy. 展开更多
关键词 INTERFERENCE protection k-nearest neighbor algorithm NON-PARAMETRIC KERNEL regression SIGNAL field STRENGTH
下载PDF
Enhancing Cancer Classification through a Hybrid Bio-Inspired Evolutionary Algorithm for Biomarker Gene Selection 被引量:1
3
作者 Hala AlShamlan Halah AlMazrua 《Computers, Materials & Continua》 SCIE EI 2024年第4期675-694,共20页
In this study,our aim is to address the problem of gene selection by proposing a hybrid bio-inspired evolutionary algorithm that combines Grey Wolf Optimization(GWO)with Harris Hawks Optimization(HHO)for feature selec... In this study,our aim is to address the problem of gene selection by proposing a hybrid bio-inspired evolutionary algorithm that combines Grey Wolf Optimization(GWO)with Harris Hawks Optimization(HHO)for feature selection.Themotivation for utilizingGWOandHHOstems fromtheir bio-inspired nature and their demonstrated success in optimization problems.We aimto leverage the strengths of these algorithms to enhance the effectiveness of feature selection in microarray-based cancer classification.We selected leave-one-out cross-validation(LOOCV)to evaluate the performance of both two widely used classifiers,k-nearest neighbors(KNN)and support vector machine(SVM),on high-dimensional cancer microarray data.The proposed method is extensively tested on six publicly available cancer microarray datasets,and a comprehensive comparison with recently published methods is conducted.Our hybrid algorithm demonstrates its effectiveness in improving classification performance,Surpassing alternative approaches in terms of precision.The outcomes confirm the capability of our method to substantially improve both the precision and efficiency of cancer classification,thereby advancing the development ofmore efficient treatment strategies.The proposed hybridmethod offers a promising solution to the gene selection problem in microarray-based cancer classification.It improves the accuracy and efficiency of cancer diagnosis and treatment,and its superior performance compared to other methods highlights its potential applicability in realworld cancer classification tasks.By harnessing the complementary search mechanisms of GWO and HHO,we leverage their bio-inspired behavior to identify informative genes relevant to cancer diagnosis and treatment. 展开更多
关键词 Bio-inspired algorithms BIOINFORMATICS cancer classification evolutionary algorithm feature selection gene expression grey wolf optimizer harris hawks optimization k-nearest neighbor support vector machine
下载PDF
A Study of EM Algorithm as an Imputation Method: A Model-Based Simulation Study with Application to a Synthetic Compositional Data
4
作者 Yisa Adeniyi Abolade Yichuan Zhao 《Open Journal of Modelling and Simulation》 2024年第2期33-42,共10页
Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear mode... Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear model is the most used technique for identifying hidden relationships between underlying random variables of interest. However, data quality is a significant challenge in machine learning, especially when missing data is present. The linear regression model is a commonly used statistical modeling technique used in various applications to find relationships between variables of interest. When estimating linear regression parameters which are useful for things like future prediction and partial effects analysis of independent variables, maximum likelihood estimation (MLE) is the method of choice. However, many datasets contain missing observations, which can lead to costly and time-consuming data recovery. To address this issue, the expectation-maximization (EM) algorithm has been suggested as a solution for situations including missing data. The EM algorithm repeatedly finds the best estimates of parameters in statistical models that depend on variables or data that have not been observed. This is called maximum likelihood or maximum a posteriori (MAP). Using the present estimate as input, the expectation (E) step constructs a log-likelihood function. Finding the parameters that maximize the anticipated log-likelihood, as determined in the E step, is the job of the maximization (M) phase. This study looked at how well the EM algorithm worked on a made-up compositional dataset with missing observations. It used both the robust least square version and ordinary least square regression techniques. The efficacy of the EM algorithm was compared with two alternative imputation techniques, k-Nearest Neighbor (k-NN) and mean imputation (), in terms of Aitchison distances and covariance. 展开更多
关键词 Compositional Data Linear Regression Model Least Square Method Robust Least Square Method Synthetic Data Aitchison Distance Maximum Likelihood Estimation Expectation-Maximization algorithm k-nearest neighbor and Mean imputation
下载PDF
A Memetic Algorithm With Competition for the Capacitated Green Vehicle Routing Problem 被引量:8
5
作者 Ling Wang Jiawen Lu 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2019年第2期516-526,共11页
In this paper, a memetic algorithm with competition(MAC) is proposed to solve the capacitated green vehicle routing problem(CGVRP). Firstly, the permutation array called traveling salesman problem(TSP) route is used t... In this paper, a memetic algorithm with competition(MAC) is proposed to solve the capacitated green vehicle routing problem(CGVRP). Firstly, the permutation array called traveling salesman problem(TSP) route is used to encode the solution, and an effective decoding method to construct the CGVRP route is presented accordingly. Secondly, the k-nearest neighbor(k NN) based initialization is presented to take use of the location information of the customers. Thirdly, according to the characteristics of the CGVRP, the search operators in the variable neighborhood search(VNS) framework and the simulated annealing(SA) strategy are executed on the TSP route for all solutions. Moreover, the customer adjustment operator and the alternative fuel station(AFS) adjustment operator on the CGVRP route are executed for the elite solutions after competition. In addition, the crossover operator is employed to share information among different solutions. The effect of parameter setting is investigated using the Taguchi method of design-ofexperiment to suggest suitable values. Via numerical tests, it demonstrates the effectiveness of both the competitive search and the decoding method. Moreover, extensive comparative results show that the proposed algorithm is more effective and efficient than the existing methods in solving the CGVRP. 展开更多
关键词 Capacitated green VEHICLE ROUTING problem(CGVRP) COMPETITION k-nearest neighbor(kNN) local INTENSIFICATION memetic algorithm
下载PDF
An Improved Whale Optimization Algorithm for Feature Selection 被引量:4
6
作者 Wenyan Guo Ting Liu +1 位作者 Fang Dai Peng Xu 《Computers, Materials & Continua》 SCIE EI 2020年第1期337-354,共18页
Whale optimization algorithm(WOA)is a new population-based meta-heuristic algorithm.WOA uses shrinking encircling mechanism,spiral rise,and random learning strategies to update whale’s positions.WOA has merit in term... Whale optimization algorithm(WOA)is a new population-based meta-heuristic algorithm.WOA uses shrinking encircling mechanism,spiral rise,and random learning strategies to update whale’s positions.WOA has merit in terms of simple calculation and high computational accuracy,but its convergence speed is slow and it is easy to fall into the local optimal solution.In order to overcome the shortcomings,this paper integrates adaptive neighborhood and hybrid mutation strategies into whale optimization algorithms,designs the average distance from itself to other whales as an adaptive neighborhood radius,and chooses to learn from the optimal solution in the neighborhood instead of random learning strategies.The hybrid mutation strategy is used to enhance the ability of algorithm to jump out of the local optimal solution.A new whale optimization algorithm(HMNWOA)is proposed.The proposed algorithm inherits the global search capability of the original algorithm,enhances the exploitation ability,improves the quality of the population,and thus improves the convergence speed of the algorithm.A feature selection algorithm based on binary HMNWOA is proposed.Twelve standard datasets from UCI repository test the validity of the proposed algorithm for feature selection.The experimental results show that HMNWOA is very competitive compared to the other six popular feature selection methods in improving the classification accuracy and reducing the number of features,and ensures that HMNWOA has strong search ability in the search feature space. 展开更多
关键词 Whale optimization algorithm Filter and Wrapper model k-nearest neighbor method Adaptive neighborhood hybrid mutation
下载PDF
Research on Initialization on EM Algorithm Based on Gaussian Mixture Model 被引量:4
7
作者 Ye Li Yiyan Chen 《Journal of Applied Mathematics and Physics》 2018年第1期11-17,共7页
The EM algorithm is a very popular maximum likelihood estimation method, the iterative algorithm for solving the maximum likelihood estimator when the observation data is the incomplete data, but also is very effectiv... The EM algorithm is a very popular maximum likelihood estimation method, the iterative algorithm for solving the maximum likelihood estimator when the observation data is the incomplete data, but also is very effective algorithm to estimate the finite mixture model parameters. However, EM algorithm can not guarantee to find the global optimal solution, and often easy to fall into local optimal solution, so it is sensitive to the determination of initial value to iteration. Traditional EM algorithm select the initial value at random, we propose an improved method of selection of initial value. First, we use the k-nearest-neighbor method to delete outliers. Second, use the k-means to initialize the EM algorithm. Compare this method with the original random initial value method, numerical experiments show that the parameter estimation effect of the initialization of the EM algorithm is significantly better than the effect of the original EM algorithm. 展开更多
关键词 EM algorithm GAUSSIAN MIXTURE Model k-nearest neighbor K-MEANS algorithm INITIALIZATION
下载PDF
Rapid prediction of flow and concentration fields in solid-liquid suspensions of slurry electrolysis tanks
8
作者 Tingting Lu Kang Li +4 位作者 Hongliang Zhao Wei Wang Zhenhao Zhou Xiaoyi Cai Fengqin Liu 《International Journal of Minerals,Metallurgy and Materials》 SCIE EI CAS CSCD 2024年第9期2006-2016,共11页
Slurry electrolysis(SE),as a hydrometallurgical process,has the characteristic of a multitank series connection,which leads to various stirring conditions and a complex solid suspension state.The computational fluid d... Slurry electrolysis(SE),as a hydrometallurgical process,has the characteristic of a multitank series connection,which leads to various stirring conditions and a complex solid suspension state.The computational fluid dynamics(CFD),which requires high computing resources,and a combination with machine learning was proposed to construct a rapid prediction model for the liquid flow and solid concentration fields in a SE tank.Through scientific selection of calculation samples via orthogonal experiments,a comprehensive dataset covering a wide range of conditions was established while effectively reducing the number of simulations and providing reasonable weights for each factor.Then,a prediction model of the SE tank was constructed using the K-nearest neighbor algorithm.The results show that with the increase in levels of orthogonal experiments,the prediction accuracy of the model improved remarkably.The model established with four factors and nine levels can accurately predict the flow and concentration fields,and the regression coefficients of average velocity and solid concentration were 0.926 and 0.937,respectively.Compared with traditional CFD,the response time of field information prediction in this model was reduced from 75 h to 20 s,which solves the problem of serious lag in CFD applied alone to actual production and meets real-time production control requirements. 展开更多
关键词 slurry electrolysis solid-liquid suspension computational fluid dynamics k-nearest neighbor algorithm rapid prediction
下载PDF
改进的LANDMARC算法在隧道人员定位中的应用 被引量:3
9
作者 王瑞峰 席旭宁 《铁道学报》 EI CAS CSCD 北大核心 2016年第1期70-74,共5页
介绍LANDMARC k-邻居定位算法原理,结合信号回波强度传播损耗模型,考虑隧道环境的恶劣性以及电磁波的折射、反射、多径效应等因素的影响,将该算法进行改进,引入参考标签可信度的概念,对其权重进行修订,解决标签的失效和信号强度读取错... 介绍LANDMARC k-邻居定位算法原理,结合信号回波强度传播损耗模型,考虑隧道环境的恶劣性以及电磁波的折射、反射、多径效应等因素的影响,将该算法进行改进,引入参考标签可信度的概念,对其权重进行修订,解决标签的失效和信号强度读取错误的弊端;动态设置参考标签的数目,有效提高定位精度。利用Intel公司的R1000射频识别开发平台,将改进算法应用到隧道人员定位系统中。实验结果表明,改进的算法具有更高的定位精度:1m以内的标签占70%,95%的标签定位误差小于1.5m,且最大误差控制在2m以内,对提高隧道人员的安全管理水平有着重要作用。 展开更多
关键词 射频识别 隧道人员定位 LANDMARC k-邻居算法 可信度
下载PDF
A Two-Stage Vehicle Type Recognition Method Combining the Most Effective Gabor Features 被引量:5
10
作者 Wei Sun Xiaorui Zhang +2 位作者 Xiaozheng He Yan Jin Xu Zhang 《Computers, Materials & Continua》 SCIE EI 2020年第12期2489-2510,共22页
Vehicle type recognition(VTR)is an important research topic due to its significance in intelligent transportation systems.However,recognizing vehicle type on the real-world images is challenging due to the illuminatio... Vehicle type recognition(VTR)is an important research topic due to its significance in intelligent transportation systems.However,recognizing vehicle type on the real-world images is challenging due to the illumination change,partial occlusion under real traffic environment.These difficulties limit the performance of current state-of-art methods,which are typically based on single-stage classification without considering feature availability.To address such difficulties,this paper proposes a two-stage vehicle type recognition method combining the most effective Gabor features.The first stage leverages edge features to classify vehicles by size into big or small via a similarity k-nearest neighbor classifier(SKNNC).Further the more specific vehicle type such as bus,truck,sedan or van is recognized by the second stage classification,which leverages the most effective Gabor features extracted by a set of Gabor wavelet kernels on the partitioned key patches via a kernel sparse representation-based classifier(KSRC).A verification and correction step based on minimum residual analysis is proposed to enhance the reliability of the VTR.To improve VTR efficiency,the most effective Gabor features are selected through gray relational analysis that leverages the correlation between Gabor feature image and the original image.Experimental results demonstrate that the proposed method not only improves the accuracy of VTR but also enhances the recognition robustness to illumination change and partial occlusion. 展开更多
关键词 Vehicle type recognition improved Canny algorithm Gabor filter k-nearest neighbor classification grey relational analysis kernel sparse representation two-stage classification
下载PDF
Position Vectors Based Efcient Indoor Positioning System 被引量:1
11
作者 Ayesha Javed Mir Yasir Umair +3 位作者 Alina Mirza Abdul Wakeel Fazli Subhan Wazir Zada Khan 《Computers, Materials & Continua》 SCIE EI 2021年第5期1781-1799,共19页
With the advent and advancements in the wireless technologies,Wi-Fi ngerprinting-based Indoor Positioning System(IPS)has become one of the most promising solutions for localization in indoor environments.Unlike the ou... With the advent and advancements in the wireless technologies,Wi-Fi ngerprinting-based Indoor Positioning System(IPS)has become one of the most promising solutions for localization in indoor environments.Unlike the outdoor environment,the lack of line-of-sight propagation in an indoor environment keeps the interest of the researchers to develop efcient and precise positioning systems that can later be incorporated in numerous applications involving Internet of Things(IoTs)and green computing.In this paper,we have proposed a technique that combines the capabilities of multiple algorithms to overcome the complexities experienced indoors.Initially,in the database development phase,Motley Kennan propagation model is used with Hough transformation to classify,detect,and assign different attenuation factors related to the types of walls.Furthermore,important parameters for system accuracy,such as,placement and geometry of Access Points(APs)in the coverage area are also considered.New algorithm for deployment of an additional AP to an already existing infrastructure is proposed by using Genetic Algorithm(GA)coupled with Enhanced Dilution of Precision(EDOP).Moreover,classication algorithm based on k-Nearest Neighbors(k-NN)is used to nd the position of a stationary or mobile user inside the given coverage area.For k-NN to provide low localization error and reduced space dimensionality,three APs are required to be selected optimally.In this paper,we have suggested an idea to select APs based on Position Vectors(PV)as an input to the localization algorithm.Deducing from our comprehensive investigations,it is revealed that the accuracy of indoor positioning system using the proposed technique unblemished the existing solutions with signicant improvements. 展开更多
关键词 Indoor positioning systems Internet of Things access points position vectors genetic algorithm k-nearest neighbors
下载PDF
A systematic approach in load disaggregation utilizing a multi-stage classification algorithm for consumer electrical appliances classification 被引量:5
12
作者 Chuan Choong YANG Chit Siang SOH Vooi Voon YAP 《Frontiers in Energy》 SCIE CSCD 2019年第2期386-398,共13页
The potential to save energy in existing consumer electrical appliances is very high. One of the ways to achieve energy saving and improve energy use awareness is to recognize the energy consumption of individual elec... The potential to save energy in existing consumer electrical appliances is very high. One of the ways to achieve energy saving and improve energy use awareness is to recognize the energy consumption of individual electrical appliances. To recognize the energy consumption of consumer electrical appliances, the load disaggregation methodology is utilized. Non-intrusive appliance load monitoring (NIALM) is a load disaggrega-tion methodology that disaggregates the sum of power consumption in a single point into the power consumption of individual electrical appliances. In this study, load disaggregation is performed through voltage and current waveform, known as the V-I trajectory. The classification algorithm performs cropping and image pyramid reduction of the V-I trajectory plot template images before utilizing the principal component analysis (PCA) and the k-nearest neighbor (k-NN) algorithm. The novelty of this paper is to establish a systematic approach of load disaggregation through V-I trajectory-based load signature images by utilizing a multi-stage classification algorithm methodol-ogy. The contribution of this paper is in utilizing the “k- value,” the number of closest data points to the nearest neighbor, in the k-NN algorithm to be effective in classification of electrical appliances. The results of the multi-stage classification algorithm implementation have been discussed and the idea on future work has also been proposed. 展开更多
关键词 LOAD DISAGGREGATION voltage-current (V-I) trajectory MULTI-STAGE CLASSIFICATION algorithm principal component analysis (PCA) k-nearest neighbor (k-NN)
原文传递
An Optimization System for Intent Recognition Based on an Improved KNN Algorithm with Minimal Feature Set for Powered Knee Prosthesis
13
作者 Yao Zhang Xu Wang +6 位作者 Haohua Xiu Lei Ren Yang Han Yongxin Ma Wei Chen Guowu Wei Luquan Ren 《Journal of Bionic Engineering》 SCIE EI CSCD 2023年第6期2619-2632,共14页
In this article,a new optimization system that uses few features to recognize locomotion with high classification accuracy is proposed.The optimization system consists of three parts.First,the features of the mixed me... In this article,a new optimization system that uses few features to recognize locomotion with high classification accuracy is proposed.The optimization system consists of three parts.First,the features of the mixed mechanical signal data are extracted from each analysis window of 200 ms after each foot contact event.Then,the Binary version of the hybrid Gray Wolf Optimization and Particle Swarm Optimization(BGWOPSO)algorithm is used to select features.And,the selected features are optimized and assigned different weights by the Biogeography-Based Optimization(BBO)algorithm.Finally,an improved K-Nearest Neighbor(KNN)classifier is employed for intention recognition.This classifier has the advantages of high accuracy,few parameters as well as low memory burden.Based on data from eight patients with transfemoral amputations,the optimization system is evaluated.The numerical results indicate that the proposed model can recognize nine daily locomotion modes(i.e.,low-,mid-,and fast-speed level-ground walking,ramp ascent/decent,stair ascent/descent,and sit/stand)by only seven features,with an accuracy of 96.66%±0.68%.As for real-time prediction on a powered knee prosthesis,the shortest prediction time is only 9.8 ms.These promising results reveal the potential of intention recognition based on the proposed system for high-level control of the prosthetic knee. 展开更多
关键词 Intent recognition k-nearest neighbor algorithm Powered knee prosthesis Locomotion mode classification
原文传递
Multi-color space threshold segmentation and self-learning k-NN algorithm for surge test EUT status identification
14
作者 Jian HUANG Gui-xiong LIU 《Frontiers of Mechanical Engineering》 SCIE CSCD 2016年第3期311-315,共5页
The identification of targets varies in different surge tests. A multi-color space threshold segmentation and self-learning k-nearest neighbor algorithm (k-NN) for equipment under test status identification was prop... The identification of targets varies in different surge tests. A multi-color space threshold segmentation and self-learning k-nearest neighbor algorithm (k-NN) for equipment under test status identification was proposed after using feature matching to identify equipment status had to train new patterns every time before testing. First, color space (L*a*b*, hue saturation lightness (HSL), hue saturation value (HSV)) to segment was selected according to the high luminance points ratio and white luminance points ratio of the image. Second, the unknown class sample Sr was classified by the k-NN algorithm with training set T~ according to the feature vector, which was formed from number ofpixels, eccentricity ratio, compact- ness ratio, and Euler's numbers. Last, while the classification confidence coefficient equaled k, made Sr as one sample ofpre-training set Tz'. The training set Tz increased to Tz+1 by Tz' if Tz' was saturated. In nine series of illuminant, indicator light, screen, and disturbances samples (a total of 21600 frames), the algorithm had a 98.65% identification accuracy, also selected five groups of samples to enlarge the training set from To to T5 by itself. Keywords multi-color space, k-nearest neighbor algorithm (k-NN), self-learning, surge test 展开更多
关键词 multi-color space k-nearest neighbor algorithm (k-NN) SELF-LEARNING surge test
原文传递
Adaptive Fault Detection Scheme Using an Optimized Self-healing Ensemble Machine Learning Algorithm
15
作者 Levent Yavuz Ahmet Soran +2 位作者 AhmetÖnen Xiangjun Li S.M.Muyeen 《CSEE Journal of Power and Energy Systems》 SCIE EI CSCD 2022年第4期1145-1156,共12页
This paper proposes a new cost-efficient,adaptive,and self-healing algorithm in real time that detects faults in a short period with high accuracy,even in the situations when it is difficult to detect.Rather than usin... This paper proposes a new cost-efficient,adaptive,and self-healing algorithm in real time that detects faults in a short period with high accuracy,even in the situations when it is difficult to detect.Rather than using traditional machine learning(ML)algorithms or hybrid signal processing techniques,a new framework based on an optimization enabled weighted ensemble method is developed that combines essential ML algorithms.In the proposed method,the system will select and compound appropriate ML algorithms based on Particle Swarm Optimization(PSO)weights.For this purpose,power system failures are simulated by using the PSCA D-Python co-simulation.One of the salient features of this study is that the proposed solution works on real-time raw data without using any pre-computational techniques or pre-stored information.Therefore,the proposed technique will be able to work on different systems,topologies,or data collections.The proposed fault detection technique is validated by using PSCAD-Python co-simulation on a modified and standard IEEE-14 and standard IEEE-39 bus considering network faults which are difficult to detect. 展开更多
关键词 Decision tree(DT) ensemble machine learning algorithm fault detection islanding operation k-nearest neighbor(kNN) linear discriminant analysis(LDA) logistic regression(LR) Naive Bayes(NB) self-healing algorithm
原文传递
An up -to -date comparative analysis of the KNN classifier distance metrics for text categorization
16
作者 Onder Coban 《Data Science and Informetrics》 2023年第2期67-78,共12页
Text categorization(TC)is one of the widely studied branches of text mining and has many applications in different domains.It tries to automatically assign a text document to one of the predefined categories often by ... Text categorization(TC)is one of the widely studied branches of text mining and has many applications in different domains.It tries to automatically assign a text document to one of the predefined categories often by using machine learning(ML)techniques.Choosing the best classifier in this task is the most important step in which k-Nearest Neighbor(KNN)is widely employed as a classifier as well as several other well-known ones such as Support Vector Machine,Multinomial Naive Bayes,Logistic Regression,and so on.The KNN has been extensively used for TC tasks and is one of the oldest and simplest methods for pattern classification.Its performance crucially relies on the distance metric used to identify nearest neighbors such that the most frequently observed label among these neighbors is used to classify an unseen test instance.Hence,in this paper,a comparative analysis of the KNN classifier is performed on a subset(i.e.,R8)of the Reuters-21578 benchmark dataset for TC.Experimental results are obtained by using different distance metrics as well as recently proposed distance learning metrics under different cases where the feature model and term weighting scheme are different.Our comparative evaluation of the results shows that Bray-Curtis and Linear Discriminant Analysis(LDA)are often superior to the other metrics and work well with raw term frequency weights. 展开更多
关键词 Text categorization k-nearest neighbor distance metric distance learning algorithms
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部