期刊文献+
共找到8篇文章
< 1 >
每页显示 20 50 100
An up -to -date comparative analysis of the KNN classifier distance metrics for text categorization
1
作者 Onder Coban 《Data Science and Informetrics》 2023年第2期67-78,共12页
Text categorization(TC)is one of the widely studied branches of text mining and has many applications in different domains.It tries to automatically assign a text document to one of the predefined categories often by ... Text categorization(TC)is one of the widely studied branches of text mining and has many applications in different domains.It tries to automatically assign a text document to one of the predefined categories often by using machine learning(ML)techniques.Choosing the best classifier in this task is the most important step in which k-Nearest Neighbor(KNN)is widely employed as a classifier as well as several other well-known ones such as Support Vector Machine,Multinomial Naive Bayes,Logistic Regression,and so on.The KNN has been extensively used for TC tasks and is one of the oldest and simplest methods for pattern classification.Its performance crucially relies on the distance metric used to identify nearest neighbors such that the most frequently observed label among these neighbors is used to classify an unseen test instance.Hence,in this paper,a comparative analysis of the KNN classifier is performed on a subset(i.e.,R8)of the Reuters-21578 benchmark dataset for TC.Experimental results are obtained by using different distance metrics as well as recently proposed distance learning metrics under different cases where the feature model and term weighting scheme are different.Our comparative evaluation of the results shows that Bray-Curtis and Linear Discriminant Analysis(LDA)are often superior to the other metrics and work well with raw term frequency weights. 展开更多
关键词 Text categorization k-nearest neighbor distance metric distance learning algorithms
原文传递
Data-driven Transient Stability Assessment Based on Kernel Regression and Distance Metric Learning 被引量:4
2
作者 Xianzhuang Liu Yong Min +2 位作者 Lei Chen Xiaohua Zhang Changyou Feng 《Journal of Modern Power Systems and Clean Energy》 SCIE EI CSCD 2021年第1期27-36,共10页
Transient stability assessment(TSA) is of great importance in power systems. For a given contingency, one of the most widely-used transient stability indices is the critical clearing time(CCT), which is a function of ... Transient stability assessment(TSA) is of great importance in power systems. For a given contingency, one of the most widely-used transient stability indices is the critical clearing time(CCT), which is a function of the pre-fault power flow.TSA can be regarded as the fitting of this function with the prefault power flow as the input and the CCT as the output. In this paper, a data-driven TSA model is proposed to estimate the CCT. The model is based on Mahalanobis-kernel regression,which employs the Mahalanobis distance in the kernel regression method to formulate a better regressor. A distance metric learning approach is developed to determine the problem-specific distance for TSA, which describes the dissimilarity between two power flow scenarios. The proposed model is more accurate compared to other data-driven methods, and its accuracy can be further improved by supplementing more training samples.Moreover, the model provides the probability density function of the CCT, and different estimations of CCT at different conservativeness levels. Test results verify the validity and the merits of the method. 展开更多
关键词 Transient stability assessment(TSA) critical clearing time(CCT) conservativeness level distance metric learning Nadaraya-Watson kernel regression Mahalanobis distance nonparametric regression DATA-DRIVEN
原文传递
Distance metric learning guided adaptive subspace semi-supervised clustering 被引量:1
3
作者 Xuesong Yin (12) yinxs@nuaa.edu.cn Enliang Hu (1) 《Frontiers of Computer Science》 SCIE EI CSCD 2011年第1期100-108,共9页
Most existing semi-supervised clustering algorithms are not designed for handling high- dimensional data. On the other hand, semi-supervised dimensionality reduction methods may not necessarily improve the clustering ... Most existing semi-supervised clustering algorithms are not designed for handling high- dimensional data. On the other hand, semi-supervised dimensionality reduction methods may not necessarily improve the clustering performance, due to the fact that the inherent relationship between subspace selection and clustering is ignored. In order to mitigate the above problems, we present a semi-supervised clustering algo- rithm using adaptive distance metric learning (SCADM) which performs semi-supervised clustering and distance metric learning simultaneously. SCADM applies the clustering results to learn a distance metric and then projects the data onto a low-dimensional space where the separability of the data is maximized. Experimental results on real-world data sets show that the proposed method can effectively deal with high-dimensional data and provides an appealing clustering performance. 展开更多
关键词 semi-supervise clustering pairwise con-straint distance metric learning data mining
原文传递
Multi-Attribute Couplings-Based Euclidean and Nominal Distances for Unlabeled Nominal Data
4
作者 Lei Gu Furong Zhang Li Ma 《Computers, Materials & Continua》 SCIE EI 2023年第6期5911-5928,共18页
Learning unlabeled data is a significant challenge that needs to han-dle complicated relationships between nominal values and attributes.Increas-ingly,recent research on learning value relations within and between att... Learning unlabeled data is a significant challenge that needs to han-dle complicated relationships between nominal values and attributes.Increas-ingly,recent research on learning value relations within and between attributes has shown significant improvement in clustering and outlier detection,etc.However,typical existing work relies on learning pairwise value relations but weakens or overlooks the direct couplings between multiple attributes.This paper thus proposes two novel and flexible multi-attribute couplings-based distance(MCD)metrics,which learn the multi-attribute couplings and their strengths in nominal data based on information theories:self-information,entropy,and mutual information,for measuring both numerical and nominal distances.MCD enables the application of numerical and nominal clustering methods on nominal data and quantifies the influence of involving and filtering multi-attribute couplings on distance learning and clustering perfor-mance.Substantial experiments evidence the above conclusions on 15 data sets against seven state-of-the-art distance measures with various feature selection methods for both numerical and nominal clustering. 展开更多
关键词 Nominal data distance metrics attribute couplings dissimilarity measures
下载PDF
Ranking Method for Complementary Judgment Matrixes with Fuzzy Numbers Based on Hausdorff Metric Distance 被引量:1
5
作者 侯福均 吴祈宗 《Journal of Beijing Institute of Technology》 EI CAS 2005年第4期458-461,共4页
A method for ranking complementary judgment matrixes with traspezoidal fuzzy numbers based on Hausdorff metric distance and fuzzy compromise decision approach is proposed. With regard to fuzzy number complementary jud... A method for ranking complementary judgment matrixes with traspezoidal fuzzy numbers based on Hausdorff metric distance and fuzzy compromise decision approach is proposed. With regard to fuzzy number complementary judgment matrixes given by a decider group whose members have various weights, the expert's information was aggregated first by means of simple weight average(SWA) method and Bonissone calculational method. Hence a matrix including all the experts' preference information was got. Then the matrix' column members were added up and the fuzzy evaluation values of the alternatives were got. Lastly, the Hausdorff metric distance and fuzzy compromise decision approach were used to rank the fuzzy evaluation values and then the ranking values of all the alternatives were got. Because exact numbers and triangular fuzzy numbers could all be transformed into trapezoidal fuzzy numbers, the method developed can rank complementary judgment matrixes with trapezoidal fuzzy numbers, triangular fuzzy numbers and exact numbers as well. An illustrative example is also given to verify the developed method and to demonstrate its feasibility and practicality. 展开更多
关键词 complementary judgment matrix trapezoidal fuzzy number Bonissone calculational method fuzzy compromise decision approach Hausdorff metric distance
下载PDF
Design of Evolutionary Algorithm Based Energy Efficient Clustering Approach for Vehicular Adhoc Networks
6
作者 VDinesh SSrinivasan +1 位作者 Gyanendra Prasad Joshi Woong Cho 《Computer Systems Science & Engineering》 SCIE EI 2023年第7期687-699,共13页
In a vehicular ad hoc network(VANET),a massive quantity of data needs to be transmitted on a large scale in shorter time durations.At the same time,vehicles exhibit high velocity,leading to more vehicle disconnections... In a vehicular ad hoc network(VANET),a massive quantity of data needs to be transmitted on a large scale in shorter time durations.At the same time,vehicles exhibit high velocity,leading to more vehicle disconnections.Both of these characteristics result in unreliable data communication in VANET.A vehicle clustering algorithm clusters the vehicles in groups employed in VANET to enhance network scalability and connection reliability.Clustering is considered one of the possible solutions for attaining effectual interaction in VANETs.But one such difficulty was reducing the cluster number under increasing transmitting nodes.This article introduces an Evolutionary Hide Objects Game Optimization based Distance Aware Clustering(EHOGO-DAC)Scheme for VANET.The major intention of the EHOGO-DAC technique is to portion the VANET into distinct sets of clusters by grouping vehicles.In addition,the DHOGO-EAC technique is mainly based on the HOGO algorithm,which is stimulated by old games,and the searching agent tries to identify hidden objects in a given space.The DHOGO-EAC technique derives a fitness function for the clustering process,including the total number of clusters and Euclidean distance.The experimental assessment of the DHOGO-EAC technique was carried out under distinct aspects.The comparison outcome stated the enhanced outcomes of the DHOGO-EAC technique compared to recent approaches. 展开更多
关键词 Vehicular networks CLUSTERING evolutionary algorithm fitness function distance metric
下载PDF
Research on the Pedestrian Re-Identification Method Based on Local Features and Gait Energy Images 被引量:1
7
作者 Xinliang Tang Xing Sun +3 位作者 Zhenzhou Wang Pingping Yu Ning Cao Yunfeng Xu 《Computers, Materials & Continua》 SCIE EI 2020年第8期1185-1198,共14页
The appearance of pedestrians can vary greatly from image to image,and different pedestrians may look similar in a given image.Such similarities and variabilities in the appearance and clothing of individuals make the... The appearance of pedestrians can vary greatly from image to image,and different pedestrians may look similar in a given image.Such similarities and variabilities in the appearance and clothing of individuals make the task of pedestrian re-identification very challenging.Here,a pedestrian re-identification method based on the fusion of local features and gait energy image(GEI)features is proposed.In this method,the human body is divided into four regions according to joint points.The color and texture of each region of the human body are extracted as local features,and GEI features of the pedestrian gait are also obtained.These features are then fused with the local and GEI features of the person.Independent distance measure learning using the cross-view quadratic discriminant analysis(XQDA)method is used to obtain the similarity of the metric function of the image pairs,and the final similarity is acquired by weight matching.Evaluation of experimental results by cumulative matching characteristic(CMC)curves reveals that,after fusion of local and GEI features,the pedestrian re-identification effect is improved compared with existing methods and is notably better than the recognition rate of pedestrian re-identification with a single feature. 展开更多
关键词 Local features gait energy image WEIGHT independent distance metric cross-view quadratic discriminant analysis
下载PDF
Weighted L^2-Estimates of Solutions for Damped Wave Equations with Variable Coefficients
8
作者 YAO Pengfei ZHANG Zhife 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2017年第6期1270-1292,共23页
The authors establish weighted L^2-estimates of solutions for the damped wave equations with variable coefficients utt-div A(x)▽u + au_t = 0 in IR^nunder the assumption a(x) ≥ a_0[1 + ρ(x)]^(-l),where a_0 > 0, l... The authors establish weighted L^2-estimates of solutions for the damped wave equations with variable coefficients utt-div A(x)▽u + au_t = 0 in IR^nunder the assumption a(x) ≥ a_0[1 + ρ(x)]^(-l),where a_0 > 0, l < 1, ρ(x) is the distance function of the metric g = A^(-1)(x) on IR^n. The authors show that these weighted L^2-estimates are closely related to the geometrical properties of the metric g = A^(-1)(x). 展开更多
关键词 distance function of a metric Riemannian metric wave equation with variable coefficients weighted L^2-estimate
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部