期刊文献+
共找到8篇文章
< 1 >
每页显示 20 50 100
Clustering Categorical Data:A Cluster Ensemble Approach
1
作者 何增友 Xu +2 位作者 Xiaofei Deng Shengchun 《High Technology Letters》 EI CAS 2003年第4期8-12,共5页
Clustering categorical data, an integral part of data mining,has attracted much attention recently. In this paper, the authors formally define the categorical data clustering problem as an optimization problem from th... Clustering categorical data, an integral part of data mining,has attracted much attention recently. In this paper, the authors formally define the categorical data clustering problem as an optimization problem from the viewpoint of cluster ensemble, and apply cluster ensemble approach for clustering categorical data. Experimental results on real datasets show that better clustering accuracy can be obtained by comparing with existing categorical data clustering algorithms. 展开更多
关键词 clusterING categorical data cluster ensemble data mining
下载PDF
Classification of Adversarial Attacks Using Ensemble Clustering Approach
2
作者 Pongsakorn Tatongjai Tossapon Boongoen +2 位作者 Natthakan Iam-On Nitin Naik Longzhi Yang 《Computers, Materials & Continua》 SCIE EI 2023年第2期2479-2498,共20页
As more business transactions and information services have been implemented via communication networks,both personal and organization assets encounter a higher risk of attacks.To safeguard these,a perimeter defence l... As more business transactions and information services have been implemented via communication networks,both personal and organization assets encounter a higher risk of attacks.To safeguard these,a perimeter defence likeNIDS(network-based intrusion detection system)can be effective for known intrusions.There has been a great deal of attention within the joint community of security and data science to improve machine-learning based NIDS such that it becomes more accurate for adversarial attacks,where obfuscation techniques are applied to disguise patterns of intrusive traffics.The current research focuses on non-payload connections at the TCP(transmission control protocol)stack level that is applicable to different network applications.In contrary to the wrapper method introduced with the benchmark dataset,three new filter models are proposed to transform the feature space without knowledge of class labels.These ECT(ensemble clustering based transformation)techniques,i.e.,ECT-Subspace,ECT-Noise and ECT-Combined,are developed using the concept of ensemble clustering and three different ensemble generation strategies,i.e.,random feature subspace,feature noise injection and their combinations.Based on the empirical study with published dataset and four classification algorithms,new models usually outperform that original wrapper and other filter alternatives found in the literature.This is similarly summarized from the first experiment with basic classification of legitimate and direct attacks,and the second that focuses on recognizing obfuscated intrusions.In addition,analysis of algorithmic parameters,i.e.,ensemble size and level of noise,is provided as a guideline for a practical use. 展开更多
关键词 Intrusion detection adversarial attack machine learning feature transformation ensemble clustering
下载PDF
An air combat maneuver pattern extraction based on time series segmentation and clustering analysis
3
作者 Zhifei Xi Yingxin Kou +2 位作者 Zhanwu Li Yue Lv You Li 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2024年第6期149-162,共14页
Target maneuver recognition is a prerequisite for air combat situation awareness,trajectory prediction,threat assessment and maneuver decision.To get rid of the dependence of the current target maneuver recognition me... Target maneuver recognition is a prerequisite for air combat situation awareness,trajectory prediction,threat assessment and maneuver decision.To get rid of the dependence of the current target maneuver recognition method on empirical criteria and sample data,and automatically and adaptively complete the task of extracting the target maneuver pattern,in this paper,an air combat maneuver pattern extraction based on time series segmentation and clustering analysis is proposed by combining autoencoder,G-G clustering algorithm and the selective ensemble clustering analysis algorithm.Firstly,the autoencoder is used to extract key features of maneuvering trajectory to remove the impacts of redundant variables and reduce the data dimension;Then,taking the time information into account,the segmentation of Maneuver characteristic time series is realized with the improved FSTS-AEGG algorithm,and a large number of maneuver primitives are extracted;Finally,the maneuver primitives are grouped into some categories by using the selective ensemble multiple time series clustering algorithm,which can prove that each class represents a maneuver action.The maneuver pattern extraction method is applied to small scale air combat trajectory and can recognize and correctly partition at least 71.3%of maneuver actions,indicating that the method is effective and satisfies the requirements for engineering accuracy.In addition,this method can provide data support for various target maneuvering recognition methods proposed in the literature,greatly reduce the workload and improve the recognition accuracy. 展开更多
关键词 Maneuver pattern extraction Data mining Fuzzy segmentation Selective ensemble clustering
下载PDF
Analysis of users’ electricity consumption behavior based on ensemble clustering 被引量:7
4
作者 Qi Zhao Haolin Li +2 位作者 Xinying Wang Tianjiao Pu Jiye Wang 《Global Energy Interconnection》 2019年第6期479-489,共11页
Due to the increase in the number of smart meter devices,a power grid generates a large amount of data.Analyzing the data can help in understanding the users’electricity consumption behavior and demands;thus,enabling... Due to the increase in the number of smart meter devices,a power grid generates a large amount of data.Analyzing the data can help in understanding the users’electricity consumption behavior and demands;thus,enabling better service to be provided to them.Performing power load profile clustering is the basis for mining the users’electricity consumption behavior.By examining the complexity,randomness,and uncertainty of the users’electricity consumption behavior,this paper proposes an ensemble clustering method to analyze this behavior.First,principle component analysis(PCA)is used to reduce the dimensions of the data.Subsequently,the single clustering method is used,and the majority is selected for integrated clustering.As a result,the users’electricity consumption behavior is classified into different modes,and their characteristics are analyzed in detail.This paper examines the electricity power data of 19 real users in China for simulation purposes.This manuscript provides a thorough analysis along with suggestions for the users’weekly electricity consumption behavior.The results verify the effectiveness of the proposed method. 展开更多
关键词 Users’electricity consumption ensemble clustering Dimensionality reduction cluster validity
下载PDF
Stable Label-Specific Features Generation for Multi-Label Learning via Mixture-Based Clustering Ensemble
5
作者 Yi-Bo Wang Jun-Yi Hang Min-Ling Zhang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2022年第7期1248-1261,共14页
Multi-label learning deals with objects associated with multiple class labels,and aims to induce a predictive model which can assign a set of relevant class labels for an unseen instance.Since each class might possess... Multi-label learning deals with objects associated with multiple class labels,and aims to induce a predictive model which can assign a set of relevant class labels for an unseen instance.Since each class might possess its own characteristics,the strategy of extracting label-specific features has been widely employed to improve the discrimination process in multi-label learning,where the predictive model is induced based on tailored features specific to each class label instead of the identical instance representations.As a representative approach,LIFT generates label-specific features by conducting clustering analysis.However,its performance may be degraded due to the inherent instability of the single clustering algorithm.To improve this,a novel multi-label learning approach named SENCE(stable label-Specific features gENeration for multi-label learning via mixture-based Clustering Ensemble)is proposed,which stabilizes the generation process of label-specific features via clustering ensemble techniques.Specifically,more stable clustering results are obtained by firstly augmenting the original instance repre-sentation with cluster assignments from base clusters and then fitting a mixture model via the expectation-maximization(EM)algorithm.Extensive experiments on eighteen benchmark data sets show that SENCE performs better than LIFT and other well-established multi-label learning algorithms. 展开更多
关键词 clustering ensemble expectation-maximization al-gorithm label-specific features multi-label learning
下载PDF
Information Theoretic Weighted Fuzzy Clustering Ensemble
6
作者 Yixuan Wang Liping Yuan +4 位作者 Harish Garg Ali Bagherinia Hamïd Parvïn Kim-Hung Pho Zulkei Mansor 《Computers, Materials & Continua》 SCIE EI 2021年第4期369-392,共24页
In order to improve performance and robustness of clustering,it is proposed to generate and aggregate a number of primary clusters via clustering ensemble technique.Fuzzy clustering ensemble approaches attempt to impr... In order to improve performance and robustness of clustering,it is proposed to generate and aggregate a number of primary clusters via clustering ensemble technique.Fuzzy clustering ensemble approaches attempt to improve the performance of fuzzy clustering tasks.However,in these approaches,cluster(or clustering)reliability has not paid much attention to.Ignoring cluster(or clustering)reliability makes these approaches weak in dealing with low-quality base clustering methods.In this paper,we have utilized cluster unreliability estimation and local weighting strategy to propose a new fuzzy clustering ensemble method which has introduced Reliability Based weighted co-association matrix Fuzzy C-Means(RBFCM),Reliability Based Graph Partitioning(RBGP)and Reliability Based Hyper Clustering(RBHC)as three new fuzzy clustering consensus functions.Our fuzzy clustering ensemble approach works based on fuzzy cluster unreliability estimation.Cluster unreliability is estimated according to an entropic criterion using the cluster labels in the entire ensemble.To do so,the new metric is dened to estimate the fuzzy cluster unreliability;then,the reliability value of any cluster is determined using a Reliability Driven Cluster Indicator(RDCI).The time complexities of RBHC and RBGP are linearly proportional with thnumber of data objects.Performance and robustness of the proposed method are experimentally evaluated for some benchmark datasets.The experimental results demonstrate efciency and suitability of the proposed method. 展开更多
关键词 Fuzzy clustering ensemble cluster unreliability consensus function
下载PDF
Adaptive Spectral Clustering Ensemble Selection via Resampling and Population-Based Incremental Learning Algorithm 被引量:5
7
作者 XU Yuanchun JIA Jianhua 《Wuhan University Journal of Natural Sciences》 CAS 2011年第3期228-236,共9页
In this paper, we explore a novel ensemble method for spectral clustering. In contrast to the traditional clustering ensemble methods that combine all the obtained clustering results, we propose the adaptive spectral ... In this paper, we explore a novel ensemble method for spectral clustering. In contrast to the traditional clustering ensemble methods that combine all the obtained clustering results, we propose the adaptive spectral clustering ensemble method to achieve a better clustering solution. This method can adaptively assess the number of the component members, which is not owned by many other algorithms. The component clusterings of the ensemble system are generated by spectral clustering (SC) which bears some good characteristics to engender the diverse committees. The selection process works by evaluating the generated component spectral clustering through resampling technique and population-based incremental learning algorithm (PBIL). Experimental results on UCI datasets demonstrate that the proposed algorithm can achieve better results compared with traditional clustering ensemble methods, especially when the number of component clusterings is large. 展开更多
关键词 spectral clustering clustering ensemble selective ensemble RESAMPLING population-based incremental learning algorithm (PBIL) data clustering
原文传递
Wind and Photovoltaic Power Time Series Data Aggregation Method Based on an Ensemble Clustering and Markov Chain 被引量:4
8
作者 Jingxin Jin Lin Ye +4 位作者 Jiachen Li Yongning Zhao Peng Lu Weisheng Wang Xuebin Wang 《CSEE Journal of Power and Energy Systems》 SCIE EI CSCD 2022年第3期757-768,共12页
Reducing the input wind and photovoltaic power time series data can improve the efficiency of time sequential simulations.In this paper,a wind and photovoltaic power time series data aggregation method based on an ens... Reducing the input wind and photovoltaic power time series data can improve the efficiency of time sequential simulations.In this paper,a wind and photovoltaic power time series data aggregation method based on an ensemble clustering and Markov chain(ECMC)is proposed.The ECMC method can effectively reduce redundant information in the data.First,the wind and photovoltaic power time series data were divided into scenarios,and ensemble clustering was used to cluster the divided scenarios.At the same time,the Davies-Bouldin Index(DBI)is adopted to select the optimal number of clusters.Then,according to the temporal correlation between wind and photovoltaic scenarios,the wind and photovoltaic clustering results are merged and reduced to form a set of combined typical day scenarios that can reflect the characteristics of historical data within the calculation period.Finally,based on the Markov Chain,the state transition probability matrix of various combined typical day scenarios is constructed,and the aggregation state sequence of random length is generated,and then,the combined typical day scenarios of wind and photovoltaic were sampled in a sequential one-way sequence according to the state sequence and then are built into a representative wind and photovoltaic power time series aggregation sequence.A provincial power grid was chosen as an example to compare the multiple evaluation indexes of different aggregation methods.The results show that the ECMC aggregation method improves the accuracy and efficiency of time sequential simulations. 展开更多
关键词 Aggregation method ensemble clustering markov chain time sequential simulations wind and photovoltaic power time series data
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部