期刊文献+
共找到9篇文章
< 1 >
每页显示 20 50 100
Analysis of users’ electricity consumption behavior based on ensemble clustering 被引量:7
1
作者 Qi Zhao Haolin Li +2 位作者 Xinying Wang Tianjiao Pu Jiye Wang 《Global Energy Interconnection》 2019年第6期479-489,共11页
Due to the increase in the number of smart meter devices,a power grid generates a large amount of data.Analyzing the data can help in understanding the users’electricity consumption behavior and demands;thus,enabling... Due to the increase in the number of smart meter devices,a power grid generates a large amount of data.Analyzing the data can help in understanding the users’electricity consumption behavior and demands;thus,enabling better service to be provided to them.Performing power load profile clustering is the basis for mining the users’electricity consumption behavior.By examining the complexity,randomness,and uncertainty of the users’electricity consumption behavior,this paper proposes an ensemble clustering method to analyze this behavior.First,principle component analysis(PCA)is used to reduce the dimensions of the data.Subsequently,the single clustering method is used,and the majority is selected for integrated clustering.As a result,the users’electricity consumption behavior is classified into different modes,and their characteristics are analyzed in detail.This paper examines the electricity power data of 19 real users in China for simulation purposes.This manuscript provides a thorough analysis along with suggestions for the users’weekly electricity consumption behavior.The results verify the effectiveness of the proposed method. 展开更多
关键词 Users’electricity consumption Ensemble clustering Dimensionality reduction cluster validity
下载PDF
Internal Validity Index for Fuzzy Clustering Based on Relative Uncertainty
2
作者 Refik Tanju Sirmen Burak Berk Üstündag 《Computers, Materials & Continua》 SCIE EI 2022年第8期2909-2926,共18页
Unsupervised clustering and clustering validity are used as essential instruments of data analytics.Despite clustering being realized under uncertainty,validity indices do not deliver any quantitative evaluation of th... Unsupervised clustering and clustering validity are used as essential instruments of data analytics.Despite clustering being realized under uncertainty,validity indices do not deliver any quantitative evaluation of the uncertainties in the suggested partitionings.Also,validity measures may be biased towards the underlying clustering method.Moreover,neglecting a confidence requirement may result in over-partitioning.In the absence of an error estimate or a confidence parameter,probable clustering errors are forwarded to the later stages of the system.Whereas,having an uncertainty margin of the projected labeling can be very fruitful for many applications such as machine learning.Herein,the validity issue was approached through estimation of the uncertainty and a novel low complexity index proposed for fuzzy clustering.It involves only uni-dimensional membership weights,regardless of the data dimension,stipulates no specific distribution,and is independent of the underlying similarity measure.Inclusive tests and comparisons returned that it can reliably estimate the optimum number of partitions under different data distributions,besides behaving more robust to over partitioning.Also,in the comparative correlation analysis between true clustering error rates and some known internal validity indices,the suggested index exhibited the highest strong correlations.This relationship has been also proven stable through additional statistical acceptance tests.Thus the provided relative uncertainty measure can be used as a probable error estimate in the clustering as well.Besides,it is the only method known that can exclusively identify data points in dubiety and is adjustable according to the required confidence level. 展开更多
关键词 Machine learning data science clustering validity fuzzy clustering UNCERTAINTY intelligent systems data analytics
下载PDF
An Efficient Agglomerative Clustering Algorithm for Web Navigation Pattern Identification
3
作者 A. Anitha 《Circuits and Systems》 2016年第9期2349-2356,共9页
Web log mining is analysis of web log files with web page sequences. Discovering user access patterns from web access are necessary for building adaptive web servers, to improve e-commerce, to carry out cross-marketin... Web log mining is analysis of web log files with web page sequences. Discovering user access patterns from web access are necessary for building adaptive web servers, to improve e-commerce, to carry out cross-marketing, for web personalization, to predict web access sequence etc. In this paper, a new agglomerative clustering technique is proposed to identify users with similar interest, and to determine the motivation for visiting a website. Using this approach, web usage mining is done through different stages namely data cleaning, preprocessing, pattern discovery and pattern analysis. Results are given to explain how this approach produces tight usage clusters than the existing web usage mining techniques. Rather than traditional distance based clustering, the similarity measure is considered during clustering process in order to reduce computational complexity. This paper also deals with the problem of assessing the quality of user session clusters and cluster validity is measured by using statistical test, which measures the distances of clusters distributions to infer their dissimilarity and distinguish level. Using such statistical measures, it is proved that cluster accuracy is improved to the extent of 0.83, over existing k-means clustering with validity measure 0.26, FCM (Fuzzy C Means) clustering with validity measure 0.56. Rough set based clustering with validity measure 0.54 Generation of dense clusters is essential for finding interesting patterns needed for further mining and analysis. 展开更多
关键词 Agglomerative clustering Similarity Measure cluster Validity Clickstream Sequence TRANSACTION
下载PDF
Evolutionary Multi-Tasking Optimization for High-Efficiency Time Series Data Clustering
4
作者 Rui Wang Wenhua Li +2 位作者 Kaili Shen Tao Zhang Xiangke Liao 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2024年第2期343-355,共13页
Time series clustering is a challenging problem due to the large-volume,high-dimensional,and warping characteristics of time series data.Traditional clustering methods often use a single criterion or distance measure,... Time series clustering is a challenging problem due to the large-volume,high-dimensional,and warping characteristics of time series data.Traditional clustering methods often use a single criterion or distance measure,which may not capture all the features of the data.This paper proposes a novel method for time series clustering based on evolutionary multi-tasking optimization,termed i-MFEA,which uses an improved multifactorial evolutionary algorithm to optimize multiple clustering tasks simultaneously,each with a different validity index or distance measure.Therefore,i-MFEA can produce diverse and robust clustering solutions that satisfy various preferences of decision-makers.Experiments on two artificial datasets show that i-MFEA outperforms single-objective evolutionary algorithms and traditional clustering methods in terms of convergence speed and clustering quality.The paper also discusses how i-MFEA can address two long-standing issues in time series clustering:the choice of appropriate similarity measure and the number of clusters. 展开更多
关键词 time series clustering evolutionary multi-tasking multifactorial optimization clustering validity index distance measure
原文传递
Novel Cluster Validity Index for FCM Algorithm 被引量:6
5
作者 于剑 李翠霞 《Journal of Computer Science & Technology》 SCIE EI CSCD 2006年第1期137-140,共4页
How to determine an appropriate number of clusters is very important when implementing a specific clustering algorithm, like c-means, fuzzy c-means (FCM). In the literature, most cluster validity indices are origina... How to determine an appropriate number of clusters is very important when implementing a specific clustering algorithm, like c-means, fuzzy c-means (FCM). In the literature, most cluster validity indices are originated from partition or geometrical property of the data set. In this paper, the authors developed a novel cluster validity index for FCM, based on the optimality test of FCM. Unlike the previous cluster validity indices, this novel cluster validity index is inherent in FCM itself. Comparison experiments show that the stability index can be used as cluster validity index for the fuzzy c-means. 展开更多
关键词 cluster validity optimality test FCM
原文传递
Electric Load Clustering in Smart Grid:Methodologies,Applications,and Future Trends 被引量:4
6
作者 Caomingzhe Si Shenglan Xu +3 位作者 Can Wan Dawei Chen Wenkang Cui Junhua Zhao 《Journal of Modern Power Systems and Clean Energy》 SCIE EI CSCD 2021年第2期237-252,共16页
With the increasingly widespread of advanced metering infrastructure,electric load clustering is becoming more essential for its great potential in analytics of consumers’energy consumption patterns and preference th... With the increasingly widespread of advanced metering infrastructure,electric load clustering is becoming more essential for its great potential in analytics of consumers’energy consumption patterns and preference through data mining.Moreover,a variety of electric load clustering techniques have been put into practice to obtain the distribution of load data,observe the characteristics of load clusters,and classify the components of the total load.This can give rise to the development of related techniques and research in the smart grid,such as demand-side response.This paper summarizes the basic concepts and the general process in electric load clustering.Several similarity measurements and five major categories in electric load clustering are then comprehensively summarized along with their advantages and disadvantages.Afterwards,eight indices widely used to evaluate the validity of electric load clustering are described.Finally,vital applications are discussed thoroughly along with future trends including the tariff design,anomaly detection,load forecasting,data security and big data,etc. 展开更多
关键词 Electric load clustering similarity measurement clustering technique cluster validity indicator smart grid
原文传递
The upper bound of the optimal number of clusters in fuzzy clustering 被引量:6
7
作者 于剑 程乾生 《Science in China(Series F)》 2001年第2期119-125,共7页
The upper bound of the optimal number of clusters in clustering algorithm is studied in this paper. A new method is proposed to solve this issue. This method shows that the rule cmax≤n, which is popular in current pa... The upper bound of the optimal number of clusters in clustering algorithm is studied in this paper. A new method is proposed to solve this issue. This method shows that the rule cmax≤n, which is popular in current papers, is reasonable in some sense. The above conclusion is tested and analyzed by some typical examples in the literature, which demonstrates the validity of the new method. 展开更多
关键词 clustering algorithm cluster validity the optimal number of clusters UNCERTAINTY fuzzy clustering.
原文传递
A new cluster validity index using maximum cluster spread based compactness measure
8
作者 M.Arif Wani Romana Riyaz 《International Journal of Intelligent Computing and Cybernetics》 EI 2016年第2期179-204,共26页
Purpose-The most commonly used approaches for cluster validation are based on indices but the majority of the existing cluster validity indices do not work well on data sets of different complexities.The purpose of th... Purpose-The most commonly used approaches for cluster validation are based on indices but the majority of the existing cluster validity indices do not work well on data sets of different complexities.The purpose of this paper is to propose a new cluster validity index(ARSD index)that works well on all types of data sets.Design/methodology/approach-The authors introduce a new compactness measure that depicts the typical behaviour of a cluster where more points are located around the centre and lesser points towards the outer edge of the cluster.A novel penalty function is proposed for determining the distinctness measure of clusters.Random linear search-algorithm is employed to evaluate and compare the performance of the five commonly known validity indices and the proposed validity index.The values of the six indices are computed for all nc ranging from(nc_(min),nc_(max))to obtain the optimal number of clusters present in a data set.The data sets used in the experiments include shaped,Gaussian-like and real data sets.Findings-Through extensive experimental study,it is observed that the proposed validity index is found to be more consistent and reliable in indicating the correct number of clusters compared to other validity indices.This is experimentally demonstrated on 11 data sets where the proposed index has achieved better results.Originality/value-The originality of the research paper includes proposing a novel cluster validity index which is used to determine the optimal number of clusters present in data sets of different complexities. 展开更多
关键词 clusterING cluster analysis cluster validity Compactness measure Optimal number Distinctness measure
原文传递
New Results on PWARX Model Identification Based on Clustering Approach 被引量:1
9
作者 Zeineb Lassoued Kamel Abderrahim 《International Journal of Automation and computing》 EI CSCD 2014年第2期180-188,共9页
This paper deals with the problem of piecewise auto regressive systems with exogenous input(PWARX) model identification based on clustering solution. This problem involves both the estimation of the parameters of the ... This paper deals with the problem of piecewise auto regressive systems with exogenous input(PWARX) model identification based on clustering solution. This problem involves both the estimation of the parameters of the affine sub-models and the hyper planes defining the partitions of the state-input regression. The existing identification methods present three main drawbacks which limit its effectiveness. First, most of them may converge to local minima in the case of poor initializations because they are based on the optimization using nonlinear criteria. Second, they use simple and ineffective techniques to remove outliers. Third, most of them assume that the number of sub-models is known a priori. To overcome these drawbacks, we suggest the use of the density-based spatial clustering of applications with noise(DBSCAN) algorithm. The results presented in this paper illustrate the performance of our methods in comparison with the existing approach. An application of the developed approach to an olive oil esterification reactor is also proposed in order to validate the simulation results. 展开更多
关键词 Hybrid systems piecewise autoregressive systems with exogenous input(PWARX) model clustering identification density-based spatial clustering of applications with noise(DBSCAN) clustering technique experimental validation.
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部