Rough set axiomatization is one aspect of rough set study to characterize rough set theory using dependable and minimal axiom groups. Thus, rough set theory can be studied by logic and axiom system methods. The classi...Rough set axiomatization is one aspect of rough set study to characterize rough set theory using dependable and minimal axiom groups. Thus, rough set theory can be studied by logic and axiom system methods. The classic rough set theory is based on equivalent relation, but rough set theory based on reflexive and transitive relation (called quasi-ordering) has wide applications in the real world. To characterize topological rough set theory, an axiom group named RT, consisting of 4 axioms, is proposed. It is proved that the axiom group reliability in characterizing rough set theory based on similar relation is reasonable. Simultaneously, the minimization of the axiom group, which requires that each axiom is an equation and each is independent, is proved. The axiom group is helpful for researching rough set theory by logic and axiom system methods.展开更多
The Circle algorithm was proposed for large datasets.The idea of the algorithm is to find a set of vertices that are close to each other and far from other vertices.This algorithm makes use of the connection between c...The Circle algorithm was proposed for large datasets.The idea of the algorithm is to find a set of vertices that are close to each other and far from other vertices.This algorithm makes use of the connection between clustering aggregation and the problem of correlation clustering.The best deterministic approximation algorithm was provided for the variation of the correlation of clustering problem,and showed how sampling can be used to scale the algorithms for large datasets.An extensive empirical evaluation was given for the usefulness of the problem and the solutions.The results show that this method achieves more than 50% reduction in the running time without sacrificing the quality of the clustering.展开更多
As to the fact that it is difficult to obtain analytical form of optimal sampling density and tracking performance of standard particle probability hypothesis density(P-PHD) filter would decline when clustering algori...As to the fact that it is difficult to obtain analytical form of optimal sampling density and tracking performance of standard particle probability hypothesis density(P-PHD) filter would decline when clustering algorithm is used to extract target states,a free clustering optimal P-PHD(FCO-P-PHD) filter is proposed.This method can lead to obtainment of analytical form of optimal sampling density of P-PHD filter and realization of optimal P-PHD filter without use of clustering algorithms in extraction target states.Besides,as sate extraction method in FCO-P-PHD filter is coupled with the process of obtaining analytical form for optimal sampling density,through decoupling process,a new single-sensor free clustering state extraction method is proposed.By combining this method with standard P-PHD filter,FC-P-PHD filter can be obtained,which significantly improves the tracking performance of P-PHD filter.In the end,the effectiveness of proposed algorithms and their advantages over other algorithms are validated through several simulation experiments.展开更多
Clustering categorical data, an integral part of data mining,has attracted much attention recently. In this paper, the authors formally define the categorical data clustering problem as an optimization problem from th...Clustering categorical data, an integral part of data mining,has attracted much attention recently. In this paper, the authors formally define the categorical data clustering problem as an optimization problem from the viewpoint of cluster ensemble, and apply cluster ensemble approach for clustering categorical data. Experimental results on real datasets show that better clustering accuracy can be obtained by comparing with existing categorical data clustering algorithms.展开更多
In a special case of type-2 fuzzy logic systems (FLS), i.e. geometric inteIval type-2 fuzzy logic systems (GIT-2FLS), the crisp output is obtained by computing the geometric center of footprint of uncertainly (FO...In a special case of type-2 fuzzy logic systems (FLS), i.e. geometric inteIval type-2 fuzzy logic systems (GIT-2FLS), the crisp output is obtained by computing the geometric center of footprint of uncertainly (FOU) without type-reduction, but the defuzzifying method acts against the corner concepts of type-2 fuzzy sets in some cases. In this paper, a PSO type-reduction method for GIT-2FLS based on the particle swarm optimization (PSO) algorithm is presented. With the PSO type-reduction, the inference principle of geometric interval FLS operating on the continuous domain is consistent with that of traditional interval type-2 FLS operating on the discrete domain. With comparative experiments, it is proved that the PSO type-reduction exhibits good performance, and is a satisfactory complement for the theory of GIT-2FLS.展开更多
Data analysis and automatic processing is often interpreted as knowledge acquisition. In many cases it is necessary to somehow classify data or find regularities in them. Results obtained in the search of regularities...Data analysis and automatic processing is often interpreted as knowledge acquisition. In many cases it is necessary to somehow classify data or find regularities in them. Results obtained in the search of regularities in intelligent data analyzing applications are mostly represented with the help of IF-THEN rules. With the help of these rules the following tasks are solved: prediction, classification, pattern recognition and others. Using different approaches---clustering algorithms, neural network methods, fuzzy rule processing methods--we can extract rules that in an understandable language characterize the data. This allows interpreting the data, finding relationships in the data and extracting new rules that characterize them. Knowledge acquisition in this paper is defined as the process of extracting knowledge from numerical data in the form of rules. Extraction of rules in this context is based on clustering methods K-means and fuzzy C-means. With the assistance of K-means, clustering algorithm rules are derived from trained neural networks. Fuzzy C-means is used in fuzzy rule based design method. Rule extraction methodology is demonstrated in the Fisher's Iris flower data set samples. The effectiveness of the extracted rules is evaluated. Clustering and rule extraction methodology can be widely used in evaluating and analyzing various economic and financial processes.展开更多
One of the most important problems of clustering is to define the number of classes. In fact, it is not easy to find an appropriate method to measure whether the cluster configuration is acceptable or not. In this pap...One of the most important problems of clustering is to define the number of classes. In fact, it is not easy to find an appropriate method to measure whether the cluster configuration is acceptable or not. In this paper we propose a possible and non-automatic solution considering different criteria of clustering and comparing their results. In this way robust structures of an analyzed dataset can be often caught (or established) and an optimal cluster configuration, which presents a meaningful association, may be defined. In particular, we also focus on the variables which may be used in cluster analysis. In fact, variables which contain little clustering information can cause misleading and not-robustness results. Therefore, three algorithms are employed in this study: K-means partitioning methods, Partitioning Around Medoids (PAM) and the Heuristic Identification of Noisy Variables (HINoV). The results are compared with robust methods ones.展开更多
The clustering of trajectories over huge volumes of streaming data has been rec- ognized as critical for many modem applica- tions. In this work, we propose a continuous clustering of trajectories of moving objects ov...The clustering of trajectories over huge volumes of streaming data has been rec- ognized as critical for many modem applica- tions. In this work, we propose a continuous clustering of trajectories of moving objects over high speed data streams, which updates online trajectory clusters on basis of incremental line- segment clustering. The proposed clustering algorithm obtains trajectory clusters efficiently and stores all closed trajectory clusters in a bi- tree index with efficient search capability. Next, we present two query processing methods by utilising three proposed pruning strategies to fast handle two continuous spatio-temporal queries, threshold-based trajectory clustering queries and threshold-based trajectory outlier detections. Finally, the comprehensive experi- mental studies demonstrate that our algorithm achieves excellent effectiveness and high effi- ciency for continuous clustering on both syn- thetic and real streaming data, and the propo- sed query processing methods utilise average 90% less time than the naive query methods.展开更多
A Single Image Super-Resolution (SISR) reconstruction method that uses clustered sparse representation and adaptive patch aggregation is proposed. First, we randomly extract image patch pairs from the training images,...A Single Image Super-Resolution (SISR) reconstruction method that uses clustered sparse representation and adaptive patch aggregation is proposed. First, we randomly extract image patch pairs from the training images, and divide these patch pairs into different groups by K-means clustering. Then, we learn an over-complete sub-dictionary pair offline from corresponding group patch pairs. For a given low-resolution patch, we adaptively select one sub-dictionary to reconstruct the high resolution patch online. In addition, non-local self-similarity and steering kernel regression constraints are integrated into patch aggregation to improve the quality of the recovered images. Experiments show that the proposed method is able to realize state-of-the-art performance in terms of both objective evaluation and visual perception.展开更多
Owing to the potential for intercell cochannel interference mitigation and significant spectral efficiency improvement, coordinating transmission techniques by multiple radio access points have recently attracted a lo...Owing to the potential for intercell cochannel interference mitigation and significant spectral efficiency improvement, coordinating transmission techniques by multiple radio access points have recently attracted a lot of attention. In this paper, the system structure and mathematical signal model based on clustered structure are presented for multipoint coordinating downlink transmission, the clustered supercell configurations with static/dynamic approaches are discussed, and then optimal precod- ing design is provided for an accepted level of scheduling complexity and reduced signaling over- head. Some simulation results are given to evaluate the performance of different cell-clustering approaches, and to show that a clustered supercell size of 7 is a reasonable choice for clustered coordination with the given transmit power and the reduced feedback.展开更多
OBJECTIVE: Apply spectral clustering to analyze the patterns of symptoms in patients with chronic gastritis(CG).METHODS: Based on 919 CG subjects, we applied mutual information feature selection to choose the positive...OBJECTIVE: Apply spectral clustering to analyze the patterns of symptoms in patients with chronic gastritis(CG).METHODS: Based on 919 CG subjects, we applied mutual information feature selection to choose the positively correlated symptoms with each pattern.Then, we used the Shi and Malik spectral clustering algorithm to select the top 20 correlated symptoms.RESULTS: We ascertained the results of six patterns.There were three categories for the pattern of accumulation of damp heat in the spleen-stomach(0.00332). There were six categories for the pattern of dampness obstructing the spleen-stomach(0.02466). There were two categories for the pattern of spleen-stomach Qi deficiency(0.013 89).There were three categories for the pattern of spleen-stomach deficiency cold(0.009 15). There were five categories for the pattern of liver-Qistagnation(0.01910).There were four categories for the pattern of stagnant heat in the liver-stomach(0.00585).CONCLUSION: Most of the spectral clustering results of the symptoms of CG patterns were in accordance with clinical experience and Traditional Chinese Medicine theory. Most categories suggested the nature and/or location of the disease.展开更多
Dual clustering performs object clustering in both spatial and non-spatial domains that cannot be dealt with well by traditional clustering methods.However,recent dual clustering research has often omitted spatial out...Dual clustering performs object clustering in both spatial and non-spatial domains that cannot be dealt with well by traditional clustering methods.However,recent dual clustering research has often omitted spatial outliers,subjectively determined the weights of hybrid distance measures,and produced diverse clustering results.In this study,we first redefined the dual clustering problem and related concepts to highlight the clustering criteria.We then presented a self-organizing dual clustering algorithm (SDC) based on the self-organizing feature map and certain spatial analysis operations,including the Voronoi diagram and polygon aggregation and amalgamation.The algorithm employs a hybrid distance measure that combines geometric distance and non-spatial similarity,while the clustering spectrum analysis helps to determine the weight of non-spatial similarity in the measure.A case study was conducted on a spatial database of urban land price samples in Wuhan,China.SDC detected spatial outliers and clustered the points into spatially connective and attributively homogenous sub-groups.In particular,SDC revealed zonal areas that describe the actual distribution of land prices but were not demonstrated by other methods.SDC reduced the subjectivity in dual clustering.展开更多
This paper studies the dynamics of the analytic family z + 1/z + b alld describes the topologyof the parameter space, structural stability and J-stability. The mapping class group of almostall maps of the above family...This paper studies the dynamics of the analytic family z + 1/z + b alld describes the topologyof the parameter space, structural stability and J-stability. The mapping class group of almostall maps of the above family is determined.展开更多
The author reviews some recent developments in Chern-Simons theory on a hyperbolic 3-manifold M with complex gauge group G. The author focuses on the case of G = SL(N, C) and M being a knot complement: M = S^3\ K. The...The author reviews some recent developments in Chern-Simons theory on a hyperbolic 3-manifold M with complex gauge group G. The author focuses on the case of G = SL(N, C) and M being a knot complement: M = S^3\ K. The main result presented in this note is the cluster partition function, a computational tool that uses cluster algebra techniques to evaluate the Chern-Simons path integral for G = SL(N, C). He also reviews various applications and open questions regarding the cluster partition function and some of its relation with string theory.展开更多
introduce a new kind of swarm intelligence algorithm, the Ant Colony Optimization (ACO) algorithm. Propose a clustering analysis model based on ACO, apply the model to recognition and diagnosis of operation state fo...introduce a new kind of swarm intelligence algorithm, the Ant Colony Optimization (ACO) algorithm. Propose a clustering analysis model based on ACO, apply the model to recognition and diagnosis of operation state for gearbox. Testing four kinds of gears and clustering some characteristic parameters of the gear vibration signal, the conclusion shows that this method can recognize running state with accuracy and all speed. It is a new method for fault recognition and diagnosis.展开更多
文摘Rough set axiomatization is one aspect of rough set study to characterize rough set theory using dependable and minimal axiom groups. Thus, rough set theory can be studied by logic and axiom system methods. The classic rough set theory is based on equivalent relation, but rough set theory based on reflexive and transitive relation (called quasi-ordering) has wide applications in the real world. To characterize topological rough set theory, an axiom group named RT, consisting of 4 axioms, is proposed. It is proved that the axiom group reliability in characterizing rough set theory based on similar relation is reasonable. Simultaneously, the minimization of the axiom group, which requires that each axiom is an equation and each is independent, is proved. The axiom group is helpful for researching rough set theory by logic and axiom system methods.
基金Projects(60873265,60903222) supported by the National Natural Science Foundation of China Project(IRT0661) supported by the Program for Changjiang Scholars and Innovative Research Team in University of China
文摘The Circle algorithm was proposed for large datasets.The idea of the algorithm is to find a set of vertices that are close to each other and far from other vertices.This algorithm makes use of the connection between clustering aggregation and the problem of correlation clustering.The best deterministic approximation algorithm was provided for the variation of the correlation of clustering problem,and showed how sampling can be used to scale the algorithms for large datasets.An extensive empirical evaluation was given for the usefulness of the problem and the solutions.The results show that this method achieves more than 50% reduction in the running time without sacrificing the quality of the clustering.
文摘As to the fact that it is difficult to obtain analytical form of optimal sampling density and tracking performance of standard particle probability hypothesis density(P-PHD) filter would decline when clustering algorithm is used to extract target states,a free clustering optimal P-PHD(FCO-P-PHD) filter is proposed.This method can lead to obtainment of analytical form of optimal sampling density of P-PHD filter and realization of optimal P-PHD filter without use of clustering algorithms in extraction target states.Besides,as sate extraction method in FCO-P-PHD filter is coupled with the process of obtaining analytical form for optimal sampling density,through decoupling process,a new single-sensor free clustering state extraction method is proposed.By combining this method with standard P-PHD filter,FC-P-PHD filter can be obtained,which significantly improves the tracking performance of P-PHD filter.In the end,the effectiveness of proposed algorithms and their advantages over other algorithms are validated through several simulation experiments.
文摘Clustering categorical data, an integral part of data mining,has attracted much attention recently. In this paper, the authors formally define the categorical data clustering problem as an optimization problem from the viewpoint of cluster ensemble, and apply cluster ensemble approach for clustering categorical data. Experimental results on real datasets show that better clustering accuracy can be obtained by comparing with existing categorical data clustering algorithms.
基金Sponsored by the National Hi-Tech Program of China(Grant No. 2005AA420050)the National Key Technology R&D Program of China(Grant No.2006BAD10A0401, 2006BAH02A01)
文摘In a special case of type-2 fuzzy logic systems (FLS), i.e. geometric inteIval type-2 fuzzy logic systems (GIT-2FLS), the crisp output is obtained by computing the geometric center of footprint of uncertainly (FOU) without type-reduction, but the defuzzifying method acts against the corner concepts of type-2 fuzzy sets in some cases. In this paper, a PSO type-reduction method for GIT-2FLS based on the particle swarm optimization (PSO) algorithm is presented. With the PSO type-reduction, the inference principle of geometric interval FLS operating on the continuous domain is consistent with that of traditional interval type-2 FLS operating on the discrete domain. With comparative experiments, it is proved that the PSO type-reduction exhibits good performance, and is a satisfactory complement for the theory of GIT-2FLS.
文摘Data analysis and automatic processing is often interpreted as knowledge acquisition. In many cases it is necessary to somehow classify data or find regularities in them. Results obtained in the search of regularities in intelligent data analyzing applications are mostly represented with the help of IF-THEN rules. With the help of these rules the following tasks are solved: prediction, classification, pattern recognition and others. Using different approaches---clustering algorithms, neural network methods, fuzzy rule processing methods--we can extract rules that in an understandable language characterize the data. This allows interpreting the data, finding relationships in the data and extracting new rules that characterize them. Knowledge acquisition in this paper is defined as the process of extracting knowledge from numerical data in the form of rules. Extraction of rules in this context is based on clustering methods K-means and fuzzy C-means. With the assistance of K-means, clustering algorithm rules are derived from trained neural networks. Fuzzy C-means is used in fuzzy rule based design method. Rule extraction methodology is demonstrated in the Fisher's Iris flower data set samples. The effectiveness of the extracted rules is evaluated. Clustering and rule extraction methodology can be widely used in evaluating and analyzing various economic and financial processes.
文摘One of the most important problems of clustering is to define the number of classes. In fact, it is not easy to find an appropriate method to measure whether the cluster configuration is acceptable or not. In this paper we propose a possible and non-automatic solution considering different criteria of clustering and comparing their results. In this way robust structures of an analyzed dataset can be often caught (or established) and an optimal cluster configuration, which presents a meaningful association, may be defined. In particular, we also focus on the variables which may be used in cluster analysis. In fact, variables which contain little clustering information can cause misleading and not-robustness results. Therefore, three algorithms are employed in this study: K-means partitioning methods, Partitioning Around Medoids (PAM) and the Heuristic Identification of Noisy Variables (HINoV). The results are compared with robust methods ones.
基金supported by the National Natural Science Foundation of China under Grants No.61172049,No.61003251the National High Technology Research and Development Program of China(863 Program)under Grant No.2011AA040101the Doctoral Fund of Ministry of Education of Chinaunder Grant No.20100006110015
文摘The clustering of trajectories over huge volumes of streaming data has been rec- ognized as critical for many modem applica- tions. In this work, we propose a continuous clustering of trajectories of moving objects over high speed data streams, which updates online trajectory clusters on basis of incremental line- segment clustering. The proposed clustering algorithm obtains trajectory clusters efficiently and stores all closed trajectory clusters in a bi- tree index with efficient search capability. Next, we present two query processing methods by utilising three proposed pruning strategies to fast handle two continuous spatio-temporal queries, threshold-based trajectory clustering queries and threshold-based trajectory outlier detections. Finally, the comprehensive experi- mental studies demonstrate that our algorithm achieves excellent effectiveness and high effi- ciency for continuous clustering on both syn- thetic and real streaming data, and the propo- sed query processing methods utilise average 90% less time than the naive query methods.
基金partially supported by the National Natural Science Foundation of China under Grants No. 61071146, No. 61171165the Natural Science Foundation of Jiangsu Province under Grant No. BK2010488+1 种基金sponsored by Qing Lan Project, Project 333 "The Six Top Talents" of Jiangsu Province
文摘A Single Image Super-Resolution (SISR) reconstruction method that uses clustered sparse representation and adaptive patch aggregation is proposed. First, we randomly extract image patch pairs from the training images, and divide these patch pairs into different groups by K-means clustering. Then, we learn an over-complete sub-dictionary pair offline from corresponding group patch pairs. For a given low-resolution patch, we adaptively select one sub-dictionary to reconstruct the high resolution patch online. In addition, non-local self-similarity and steering kernel regression constraints are integrated into patch aggregation to improve the quality of the recovered images. Experiments show that the proposed method is able to realize state-of-the-art performance in terms of both objective evaluation and visual perception.
文摘Owing to the potential for intercell cochannel interference mitigation and significant spectral efficiency improvement, coordinating transmission techniques by multiple radio access points have recently attracted a lot of attention. In this paper, the system structure and mathematical signal model based on clustered structure are presented for multipoint coordinating downlink transmission, the clustered supercell configurations with static/dynamic approaches are discussed, and then optimal precod- ing design is provided for an accepted level of scheduling complexity and reduced signaling over- head. Some simulation results are given to evaluate the performance of different cell-clustering approaches, and to show that a clustered supercell size of 7 is a reasonable choice for clustered coordination with the given transmit power and the reduced feedback.
基金Supported by the National Natural Science Foundation of China[the Patterns Differentiation Mode of Main TCM Clinical Symptoms Based on Complex System Method(No.81270050)Information Extraction From TCM Inquiry and the Deducting Method of Patterns Differentiation Based on Feature Selection(No.30901897)+2 种基金Common Syndrome Diagnosis of Traditional Chinese Medicine Based on The Integration of Four Diagnosis Methods(No.81173199)]College Students' Scientific Innovation Foundation of Shanghai University of TCM[SHUTCMCXHDZ(2011)03]the Foundation for Training Talents of National Basic Scientific Research(No.J1103607)
文摘OBJECTIVE: Apply spectral clustering to analyze the patterns of symptoms in patients with chronic gastritis(CG).METHODS: Based on 919 CG subjects, we applied mutual information feature selection to choose the positively correlated symptoms with each pattern.Then, we used the Shi and Malik spectral clustering algorithm to select the top 20 correlated symptoms.RESULTS: We ascertained the results of six patterns.There were three categories for the pattern of accumulation of damp heat in the spleen-stomach(0.00332). There were six categories for the pattern of dampness obstructing the spleen-stomach(0.02466). There were two categories for the pattern of spleen-stomach Qi deficiency(0.013 89).There were three categories for the pattern of spleen-stomach deficiency cold(0.009 15). There were five categories for the pattern of liver-Qistagnation(0.01910).There were four categories for the pattern of stagnant heat in the liver-stomach(0.00585).CONCLUSION: Most of the spectral clustering results of the symptoms of CG patterns were in accordance with clinical experience and Traditional Chinese Medicine theory. Most categories suggested the nature and/or location of the disease.
基金supported by the National Natural Science Foundation of China(Grant No.40901188)the Key Laboratory of Geo-informatics of the State Bureau of Surveying and Mapping(Grant No.200906)the Fundamental Research Funds for the Central Universities(Grant No.4082002)
文摘Dual clustering performs object clustering in both spatial and non-spatial domains that cannot be dealt with well by traditional clustering methods.However,recent dual clustering research has often omitted spatial outliers,subjectively determined the weights of hybrid distance measures,and produced diverse clustering results.In this study,we first redefined the dual clustering problem and related concepts to highlight the clustering criteria.We then presented a self-organizing dual clustering algorithm (SDC) based on the self-organizing feature map and certain spatial analysis operations,including the Voronoi diagram and polygon aggregation and amalgamation.The algorithm employs a hybrid distance measure that combines geometric distance and non-spatial similarity,while the clustering spectrum analysis helps to determine the weight of non-spatial similarity in the measure.A case study was conducted on a spatial database of urban land price samples in Wuhan,China.SDC detected spatial outliers and clustered the points into spatially connective and attributively homogenous sub-groups.In particular,SDC revealed zonal areas that describe the actual distribution of land prices but were not demonstrated by other methods.SDC reduced the subjectivity in dual clustering.
文摘This paper studies the dynamics of the analytic family z + 1/z + b alld describes the topologyof the parameter space, structural stability and J-stability. The mapping class group of almostall maps of the above family is determined.
基金supported by the U.S.Department of Energy(No.DE-SC0009988)
文摘The author reviews some recent developments in Chern-Simons theory on a hyperbolic 3-manifold M with complex gauge group G. The author focuses on the case of G = SL(N, C) and M being a knot complement: M = S^3\ K. The main result presented in this note is the cluster partition function, a computational tool that uses cluster algebra techniques to evaluate the Chern-Simons path integral for G = SL(N, C). He also reviews various applications and open questions regarding the cluster partition function and some of its relation with string theory.
文摘introduce a new kind of swarm intelligence algorithm, the Ant Colony Optimization (ACO) algorithm. Propose a clustering analysis model based on ACO, apply the model to recognition and diagnosis of operation state for gearbox. Testing four kinds of gears and clustering some characteristic parameters of the gear vibration signal, the conclusion shows that this method can recognize running state with accuracy and all speed. It is a new method for fault recognition and diagnosis.