In clustering algorithms,the selection of neighbors significantly affects the quality of the final clustering results.While various neighbor relationships exist,such as K-nearest neighbors,natural neighbors,and shared...In clustering algorithms,the selection of neighbors significantly affects the quality of the final clustering results.While various neighbor relationships exist,such as K-nearest neighbors,natural neighbors,and shared neighbors,most neighbor relationships can only handle single structural relationships,and the identification accuracy is low for datasets with multiple structures.In life,people’s first instinct for complex things is to divide them into multiple parts to complete.Partitioning the dataset into more sub-graphs is a good idea approach to identifying complex structures.Taking inspiration from this,we propose a novel neighbor method:Shared Natural Neighbors(SNaN).To demonstrate the superiority of this neighbor method,we propose a shared natural neighbors-based hierarchical clustering algorithm for discovering arbitrary-shaped clusters(HC-SNaN).Our algorithm excels in identifying both spherical clusters and manifold clusters.Tested on synthetic datasets and real-world datasets,HC-SNaN demonstrates significant advantages over existing clustering algorithms,particularly when dealing with datasets containing arbitrary shapes.展开更多
The discovery of spatio-temporal clusters in complex spatio-temporal data-sets has been a challenging issue in the domain of spatio-temporal data mining and knowledge discovery.In this paper,a novel spatio-temporal cl...The discovery of spatio-temporal clusters in complex spatio-temporal data-sets has been a challenging issue in the domain of spatio-temporal data mining and knowledge discovery.In this paper,a novel spatio-temporal clustering method based on spatio-temporal shared nearest neighbors(STSNN)is proposed to detect spatio-temporal clusters of different sizes,shapes,and densities in spatiotemporal databases with a large amount of noise.The concepts of windowed distance and shared nearest neighbor are utilized to define a novel spatiotemporal density for a spatio-temporal entity with definite mathematical meanings.Then,the density-based clustering strategy is employed to uncover spatio-temporal clusters.The spatio-temporal clustering algorithm developed in this paper is easily implemented and less sensitive to density variation among spatio-temporal entities.Experiments are undertaken on several simulated datasets to demonstrate the effectiveness and advantage of the STSNN algorithm.Also,the real-world applications on two seismic databases show that the STSNN algorithm has the ability to uncover foreshocks and aftershocks effectively.展开更多
Spatiotemporal clustering is one of the most advanced research topics in geospatial data mining.It has been challenging to discover cluster features with different spatiotemporal densities in geographic information da...Spatiotemporal clustering is one of the most advanced research topics in geospatial data mining.It has been challenging to discover cluster features with different spatiotemporal densities in geographic information data set.This paper presents an effective density-based spatiotemporal clustering algorithm(DBSTC).First,we propose a method to measure the degree of similarity of a core point to the geometric center of its spatiotemporal reachable neighborhood,which can effectively solve the isolated noise point misclassification problem that exists in the shared nearest neighbor methods.Second,we propose an ordered reachable time window distribution algorithm to calculate the reachable time window for each spatiotemporal point in the data set to solve the problem of different clusters with different temporal densities.The effectiveness and advantages of the DBSTC algorithm are demonstrated in several simulated data sets.In addition,practical applications to seismic data sets demonstrate the capability of the DBSTC algorithm to uncover clusters of foreshocks and aftershocks and help to improve the understanding of the underlying mechanisms of dynamic spatiotemporal processes in digital earth.展开更多
基金This work was supported by Science and Technology Research Program of Chongqing Municipal Education Commission(KJZD-M202300502,KJQN201800539).
文摘In clustering algorithms,the selection of neighbors significantly affects the quality of the final clustering results.While various neighbor relationships exist,such as K-nearest neighbors,natural neighbors,and shared neighbors,most neighbor relationships can only handle single structural relationships,and the identification accuracy is low for datasets with multiple structures.In life,people’s first instinct for complex things is to divide them into multiple parts to complete.Partitioning the dataset into more sub-graphs is a good idea approach to identifying complex structures.Taking inspiration from this,we propose a novel neighbor method:Shared Natural Neighbors(SNaN).To demonstrate the superiority of this neighbor method,we propose a shared natural neighbors-based hierarchical clustering algorithm for discovering arbitrary-shaped clusters(HC-SNaN).Our algorithm excels in identifying both spherical clusters and manifold clusters.Tested on synthetic datasets and real-world datasets,HC-SNaN demonstrates significant advantages over existing clustering algorithms,particularly when dealing with datasets containing arbitrary shapes.
基金The work described was supported by the Major State Basic Research Development Program of China(973 Program),No.2012CB719906Program for New Century Excellent Talents in University(NCET),No.NCET-10-0831National Natural Science Foundation of China(NSFC),No.40871180.
文摘The discovery of spatio-temporal clusters in complex spatio-temporal data-sets has been a challenging issue in the domain of spatio-temporal data mining and knowledge discovery.In this paper,a novel spatio-temporal clustering method based on spatio-temporal shared nearest neighbors(STSNN)is proposed to detect spatio-temporal clusters of different sizes,shapes,and densities in spatiotemporal databases with a large amount of noise.The concepts of windowed distance and shared nearest neighbor are utilized to define a novel spatiotemporal density for a spatio-temporal entity with definite mathematical meanings.Then,the density-based clustering strategy is employed to uncover spatio-temporal clusters.The spatio-temporal clustering algorithm developed in this paper is easily implemented and less sensitive to density variation among spatio-temporal entities.Experiments are undertaken on several simulated datasets to demonstrate the effectiveness and advantage of the STSNN algorithm.Also,the real-world applications on two seismic databases show that the STSNN algorithm has the ability to uncover foreshocks and aftershocks effectively.
基金This work was supported by the National Natural Science Foundation of China[grant numbers 41671391,41471313]the Science and Technology Project of Zhejiang Province[grant numbers 2014C33G20,2013C33051]and Major Program of China High Resolution Earth Observation System[grant number 07-Y30B10-9001].
文摘Spatiotemporal clustering is one of the most advanced research topics in geospatial data mining.It has been challenging to discover cluster features with different spatiotemporal densities in geographic information data set.This paper presents an effective density-based spatiotemporal clustering algorithm(DBSTC).First,we propose a method to measure the degree of similarity of a core point to the geometric center of its spatiotemporal reachable neighborhood,which can effectively solve the isolated noise point misclassification problem that exists in the shared nearest neighbor methods.Second,we propose an ordered reachable time window distribution algorithm to calculate the reachable time window for each spatiotemporal point in the data set to solve the problem of different clusters with different temporal densities.The effectiveness and advantages of the DBSTC algorithm are demonstrated in several simulated data sets.In addition,practical applications to seismic data sets demonstrate the capability of the DBSTC algorithm to uncover clusters of foreshocks and aftershocks and help to improve the understanding of the underlying mechanisms of dynamic spatiotemporal processes in digital earth.