
dGridTopk-FCPM: A top-k spatial co-location pattern mining algorithm based on fuzzy theory and d-grids (cited by: 1)
Abstract: A spatial co-location pattern is a set of spatial features whose instances are frequently located near one another in space. Traditional co-location pattern mining uses a single distance threshold to define the spatial neighbor relationship, which ignores how differences in distance affect that relationship. In addition, the minimum prevalence threshold is difficult to set for users who lack domain knowledge of the data. To address these two problems, this paper proposes a method for calculating the neighborhood membership degree based on fuzzy theory and d-grids. The method avoids computing Euclidean distances and uses the d-grid to quickly find the maximal cliques that satisfy the fuzzy neighbor relationship. It is then combined with the Top-k idea to mine the k most prevalent spatial co-location patterns. Experimental results show that the method is more efficient and yields finer-grained results. A comparison of recall rates shows that the k most prevalent patterns obtained by this method are mostly the same as the k most prevalent patterns obtained by the traditional co-location pattern mining algorithm, which indicates that the proposed fuzzy measure and mining algorithm have considerable practical value.
Authors: LI Junyi (李钧毅); WANG Lizhen (王丽珍); CHEN Hongmei (陈红梅) (School of Information Science and Engineering, Yunnan University, Kunming 650500, China)
Source: Journal of Tsinghua University (Science and Technology) (《清华大学学报(自然科学版)》), indexed in CSCD and the Peking University Core Journals list, 2021, No. 9, pp. 943-952 (10 pages)
Funding: National Natural Science Foundation of China (61966036, 61662086); Yunnan Province Innovation Team Project (2018HC019).
Keywords: spatial data mining; spatial co-location pattern; Top-k; fuzzy theory; d-grid
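Illustrative sketch: the abstract contrasts a single hard distance threshold with a fuzzy neighbor relationship, and a user-set prevalence threshold with Top-k selection. The Python below is only a minimal sketch of those two generic ideas under stated assumptions. The linear membership function, the parameter names `d_near` and `d_far`, the helper names, and the prevalence scores are all hypothetical and are not taken from the paper. In particular, this sketch takes a distance as input, whereas the paper's d-grid method is described as avoiding Euclidean distance computation.

```python
import heapq


def fuzzy_neighbor_degree(dist, d_near, d_far):
    """Hypothetical linear membership (an assumption, not the paper's formula).

    Returns 1.0 for distances up to d_near, 0.0 from d_far onward,
    and decreases linearly in between.
    """
    if dist <= d_near:
        return 1.0
    if dist >= d_far:
        return 0.0
    return (d_far - dist) / (d_far - d_near)


def top_k_patterns(prevalence, k):
    """Return the k (pattern, score) pairs with the highest score.

    No minimum prevalence threshold is needed: the user only chooses k.
    """
    return heapq.nlargest(k, prevalence.items(), key=lambda item: item[1])


# Made-up prevalence scores for candidate patterns over features A, B, C.
scores = {
    ("A", "B"): 0.8,
    ("A", "C"): 0.35,
    ("B", "C"): 0.6,
    ("A", "B", "C"): 0.3,
}
top2 = top_k_patterns(scores, 2)
```

With a hard threshold, two instances 15 units apart would be either neighbors or not. With the assumed membership function and `d_near=10`, `d_far=20`, they are neighbors to a partial degree, so closer pairs contribute more than distant ones.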











