期刊文献+

基于特征效用参与率的空间高效用co-location模式挖掘方法 被引量:12

Mining Spatial High Utility Co-location Patterns Based on Feature Utility Ratio
下载PDF
导出
摘要 空间co-location模式是指其实例在空间邻域内频繁一起出现的空间特征子集.与传统的空间co-location模式挖掘不同,在空间高效用co-location模式挖掘中,不再将参与度(PI)作为有趣模式的度量指标,而是将效用值作为挖掘有趣模式的兴趣度量指标.现有的空间高效用co-location模式挖掘方法分为特征带效用和实例带效用两类.特征带效用的现有方法没有考虑不同特征效用之间的差异,挖掘的结果往往包含了许多不尽合理的"高效用"模式;而实例带效用的现有方法,则考虑了不同特征对模式效用的影响,但没有客观地度量这种影响.该文提出了一种确定特征在模式中的效用权重ω(fi,c)的方法,定义了更为合理的空间高效用co-location模式概念,设计了一个有效的挖掘算法.大量的实验表明提出的高效用co-location模式度量方法和相应的挖掘算法能够处理特征效用差异性和特征间的相互影响问题,能更有效地挖掘到空间高效用co-location模式. A spatial co-location pattern is a subset of spatial features which shows frequent association relationships based on the spatial neighborhood.Different from the participation index(PI)which is regarded as a measure of interests in traditional spatial co-location pattern mining,the utility of co-location pattern is considered as the measure of interests in the spatial high utility co-location pattern mining.The purpose of the spatial high utility co-location pattern mining is to compensate for the knowledge omission in traditional co-location pattern mining,and the high utility co-location can reflect the interactions between different spatial features or different spatial instances.It is noteworthy that there are lots of differences between the traditional high utility pattern mining and the spatial high utility co-location pattern mining.Firstly,due to the specificity of different spatial features,it is irrational to measure the utilities of different spatial instances using a unified standard.Second,although the high utility pattern mining technology in traditional databases is very mature,these techniques cannot be directly applied to spatial high utility co-location pattern mining,because the prevalence of spatial co-location patterns,which is completely different from the itemsets in transaction database,is measured by the cluster relations formed by the proximity relationships.So far,the existing methods of the spatial high utility co-location pattern mining can be classified into two classes:spatial features with utilities and spatial instances with utilities.The spatial features with utilities method considers the utilities of different features,which will weaken the value of the features that are highly participated but relatively low-valued in the co-location pattern.The interactions of the features in a co-location pattern are very important to evaluate a co-location,but the method of spatial features with utilities has not considered this important factor.Moreover,this method has also not considered the different utility measure standards of different spatial features,the utilities of spatial features are added directly.In the method of spatial instances with utilities,the utility participation ratio(UPR)is calculated and used to evaluate the features’ utility value in a co-location pattern.At the same time,it considers the influences between the different features.However,the method requires that the UPR of all features in a co-location pattern is greater than a specified utility threshold,which leads to some high utility co-location patterns are missed,so the method can not measure the spatial high utility co-location patterns objectively.In summary,the former does not consider the difference of different features’ utilities which lead to mine some unreasonable patterns,and the latter considers the influence of different features,but the different influence has not been measured objectively.This paper proposes an efficient method to measure the utility weightω(fi,c)of different features in a spatial co-location pattern,which can reflect the interaction of features in the co-location pattern more correctly,and define a more reasonable concept of spatial high utility co-location pattern.A basic mining algorithm and two efficient pruning algorithms are designed.Experimental results show that the method proposed in this paper can help users to mine the useful,reasonable and interesting spatial high utility co-location patterns.
作者 王晓璇 王丽珍 陈红梅 方圆 杨培忠 WANG Xiao-Xuan;WANG Li-Zhen;CHEN Hong-Mei;FANG Yuan YANG;Pei-Zhong(School of Information Science and Engineering , Yunnan University, Kunming 650504)
出处 《计算机学报》 EI CSCD 北大核心 2019年第8期1721-1738,共18页 Chinese Journal of Computers
基金 国家自然科学基金项目(61472346,61662086) 云南省自然科学基金项目(2016FA026,2015FB114) 云南省创新团队项目资助(20181tc019)~~
关键词 空间数据挖掘 空间co-location模式 高效用 效用权重 数据挖掘 spatial data mining spatial co-location pattern high utility utility weight data mining
  • 相关文献

参考文献1

二级参考文献13

  • 1Huang Y, Shekhar S, Xiong H. Discovering colocation patterns from spatial data sets: A general approach. IEEE Transactions on Knowledge and Data Engineering, 2004, 16(12) : 1472- 1485.
  • 2Yoo J S, Shekhar S. A partial join approach for mining colocation patterns//Proceedings of the ACM International Symposium on Advances in Geographic Information Systems (ACMGIS). Washington, USA, 2004:241 -249.
  • 3Yoo J S, Shekhar S, Celik M. A join less approach for co location pattern mining: A summary of resuhs//Proceedings of the IEEE International Conference on Data Mining (ICDM). Houston, USA, 2005:813 816.
  • 4Wang Li-Zhen, Bao Yu Zhen, l.u J, Yip J. A new join less approach for co-location pattern mining//Proceedings of the IEEE 8th International Conference on Computer and lnfor mation Technology (CIT 2008). Sydney, AustraLia, 2008 197-202.
  • 5Wang Li-Zhen, Zhou Li-Hua, Lu J. Yip J. An order clique based approach for mining maximal co locations. Information Sciences, 2009, 179(19): 3370 -3382.
  • 6Wang Li-Zhen, Chen Hong-Mei, Zhao Li-Hong et al. Efficiently mining co location rules on interval data//Proceedings of the 6th International Conference on Advanced Data Mining and Applications(ADMA 2010). Chongqing, China, 2010: 477-488.
  • 7Zadeh L. Fuzzy sets. Information and Control, 1965, 8(3) 338-353.
  • 8Altman D. Fuzzy set theoretic approaches for handling im precision in spatial analysis. International Journal of Geo granhical Information Science, 1994, 8(3): 271- 289.
  • 9Schneider M. Fuzzy topological predicates, their properties, and their integration into query languages//Proceedings of the ACM International Symposium on Advances in Geographic Information Systems (ACMGIS). New York, USA, 2001: 9-14.
  • 10Schneider M. Uncertainty management for spatial data in databases: Fuzzy spatial data types//Proceedings of the International Symposium on Advauces in Spatial Databases. Berlin, Germany, 1999:330 351.

共引文献25

同被引文献33

引证文献12

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部