期刊文献+

基于信息粒数据重构的多关系数据聚类仿真 被引量:1

Clustering Simulation of Multi-Relational Data Based on Information Granular Data Reconstruction
下载PDF
导出
摘要 在大数据挖掘过程中,海量数据之间存在着交错复杂的非单一关系,如何准确有效的对多关系数据进行聚类处理,是大数据分析领域亟待解决的难题。传统方法在多关系数据聚类时通常转化为单关系处理,导致出现数据维度增加和数据稀疏等问题,为此提出信息粒数据重构聚类方法。方法首先利用数据的邻域构造信息粒,根据距离关系得到信息粒中数据的相似度,并对信息粒数据采取重构操作,改善数据聚类的细粒度与柔和度;然后基于重构数据与隶属程度修复数据集中的非完整数据,考虑到简化算法和统一约束,引入增益项,并把信息粒限定条件融入目标计算里,从而得到只有隶属程度限定的聚类模型;最后优化聚类过程中的重构数据与隶属程度,保证重构数据的精准,抑制聚类偏差。实验结果表明,信息粒数据重构方法提高了多关系数据的聚类精度,对于不同类型的复杂数据集具有良好的普适性。 In the process of big data mining,there is a complex and non-single relationship between massive data.How to cluster multi relation data accurately and effectively is a problem to be solved in the field of big data analysis.The traditional method usually turns into single relation processing when clustering multi relation data,which leads to the problems of increasing data dimension and data sparsity.Therefore,the information granular data reconstruction clustering method is proposed in the paper.Firstly,the neighborhood of the data was used to construct the information grain,and the similarity of the data in the information grain was obtained according to the distance relationship.The information granularity data were reconstructed to improve the granularity and softness of data clustering.Then,based on the reconstruction data and membership degree,the incomplete data in the data set were repaired.Considering the simplified algorithm and unified constraints,the gain term was introduced,and the information particle restriction condition was incorporated into the target calculation,so that the clustering model with only limited membership degree was obtained.Finally,we optimized the reconstruction data and membership degree in the clustering process,ensured the accuracy of the reconstruction data,and restrained the clustering deviation.The experimental results show that the method improves the clustering accuracy of multi relation data and has good universality for different types of complex data sets.
作者 谭翔纬 程学军 TAN Xiang-wei;CHENG Xue-jun(Guangzhou University,Institute of Software,Guangzhou Guangdong 510990,China;Luohe Institute of industry,Henan University of Technology,Luohe Henan 450001,China)
出处 《计算机仿真》 北大核心 2020年第6期406-409,461,共5页 Computer Simulation
关键词 信息粒 数据重构 多关系数据聚类 隶属程度 Information granule Data reconstruction Multi-relational data clustering Subjection degree
  • 相关文献

参考文献7

二级参考文献44

  • 1程克非,张聪.基于特征加权的朴素贝叶斯分类器[J].计算机仿真,2006,23(10):92-94. 被引量:40
  • 2邓维斌,王国胤,王燕.基于Rough Set的加权朴素贝叶斯分类算法[J].计算机科学,2007,34(2):204-206. 被引量:43
  • 3蒙肖莲,杨毓.商业银行基于客户价值的客户识别模型研究[J].价值工程,2007,26(6):6-10. 被引量:5
  • 4Ratanamahatana C A,Gunopulos D.Scaling up the Naive Bayesian Classifier:Using Decision Trees for Feature Selection[C]∥Proceedings of Workshop on Data Cleaning and Preprocessing,at IEEE International Conference on Data Mining.Maebashi,Japan,2002.
  • 5Ong H C,Khoo M Y,Saw S L.An Improvement on the Nave Bayes Classifier[C]∥International Conference on Information and Knowledge Management (2012).Singapore,2012:190-194.
  • 6Yin Xiao-xin,Han Jia-wei,Yang Jiong,et al.Efficient classification across multiple database relations:a CrossMine approach[J].IEEE Transactions on Knowledge and Data Engineering,2006,18(6):770-783.
  • 7Liu Hong-yan,Yin Xiao-xin,Han Jia-wei.An efficient multi-relational naive bayesian classifier based on semantic relationship graphs[C]∥Proceedings of the 4th international workshop on Multi-relational mining.Chicago,Illinois,2005:39-48.
  • 8Zhang H,Sheng Li.Learning Weighted Naive Bayes with accurate ranking[C]∥The 4th IEEE International Conference on Data Mining (ICDM04).Brighton:IEEE Computer Society,2004:567-570.
  • 9Guo Bao-en,Liu Hai-tao.Assigning Hybrid-Weight for Feature Attribute in Nave Bayesian Classifier[C]∥2012 International Proceedings of Computer Science and Information Technology(2012).Hongkong,2012:86-90.
  • 10Frank E,Hall M,Pfahringer B.Locally weighted naive Bayes[C]∥The 19th Conference in Uncertainty in Artificial Intelligence(2003).Acapulco,Mexico:Morgan Kaufmann,2003:249-256.

共引文献24

同被引文献5

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部