期刊文献+

NEW METHOD OF MINING INCOMPLETE DATA

NEW METHOD OF MINING INCOMPLETE DATA
下载PDF
导出
摘要 The data used in the process of knowledge discovery often includes noise and incomplete information. The boundaries of different classes of these data are blur and unobvious. When these data are clustered or classified, we often get the coverings instead of the partitions, and it usually makes our information system insecure. In this paper, optimal partitioning of incomplete data is researched. Firstly, the relationship of set cover and set partition is discussed, and the distance between set cover and set partition is defined. Secondly, the optimal partitioning of given cover is researched by the combing and parting method, acquiring the optimal partition from three different partitions set family is discussed. Finally, the corresponding optimal algorithm is given. The real wireless signals offten contain a lot of noise, and there are many errors in boundaries when these data is clustered based on the tradional method. In our experimant, the proposed method improves correct rate greatly, and the experimental results demonstrate the method's validity. The data used in the process of knowledge discovery often includes noise and incomplete information. The boundaries of different classes of these data are blur and unobvious. When these data are clustered or classified, we often get the coverings instead of the partitions, and it usually makes our information system insecure. In this paper, optimal partitioning of incomplete data is researched. Firstly, the relationship of set cover and set partition is discussed, and the distance between set cover and set partition is defined. Secondly, the optimal partitioning of given cover is researched by the combing and parting method, acquiring the optimal partition from three different partitions set family is discussed. Finally, the corresponding optimal algorithm is given. The real wireless signals offten contain a lot of noise, and there are many errors in boundaries when these data is clustered based on the tradional method. In our experimant, the proposed method improves correct rate greatly, and the experimental results demonstrate the method's validity.
出处 《Journal of Electronics(China)》 2013年第4期411-416,共6页 电子科学学刊(英文版)
基金 Supported by the National Natural Science Foundation of China (No. 61273302) partially by the Natural Science Foundation of Anhui Province (No. 1208085MF98, 1208085MF94)
关键词 CLUSTERING Incomplete Information PARTITION Data Mining 数据挖掘 信息系统 知识发现 数据集中 集合覆盖 最优算法 无线信号 覆盖物
  • 相关文献

参考文献8

  • 1N. C. Vinod and Dr. M. Punithavalli. Classification of incomplete data handling techniques-an overview. International Journal on Computer Science and En- gineering, 3(2011)1,340-344.
  • 2李长清,李克典,李进金.不完备信息系统确定性和集对联系度的粗集拓展模型[J].工程数学学报,2010,27(2):342-346. 被引量:3
  • 3徐怡,李龙澍.基于(α,λ)联系度容差关系的变精度粗糙集模型[J].自动化学报,2011,37(3):303-308. 被引量:13
  • 4Byron J. Gao, Martin Ester, and Jin-Yi Cai. The minimum consistent subset cover problem and its applications in data mining. Conference on Knowl- edge Discovery and Data Mining, 2007, San Jose, CA, USA. 310-319.
  • 5Ludmila Himmelspach, Daniel Hommers, and Stefan Conrad. Cluster tendency assessment for fuzzy clus- tering of incomplete data. Proceedings of Wuropean Society for Fuzzy Logic and Technology, Aix-Les- Bains, France, 2011, 290-297.
  • 6Matthias Templ, Andreas Alfons, and Peter Filzmoser Exploring incomplete data using visualization tech- niques. Advances in Data Analysis and Classification, 2011, 29-47.
  • 7Zhang Ling and Zhang Bo. Fuzzy tolerance quotient spaces and fuzzy subsets. Science China, Information Science, 41(2011)1, 1-11.
  • 8王伦文.聚类的粒度分析[J].计算机工程与应用,2006,42(5):29-31. 被引量:19

二级参考文献27

  • 1刘富春.变集对联系度的扩充粗糙集模型及其属性约简[J].计算机科学,2006,33(3):185-187. 被引量:7
  • 2周磊,舒兰.基于新集对分析的粗糙集模型(英文)[J].模糊系统与数学,2006,20(4):111-116. 被引量:10
  • 3Jain A K,Dubes R C.Algorithms for clustering[M].Englewood Cliffs: N J Prentice Hall, 1988.
  • 4Jain A K,Murry M N,Flynn P J.Data clustefing:A survey[J].ACM Comput Surve, 1999 ;31:264-323.
  • 5JiaweiHan.Data Mining Concepts and Techniques[M].China Machine Press. 2001-08 : 223 -263.
  • 6S L Chiu.Fuzzy Model Identification Based on Cluster Estimation[J].Journal of Intelligence and Fuzzy System,1994;(2):267-278.
  • 7Agrawal R,Gehrke J,Gunopolos D et al.Automatie subspace clustering of high dimensional data for data mining application[C].In:Hass L M,Tiwary A eds.Proceedings of the ACM SIGMOD International Conference on Management of Data Seattle:ACM Press ,1998 : 94-105.
  • 8Guha S,Rastogi R,Shim K.CURE:an efficient clustering algorithm for large databases[C].In:Hass LM ,Tiwary A eds.Procecdings of the ACM SIGMOD International Conference on Management of Data Seattle:ACM Press, 1998:73-84.
  • 9Alexandros N, Yannis T, Yannis M.C^2P : clustering based on closest pairs[C].In:Apers PMG,Atzeni P,Ceri S eds.Proceedings of the 27^th International conference on Very Large Data Bases.Roma:Morgan Kaufmann Publishers,2001:331-340.
  • 10T Y Lin,Xiaohua Hu,Eric Louie.A Fast Association Rule Algo fithm Based On Bitmap and Granular Computing[C].In:The IEEE International Conference on Fuzzy,2003:678-683.

共引文献31

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部