期刊文献+

基于可拓集的企业数据挖掘应用方案初探 被引量:12

Study on enterprise data mining solution based on extension set
下载PDF
导出
摘要 数据本身的质量差造成数据挖掘结论的可信度低已经成为影响数据挖掘应用的重要因素,针对不完备数据设计的清洗算法、容忍算法等都不能从根本上解决这个问题.通过深入分析这一矛盾现象的原因,对企业数据建立物元可拓集合,提出基于可拓方法的数据挖掘企业应用方案.以数据挖掘所需的完备数据集做为条件物元,发现数据质量差距,以事元“数据挖掘咨询”促使物元集可拓域的发展变换,推出以数据挖掘应用咨询带动数据质量改进的措施,从而解决了数据质量的矛盾问题,使数据质量不高的企业也可以实施数据挖掘项目,提高信息决策水平. Data mining needs high quality data while many enterprises have no good enough data to get a credible conclusion, which prevents its implementation. The reasons have been analysed based on Extension Tbeory, and a new method has been by using Matter Element Analysis and extension set, which solves the conflicting problem by data mining consulting. Its application in a web company shows that it has good practicality which can do data mining projects in low quality data enterprises and increase the decision levels.
出处 《哈尔滨工业大学学报》 EI CAS CSCD 北大核心 2006年第7期1124-1128,共5页 Journal of Harbin Institute of Technology
基金 国家自然科学基金委管理科学部与信息科学部跨学科重点(70531040) 国家自然科学基金(70501030)
关键词 可拓学 可拓集合 物元分析 数据挖掘 企业信息化 数据质量 数据挖掘咨询 Extenics extension set matter-element analysis data mining enterprise information system data quality data mining consulting
  • 相关文献

参考文献16

  • 1HAN J, MICHELINE K. Data Mining: Concepts and Techniques[ M ]. [ s. l. ] : Morgan Kaufmann, 2006.
  • 2KARGUPTA H, PARK B H, PITI'IE S, et al. Contributed articles on online, interactive, and anytime data mining: MobiMine: monitoring the stock market from a PDA [ J ]. ACM SIGKDD Explorations Newsletter,2002, 3 (2): 37-46.
  • 3CHEN Yongqiang, HU Leifang. Study on data mining application in CRM system based on insurance trade[ A]. Proceedings of the 7th International Conference on Electronic Commerce ICEC 05 [ C ]. [ s. l. ] : CM Press,2005. 839 - 841.
  • 4LARRY T, YU H, CHUNG Fulai, et al. Using emerging pattern based projected clustering and gene expression data for cancer detection [ A ]. Proceedings of the Second Conference on Asia - Pacific Bioinformatics -Volume 29 CRPIT 04 [ C ]. [ s. l. ]: Australian Computer Society, Inc, 2004. 75 - 84.
  • 5DASU T, VESONDER G T, WRIGHT J R. Data quality through knowledge engineering [ A ]. Conference on Knowledge Discovery in Data Archive, Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining [ C ]. Washington, D. C:[s.n. ], 2003. 705 -710.
  • 6JOHNSON T, DASU T. Data quality and data cleaning-An overview [ A ]. International Conference on Management of Data Archive, Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data[C]. San Diego:ACM Press, 2003. 681 -681.
  • 7PIERCE E M. A Progress Report from MIT Information Quality Conference [ EB/OL]. Http://www. Iqconference. org,2003.
  • 8朱如,李庆峰.数据质量管理与企业信息化建设[J].计算机时代,2005(6):31-33. 被引量:19
  • 9李斌,李蔚田.基于信息化的数据质量管理系统模式探析[J].平顶山工学院学报,2005,14(1):65-67. 被引量:3
  • 10KUBICA J, MOORE A. Probabilistic noise identification and data cleaning[ A]. Proceedings of the 3^rd IEEE International Conference on Data Mining [ C ]. FL: [ s.n. ] , 2003.

二级参考文献88

共引文献257

同被引文献179

引证文献12

二级引证文献69

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部