期刊文献+

基于粗糙集和决策树的数据挖掘方法 被引量:15

Approach to Data Mining Based on Rough Sets and Decision Tree
下载PDF
导出
摘要 从粗糙集和决策树两种方法具有的优势互补性出发,提出了一种基于粗糙集和决策树相结合的数据挖掘新方法·以胶合板缺陷检测数据分析为应用对象,利用粗糙集理论对胶合板数据库中的特征信息进行缺陷识别·利用谱系聚类重心距离法对数据进行离散化处理,采用粗糙集进行属性约简,得到低维样本数据,最后用决策树方法产生决策规则·实验证明,这种数据挖掘方法保留了原始数据的内部特点,加快了获取知识的进程,提高了模型的分类准确率,增强了规则的可解释性,取得了满意的研究结果· Rough sets and decision tree have complementary characteristics. A new approach to data mining is thus proposed combining both advantages. Taking the detected data of plywood defects as example, the defects are recognized as follow using eigen information in the database of plywood on the basis of rough sets theory. Decentralizes the data in the database by the algorithm of center-of-gravity distance of pedigree cluster, then reduces the conditional attribute by use of rough sets to obtain the low dimensional sample data. Decision rules are finally obtained by decision tree. The experimental result shows that, in this way, the original characteristics of data remained unchanged, and the knowledge acquisition process become speedier ,so as to improve the classification accuracy of model and interpretability of rules. Comparing with other the methods, such as rough sets or preelsion-varied rough sets, the method is proved more mtisfactory.
出处 《东北大学学报(自然科学版)》 EI CAS CSCD 北大核心 2006年第5期481-484,共4页 Journal of Northeastern University(Natural Science)
基金 科技部国际合作重点项目(2003DF020009)
关键词 粗糙集 决策树 数据离散化 数据挖掘 谱系聚类 属性约简 rough sets decision tree data decentralization data mining pedigree cluster attribute reduction
  • 相关文献

参考文献9

  • 1吴成东,许可,王欣,韩中华.软计算方法在数据挖掘中的应用[J].计算机测量与控制,2005,13(3):294-297. 被引量:8
  • 2李永敏,朱善君,陈湘晖,张岱崎,韩曾晋.基于粗糙集理论的数据挖掘模型[J].清华大学学报(自然科学版),1999,39(1):110-113. 被引量:109
  • 3Pawlak Z.Vagueness and uncertainty-a rough set perspective[J].Computational Intelligence,1995,10(2):227-232.
  • 4Pawlak Z.Rough sets[J].Communications of ACM,1995,38(11):89-95.
  • 5王珏,苗夺谦.Analysis on Attribute Reduction Strategies of Rough Set[J].Journal of Computer Science & Technology,1998,13(2):189-192. 被引量:47
  • 6韩家炜 Michelin K.数据挖掘:概念与技术[M].北京:机械工业出版社,2001..
  • 7Berson A,Smith S.Data warehousing data mining & OLAP[M].London:Mcraw-HillBook,1999.272-320.
  • 8Huber H,Mcmilin C,Mckinney J.Lumber defect detection abilities of furniture rough mill employees[J].Forest Products Journal,1985,35(11):79-82.
  • 9Polzleitner W,Schwingshakl G.Real-time surface grading of profiled wooden boards[J].Industrial Metrology,1992,6(2):283-298.

二级参考文献20

  • 1苗夺谦,Technical Report Institute of Automation Chinese Academy of Sciences,1996年
  • 2Hu X H,Int J Comput Intell,1995年,11卷,323页
  • 3Pedrycz W. Fuzzy set technology in knowledge discovery[J].Fuzzy Sets System, 98, 279-290.
  • 4Pedrycx W. Conditional fuzzy c-means, pattern recognition Lett [J]. 2000, 17: 625-632.
  • 5Mazlack L J. Softly focusing on data [A]. Proc. NAFIPS99[C].New York, 1999, 700-704.
  • 6Wei Q ,Chen G. Mining generalized association rules with fuzzy taxonomic structures[A]. Proc. NAFIPS99[C]. New York,1999, 477-481.
  • 7Au, Chan. An effective algorithm for discovering fuzzy rules in relational databases[A]. Proc. IEEE Int. Conf. Fuzzy Syst. FUZZ IEEE 98 [C], 1998, 1314-1319.
  • 8Kacprzyk J, Zadrozny S. Data mining via linguistic summaries of data: An interactive approach [R]. Proc. IIZUKA 99, Fukuoka,Japan, Oct. 1999, 668-671.
  • 9Chiang D A, Chow L R, Wang Y E. Mining time series data by a fuzzy linguistic summary system[J].Fuzzy Sets System, 2002,112: 419-432.
  • 10Kohonen T, Kaski S, etal. Self organization of a massive document election[J].IEEE Trans. Neural Networks, 2002, 11, 574-585.

共引文献220

同被引文献132

引证文献15

二级引证文献43

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部