摘要
目前采用的区间划分的关联分类法不能有效地体现出数据的实际分布情况 ,并存在划分边界过硬的缺点 .文中首先讨论了通过挖掘最长的语言值关联规则构建分类系统的方法并分析了其不足 ,然后给出了通过挖掘短的语言值关联规则构建分类系统的方法 .实验表明 ,基于语言值关联规则的分类系统能在精度上优于 2种流行的分类方法 :C4 .5和关联分类法 .
A partition of intervals method is adopted in current classification based on associations (CBA), but this method cannot reflect the actual distribution of data and exists the problem of sharp boundary problem. The classification system based on the longest association rules with linguistic terms is discussed, and the shortcoming of this classification system is analyzed. Then, the classification system based on the short association rules with linguistic terms is presented. The example shows that the accuracy of the classification system based on the association rules with linguistic terms is better than two popular classification methods: C4.5 and CBA.
基金
YoungScientist sFundoftheNationalNaturalScienceFoundationofChina (No .60 3 0 3 0 2 4) ,theStateKeyBasicResearchandPlanProgram (973Program) (No .2 0 0 2CB3 12 0 0 0 ),SpecializedResearchFundfortheDoctoralProgramofHigherEducation,theOpeningFoundationo