期刊文献+

一种分类挖掘算法及其应用 被引量:1

A Classification Algorithm and Its Application
下载PDF
导出
摘要 提出一种增量式混合型分类挖掘算法 ,将基于概率论的符号学习与神经网络学习相结合 ,能够对既包含离散属性又包含连续属性的多个概念进行有效的分类处理 ,且具有较强的增量挖掘能力 .该算法在法院决策支持系统中得到了运用 ,取得了较好的效果 . Classification is a main method of data mining. The purpose of the classification is to find the common specifications from the objects stored in the database, and then, use the schema to classify them. Artificial neutral network, decision tree and legacy algorithm are methods of classification. In the paper, an incremental compound classification algorithm is proposed. Artificial neutral network learning and symbol learning based on the theory of the probability are combined in the algorithm. The main idea of the algorithm is given below. First, the symbol learning algorithm is used to classify a set of training instances by the discrete attributes. When the instances that can't be classified accurately are encounted, FTART network is used to process these instances, and the incessant attributes is made use of in the algorithm to learn new schemas. The incremental learning of the algorithm is based on the component decision tree and the FTART network. When new instances are added, the algorithm only needs to make a single pass of the incremental learning, and the decision tree and the neutral network needn't be regenerated. The algorithm can correctly classify them by easily adjusting the existing structure. Another important advantage is that when a new input schema is added to a trained FTART network, the new network structure can easily be generated by adding some nodes to the second layer of the network, which is different to the traditional BP algorithm. Therefore, the efficiency and the speed of the learning are greatly raised. The main classes of the algorithm are class ROOT and class NODE. ROOT class includes the learning algorithm and the attributes which the decision tree stores in. NODE class is included in the ROOT class, and it is used to define the nodes of the decision tree. The algorithm is based on the two classes. The detail steps of the algorithm are discussed in the paper. The incremental compound classification algorithm can process multi concept collections that contain both discrete attributes and incessant attributes, and it has good performance in the incremental data mining. The algorithm has been tested in the Decision Support System for Court. In the decision support system, a new structure of data mining system based on the data warehouse is adopted to process the different data mining tasks. The incremental compound classification algorithm is a component of its data mining tool set. In the practice, the algorithm achieves good effect.
出处 《南京大学学报(自然科学版)》 CAS CSCD 北大核心 2001年第2期142-147,共6页 Journal of Nanjing University(Natural Science)
基金 国家自然科学基金! ( 60 0 0 3 0 1 0 )
关键词 数据挖掘 符号学习 人工神经网络 分类挖掘算法 增量挖掘能力 数据库 classification, data mining, symbol learning, artificial neutral network
  • 相关文献

参考文献2

二级参考文献2

共引文献20

同被引文献15

  • 1周蕾,朱巧明,李培峰.一种基于统计和规则的未登录词识别方法[J].南京大学学报(自然科学版),2005,41(z1):819-825. 被引量:1
  • 2陈晓云,胡运发.基于自适应加权的文本关联分类[J].小型微型计算机系统,2007,28(1):116-121. 被引量:6
  • 3Liu B, Hsu W, Ma Y. Integrating classification and association rule mining. Proceedings of ACM International Conference on Knowledge Discovery and Data Mining. New York: ACM, 1998, 80-86.
  • 4I.iW, HanJ, PeiJ. Accurate and efficient classification based on muhiple class-association rules. Proceedings of the 2001 IEEE International Conference on Data Mining. California, 2001, 369-376.
  • 5Aaiane O R, Antonie M. Classifying text documents by associating terms with text categories Proceedings of 13^th Australasian Database Conference. Melbournel Australian Computer Society, 2002, 24(2):215-222.
  • 6Gourab K, Md. Monirul I, Sirajum M. ACN An associative classifier with negative rules IEEE International Conference on Computational Science and Engineering, 2008, 369--325.
  • 7Cheng H, Yan X F, Han J W, et al. Direct discriminative pattern mining for effective classification. 2008 IEEE 24^th International Conference on Data Engineering, 2008,169- 178.
  • 8Elena B, Silvia C, Paolo G. A lazy approach to associative classification. IEEE Transactions on Knowledge and Data Engineering, 2008, 20 (2) :156-171.
  • 9商炳章 白清源.基于特征项权重改进的关联文本分类[J].计算机研究与发展,2008,45:252-256.
  • 10陈东亮 白清源.基于词频向量的关联文本分类[J].计算机研究与发展,2009,46:464-469.

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部