摘要
研究了基于纠错输出编码实现多类代价敏感分类的方法,提出了一种新的将多类代价敏感分类问题分解为多个二类代价敏感分类问题的框架。为获得其中每个二类代价敏感基分类器的二类代价矩阵,提出了利用已知多类代价矩阵计算误分类代价的期望值的方法,给出了计算二类代价矩阵的通用计算公式。为验证所提方法的有效性,在人工和UCI数据集上将其与现有方法进行了比较,实验结果表明所提方法具有相似甚至更好的性能。
Approach of multiclass cost-sensitive classification based on error correcting output codes is studied in this paper,and a new framework to decompose the complex multiclass cost-sensitive classification problem into a series of binary cost-sensitive classification problems is proposed.In order to obtain the binary cost matrix of each binary cost-sensitive base classifier,a method of computing the expected misclassification costs from the given multiclass cost matrix is proposed,and the general formula for computing the binary costs are given.Experimental results on artificial datasets and UCI datasets show that the proposed method has similar or even better performance in comparison with the existing methods.
作者
吴崇明
王晓丹
薛爱军
来杰
WU Chong-ming;WANG Xiao-dan;XUE Ai-Jun;LAI Jie(Business School,XiJing University,Xi’an 710123,China;College of Air and Missile Defense,Air force Engineering University,Xi’an 710051,China)
出处
《计算机科学》
CSCD
北大核心
2020年第S01期89-94,共6页
Computer Science
基金
国家自然科学基金(61876189,61273275,61703426)。
关键词
多类代价敏感分类
纠错输出编码
多类代价矩阵
二类代价矩阵
Multiclass cost-sensitive classification
Error correcting output codes
Multiclass cost matrix
Binary cost matrix