基于特征变换的Tri-Training算法

Tri-Training Algorithm Based on Feature Transformation

下载PDF

导出

摘要提出一种基于特征变换的Tri Training算法。通过特征变换将已标记实例集映射到新空间,得到有差异的训练集,从而构建准确又存在差异的基分类器,避免自助采样不能充分利用全部已标记实例集的问题。为充分利用数据类分布信息,设计基于Must link和Cannot link约束集合的特征变换方法(TMC),并将其用于基于特征变换的Tri Training算法中。在UCI数据集上的实验结果表明,在不同未标记率下,与经典的Co Training、Tri Trainng算法相比,基于特征变换的Tri Training算法可在多数数据集上得到更高的准确率。此外,与Tri LDA和Tri CP算法相比,基于TMC的Tri Training算法具有更好的泛化性能。 This paper proposes a new Tri-Training algorithm based on feature transformation. It employs feature transformation to transform labeled instances into new space to obtain new training sets, and constructs accurate and diverse classifiers. In this way, it avoids the weakness of bootstrap sampling which only adopts training data samples to train base classifiers. In order to make full use of the data distribution information, this paper introduces a new transformation method called Transformation Based on Must-link Constrains and Cannot-link Constrains（TMC）, and uses it to this new Tri-Training algorithm. Experimental results on UCI data sets show that, in different unlabeled rate, compared with the classic Co-Training and Tri-Training algorithm, the proposed algorithm based on feature transformation gets the highest accuracy in most data sets. In addition, compared with the Tri-LDA and Tri-CP algorithm, the Tri-Training algorithm based on TMC has better generalization ability.

作者赵文亮郭华平范明

机构地区郑州大学信息工程学院

出处《计算机工程》 CAS CSCD 2014年第5期183-187,191,共6页 Computer Engineering

关键词特征变换已标记实例集差异自助抽样泛化能力 feature transformation labeled instances set difference bootstrap sampling generalization ability

分类号 TP18 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

1陈凯.基于回归问题的选择性集成算法[J].计算机工程,2009,35(21):17-19. 被引量：2
2王雷,杨思春.基于改进Tri-training算法的中文问句分类[J].安徽工业大学学报（自然科学版）,2016,33(2):172-176. 被引量：1
3张雁,林英,吕丹桔.基于Tri-Training算法的数据编辑技术[J].计算机与数字工程,2013,41(10):1583-1585.
4张雁,吕丹桔,吴保国.基于Tri-Training半监督分类算法的研究[J].计算机技术与发展,2013,23(7):77-79. 被引量：9
5张雁,吴保国,吕丹桔,林英.基于Tri-training的主动学习算法[J].计算机工程,2014,40(6):215-218. 被引量：3
6赵建华,李伟华.一种协同半监督分类算法Co-S3OM[J].计算机应用研究,2013,30(11):3237-3239. 被引量：12
7李心磊,杨思春,彭月娥.Tri-training算法中分类器组合的改进[J].苏州科技学院学报（自然科学版）,2014,31(2):52-56. 被引量：4
8琚春华,殷贤君,许翀寰.结合自助抽样的动态数据流贝叶斯分类算法[J].计算机工程与应用,2011,47(8):118-121. 被引量：3
9彭雅琴,宫宁生.一种自适应的Tri-Training半监督算法[J].计算机系统应用,2016,25(8):130-134. 被引量：1
10何永洁,陈孝威.基于ASIFT的低重叠度图像拼接研究[J].计算机工程与设计,2013,34(2):561-565. 被引量：5

计算机工程

2014年第5期

浏览历史

内容加载中请稍等...

基于特征变换的Tri-Training算法

相关作者

相关机构

相关主题

浏览历史