多示例学习的包层次覆盖k近邻算法

Multi-instance Learning Bag-level Covering-kNN Algorithm

下载PDF

导出

摘要多示例学习是一种新型的机器学习框架,正包中大量的噪声使多示例数据集具有很大的歧义性.为了排除多示例数据集正包中大量的假正例,提高分类精度,结合邻域覆盖算法,提出一个新的多示例包层次覆盖k近邻算法.覆盖算法的学习结果是一系列的球形邻域,在每一个球形邻域中只含有同类样本,本文利用的覆盖算法的这一特性重新组织多示例数据集的包结构.概括的说,为了排除正包中大量的假正例,首先对原有的多示例包结构进行重新构造,使用覆盖算法生成的球形邻域做为新的包结构,从而提高多示例样本在新的特征空间中的可分离性.然后,使用包层次的k近邻算法排除正包中大量的噪声并预测测试包的类别.实验表明,多示例学习的包层次覆盖k近邻算法具有很好的性能. Multi-instance learning is a new framework in machine learning. An extensive number of noises in the positive bags is the inherent difficulty of multi-instance learning. In order to improve the classification accuracy, this paper puts forward a novel multi-in- stance learning bag-level Covering-kNN algorithm to exclude the noises in multi-instance data set. The learning results of Covering al- gorithm is a set of sphere neighbors and each sphere neighbor only contains patterns belong to the same class. This feature help us re- organize the structure of bags in multi-instance data set. Generally speaking, in order to exclude false positive instances in the positive bags, first, we reconstruct the structure of multi-instance data set by treating the sphere neighbors obtained using Covering algorithm as the new structure of bags. Thus, improving the separable of multi-instance samples in the new feature space. Then, the bag-level kNN algorithm is utilized to exclude the noises in positive bags and predict the labels of test bags. The experiments demonstrate the effectiveness of the proposed multi-instance bag-level Covering-kNN algorithm.

作者赵姝芮辰陈洁张燕平

机构地区安徽大学计算智能与信号处理教育部重点实验室安徽大学人工智能研究所

出处《小型微型计算机系统》 CSCD 北大核心 2014年第11期2511-2514,共4页 Journal of Chinese Computer Systems

基金国家自然科学基金项目(61073117 61175046)资助安徽省教育厅基金项目(kJ2013A016)资助安徽大学研究生学术创新项目(10117700183)资助

关键词机器学习多示例学习覆盖算法 K近邻算法 machine learning multi-instance learning covering algorithm kNN algorithm

分类号 TP18 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献18

1Maron O. Learning from ambiguity [ D ]. Massachusetts Institute of Technology, USA, 1998.
2Fu Z Y, Robles-Kelly A, Zhou J. MILIS : multiple instance learning with instance selection [ J ]. IEEE Transactions on Pattern Analysis and Machine Intelligence ,USA ,2011,33 ( 5 ) :958-977.
3Nguyen D T, Nguyen C D, Hargraves R, et al. mi-DS : multiple-in- stance learning algorithm[ J]. IEEE Transactions on Systems, Man, and Cybernetics Society ,2013,43 ( 1 ) : 143-154.
4Babenko B ,Yang M H,Belongie S. Robust object tracking with on- line multiple instance learning [ J ]. IEEE Transactions on Pattern A- nalysis and Machine Intelligence,2011,33 (8) : 1619-1632.
5Xie Y, Qu Y Y, Li C H, et al. Online multiple instance gradient fea- ture selection for robust visual tracking[ J]. Pattern Recognition Let- ters, 2012,33 ( 9 ) : 1075-1082.
6Qi Z Q,Xu Y T,Wang L S,et al. Online multiple instance boosting for object detection[ J ]. Neurocomputing ,2011,74 (10) : 1769-1775.
7Bellare M, Ristenpart T, Tessaro S. Multi-instance security and its application to password-based cryptography[ J]. Advances in Cryp- tology -CRYPTO, 2012 : 312 -329.
8Dietterich T G,Lathrop R H,Lozano-P&ez T. Solving the multiple instance problem with axis-parallel rectangles [ J ]. Artificial Intelli- gence, 1997,89( 1 ) :31-71.
9Maron O, Lozano-Ptrez T. A framework for multiple-instance learn- ing[ J]. Advances in Neural Information Processing Systems, 1998, 10:570-576.
10Zhang Q, Goldman S A, et al. EM-DD: an improved multiple-in- stance learning technique[ J]. Advances in Neural Information Pro- cessing Systems,2001,14 ( 2022 ) : 1073-1080.

二级参考文献1

1张铃,张钹.多层反馈神经网络的FP学习和综合算法[J].软件学报,1997,8(4):252-258. 被引量：24

共引文献134

1段震,姚芳兵,张铃.基于构造性学习方法的车牌定位[J].微机发展,2004,14(8):41-43. 被引量：2
2张燕平,张铃,吴涛,徐锋,张,王伦文.基于覆盖的构造性学习算法SLA及在股票预测中的应用[J].计算机研究与发展,2004,41(6):979-984. 被引量：18
3段震,鲁杰,张铃.基于交叉覆盖神经网络的车牌识别研究[J].安徽大学学报（自然科学版）,2004,28(5):11-14. 被引量：7
4赵姝,张燕平,张媛,陈传明.基于交叉覆盖算法的改进算法——核平移覆盖算法[J].微机发展,2004,14(11):1-3. 被引量：6
5黄国宏,邵惠鹤.一种新的基于神经网络覆盖分类算法[J].中国图象图形学报（A辑）,2004,9(10):1165-1168. 被引量：6
6张燕平,张铃,段震.构造性核覆盖算法在图像识别中的应用[J].中国图象图形学报（A辑）,2004,9(11):1304-1308. 被引量：17
7阚涛,娄天玲.基于交叉覆盖算法的模糊神经网络在车用发电机故障诊断系统中的应用研究[J].安徽电子信息职业技术学院学报,2005,4(1):76-77.
8钱峰,张蕾,赵姝.基于粗糙集的交叉覆盖算法[J].铜陵学院学报,2004,3(4):70-71.
9毛军军,吴涛,郑婷婷,张铃.基于商空间的构造性分层竞争网络算法[J].微机发展,2005,15(4):37-39. 被引量：2
10唐理兵,倪志伟,李学俊,马猛.基于交叉覆盖设计算法的空间分类挖掘[J].微机发展,2005,15(4):43-45.

1栾丽华,吉根林.一种基于四叉树的快速聚类算法[J].计算机应用,2005,25(5):1001-1003. 被引量：6
2蔡自兴,李枚毅.多示例学习及其研究现状[J].控制与决策,2004,19(6):607-610. 被引量：12
3李大湘,赵小强,李娜.图像语义分析的多示例学习算法综述[J].控制与决策,2013,28(4):481-488. 被引量：3
4郑忠龙,俞牡丹,陈中育,杨凡,杨杰.同质球形邻域投影[J].计算机学报,2014,37(11):2256-2261.
5田玉昆,孙正章.一种基于直推式回归的移动跟踪算法[J].广东科技,2011,20(10):33-35.
6陈桂兰,袁海峰,张剑飞.基于块匹配的综合图像检索技术[J].哈尔滨师范大学自然科学学报,2011,27(1):33-36. 被引量：1
7张敏灵.偏标记学习研究综述[J].数据采集与处理,2015,30(1):77-87. 被引量：13
8王潇茵,胡昌振.基于直觉模糊——神经网络的色情图像识别算法[J].信息网络安全,2009(7):12-14. 被引量：1
9陈冠雄,姚志强.一种基于量化方法的3D模型盲水印算法[J].电子与信息学报,2009,31(12):2963-2968. 被引量：4
10谢莹,赵康康,许荣斌,程凡,钱田芬.基于协作行为的半自动人员分配策略[J].计算机集成制造系统,2016,22(2):448-454.

小型微型计算机系统

2014年第11期

浏览历史

内容加载中请稍等...

多示例学习的包层次覆盖k近邻算法

参考文献18

二级参考文献1

共引文献134

相关作者

相关机构

相关主题

浏览历史