Abstract
Traditional active learning methods select examples based only on the predictions of the current model, neglecting the information contained in previously trained models, which reflects the stability of each unlabeled example's prediction sequence during the active learning process. Thus, a novel active learning method with instability sampling is proposed, which estimates the potential utility of each unlabeled example for improving model performance from the differences among the predictions of previous models. The proposed method measures the instability of an unlabeled example by the difference between the posterior probabilities predicted by the historical models, and queries the example with the largest instability. Extensive experiments conducted on multiple datasets with diverse classification models validate the effectiveness of the proposed method.
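The instability criterion described in the abstract can be sketched as follows. The paper's exact distance measure and aggregation over the model history are not specified here, so this minimal sketch assumes the L1 distance between consecutive models' posterior probability vectors, summed over the history; the function names are illustrative, not from the paper.

```python
import numpy as np

def instability_scores(history):
    """Estimate per-example instability from a model history.

    history: array-like of shape (T, N, C) -- posterior probabilities
    predicted for N unlabeled examples by T successive models over
    the active learning rounds.
    Returns an array of N instability scores.
    """
    h = np.asarray(history, dtype=float)
    # L1 difference between each pair of consecutive models' posteriors,
    # summed over the C classes: shape (T-1, N).
    diffs = np.abs(np.diff(h, axis=0)).sum(axis=2)
    # Aggregate over the history: total prediction fluctuation per example.
    return diffs.sum(axis=0)

def select_query(history):
    """Index of the most unstable unlabeled example to query next."""
    return int(np.argmax(instability_scores(history)))
```

For example, an example whose posterior swings from (0.5, 0.5) to (0.9, 0.1) between two rounds scores higher than one that stays near (0.9, 0.1), so the sampler prefers examples the evolving model keeps changing its mind about.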
Authors
HE Hua, XIE Mingkun, HUANG Shengjun (College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China)
Source
Journal of National University of Defense Technology (《国防科技大学学报》)
Indexed in: EI, CAS, CSCD, Peking University Core (北大核心)
2022, No. 3, pp. 50-56 (7 pages)
Funding
New Generation Artificial Intelligence Major Project (2020AAA0107000)
Natural Science Foundation of Jiangsu Province (BK20211517)
Keywords
active learning
labeling cost
instability
posterior probability
entropy