基于视觉误差与语义属性的零样本图像分类被引量：4

Zero-shot image classification based on visual error and semantic attributes

下载PDF

导出

摘要在图像分类的实际应用过程中,部分类别可能完全没有带标签的训练数据。零样本学习(ZSL)的目的是将带标签类别的图像特征等知识迁移到无标签的类别上,实现无标签类别的正确分类。现有方法在测试时无法显式地区分输入图像属于已知类还是未知类,很大程度上导致未知类在传统设定下的ZSL和广义设定下的ZSL(GZSL)上的预测效果相差甚远。为此,提出一种融合视觉误差与属性语义信息的方法来缓解零样本图像分类中的预测偏置问题。首先,设计一种半监督学习方式的生成对抗网络架构来获取视觉误差信息,由此预测图像是否属于已知类;然后,提出融合属性语义信息的零样本图像分类网络来实现零样本图像分类;最后,测试融合视觉误差与属性语义的零样本图像分类方法在数据集AwA2和CUB上的效果。实验结果表明,与对比模型相比,所提方法有效缓解了预测偏置问题,其调和指标H在AwA2(Animal with Attributes)上提升了31.7个百分点,在CUB(Caltech-UCSD-Birds-200-2011)上提升了8.7个百分点。 In the practical applications of image classification,some categories may have no labeled training data at all.The purpose of Zero-Shot Learning(ZSL)is to transfer knowledge such as image features of labeled categories to unlabeled categories and to correctly classify the unlabeled categories.However,the existing state-of-the-art methods cannot explicitly distinguish the input image belonging to the known categories or unknown categories,which leads to a large performance gap for unlabeled categories between the traditional ZSL prediction and the Generalized ZSL(GZSL)prediction.Therefore,a method of fusing of visual error and semantic attributes was proposed to alleviate the prediction bias problem in zero-shot image classification.Firstly,a semi-supervised learning based generative adversarial network framework was designed to obtain visual error information,so as to predict whether the image belongs to the known categories.Then,a zero-shot image classification network combining semantic attributes was proposed to achieve zero-shot image classification.Finally,the performance of zero-shot image classification algorithm combining visual error and semantic attributes was tested on AwA2(Animal with Attributes)and CUB(Caltech-UCSD-Birds-200-2011)datasets.The experimental results show that,compared to the baseline models,the proposed method can effectively alleviate the prediction bias problem,and has the harmonic index H increased by 31.7 percentage points on AwA2 dataset and 8.7 percentage points on CUB dataset.

作者徐戈肖永强汪涛陈开志廖祥文吴运兵 XU Ge;XIAO Yongqiang;WANG Tao;CHEN Kaizhi;LIAO Xiangwen;WU Yunbing(College of Computer and Control Engineering,Minjiang University,Fuzhou Fujian 350108,China;College of Mathematics and Computer Science,Fuzhou University,Fuzhou Fujian 350116,China;Fujian Provincial Key Laboratory of Networking Computing and Intelligent Information Processing(Fuzhou University),Fuzhou Fujian 350116,China;Digital Fujian Financial Big Data Institute,Fuzhou Fujian 350116,China)

机构地区闽江学院计算机与控制工程学院福州大学数学与计算机科学学院福建省网络计算与智能信息处理重点实验室(福州大学) 数字福建金融大数据研究所

出处《计算机应用》 CSCD 北大核心 2020年第4期1016-1022,共7页 journal of Computer Applications

基金国家自然科学基金资助项目(61772135,U1605251,61703195) 中国科学院网络数据科学与技术重点实验室开放课题基金资助项目(CASNDST201708,CASNDST201606) 模式识别国家重点实验室开放课题基金资助项目(201900041) 福建省自然科学基金面上项目(2017J01755) 赛尔网络下一代互联网技术创新项目(NGII20160501)。

关键词零样本学习图像分类生成对抗网络视觉误差属性语义 Zero-Shot Learning(ZSL) image classification generative adversarial network visual error semantic attribute

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1巩萍,程玉虎,王雪松.基于属性关系图正则化特征选择的零样本分类[J].中国矿业大学学报,2015,44(6):1097-1104. 被引量：7

二级参考文献20

1FERRARI V, ZISSERMAN A. Learning visual at-tributes[C]//Proceedings of the Advances in Neural Information Processing Systems. Vancouver: Curran Associates Inc Press, 2007 : 433-440.
2WAN K W, ROY S. Identifying and learning visual attributes for object recognition[C]//Proceedings of the IEEE International Conference on Image Process- ing. Piseataway : IEEE Inc Press, 2010 : 3893-3896.
3FARHADI A, ENDRES I, HOIEM D, et al. Descri- bing objects by their attributes[C]//Proceedings of the IEEE Computer Vision and Pattern Recognition. Piscataway: IEEE Inc Press,2009 : 1778-1785.
4SONG F Y,TAN X Y,CHEN S C. Exploiting rela- tionship between attributes for improved face verifiea- tion[J].Computer Vision and Image Understanding, 2014,122(4) : 143-154.
5HENG T C,FENG T S,MARTIN G. NuActiv:Rec- ognizing unseen new activities using semantic attrib- ute-based learning[C]//Proceedings of the llth An- nual International Conference on Mobile Systems, Applications,and Services. New York: ACM Press, 2013:361-374.
6KOVASHKA A,PARIKH D,GRAUMAN K. Whit- tle Search: Image search with relative attribute feed- back[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Los Alam- itos: IEEE Computer Society Press, 2012 : 2973-2980.
7LAMPERT C H, NICKISCH H, HARMELING S. Learning to detect unseen object classes by between- class attribute transfer[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recogni- tion. Piscataway: IEEE Inc Press, 2009 : 951-958.
8PALATUCCI M, POMERLEAU D, HINTON G E, et al. Zero-shot learning with semantic output codes [C]//Proceedings of the Advances in Neural Infor- mation Processing Systems. Vancouver:Curran Asso- ciates Inc Press,2009:1410-1418.
9YU X, ALOIMONOS Y. Attribute-based transfer learning for object categorization with zero or one training example[C]//Proceedings of the llth Euro- pean Conference on Computer Vision. Berlin:Spring- er Verlag Press, 2010 : 127-140.
10ROHRBACH M, STARK M, SZARVAS G, et al. Combining language sources and robust semantic re- latedness for attribute-based knowledge transfer [C]//Proceedings of the l lth European Conference on Computer Vision. Berlin: Springer Verlag Press, 2010:15-28.

共引文献6

1臧绍飞,程玉虎,王雪松.基于样本局部判别权重的加权迁移成分分析[J].中国矿业大学学报,2016,45(5):1043-1049. 被引量：1
2冀中,谢于中,庞彦伟.基于典型相关分析和距离度量学习的零样本学习[J].天津大学学报（自然科学与工程技术版）,2017,50(8):813-820. 被引量：5
3秦牧轩,荆晓远,吴飞.基于公共空间嵌入的端到端深度零样本学习[J].计算机技术与发展,2018,28(11):44-47. 被引量：3
4芦楠楠,张欣茹,欧倪.基于粒子群算法寻最优属性关联下的零样本语义自编码器[J].电子与信息学报,2021,43(4):982-991. 被引量：1
5贾霄,郭顺心,赵红.基于图像属性的零样本分类方法综述[J].南京大学学报（自然科学版）,2021,57(4):531-543. 被引量：2
6李雨泽,张岩,陈宇,杨春玲.基于深层-浅层双流学习图模型的无监督少样本红外空中目标识别网络[J].红外与毫米波学报,2023,42(6):917-924.

同被引文献43

1宋闯,赵佳佳,王康,梁欣凯.面向智能感知的小样本学习研究综述[J].航空学报,2020(S01):15-28. 被引量：16
2翟晓燕,张新政.有向网络中具有一个枢纽点的最小支撑树的计算方法[J].系统科学与数学,2005,25(6):649-657. 被引量：2
3尹志武,黄上腾.一种自适应局部概念漂移的数据流分类算法[J].计算机科学,2008,35(2):138-139. 被引量：8
4文益民,强保华,范志刚.概念漂移数据流分类研究综述[J].智能系统学报,2013,8(2):95-104. 被引量：25
5何超,张玉峰.融合领域本体的中文文本语义特征提取算法研究[J].情报理论与实践,2013,36(9):96-99. 被引量：6
6葛伟,朱金福,吴薇薇,吴小欢.基于无容量限制的p-枢纽中位问题的随机优化[J].系统工程理论与实践,2013,33(10):2674-2678. 被引量：11
7马保雷,宋颖慧,刘亚维.基于概念漂移检测的自适应流量识别的研究[J].智能计算机与应用,2013,3(6):50-53. 被引量：1
8林武旭,成科扬,张建明.基于属性学习的图像分类研究[J].计算机科学,2014,41(5):288-291. 被引量：5
9巩萍,程玉虎,王雪松.基于属性关系图正则化特征选择的零样本分类[J].中国矿业大学学报,2015,44(6):1097-1104. 被引量：7
10魏晓聪,林鸿飞.面向迁移学习的文本特征对齐算法[J].计算机工程,2017,34(2):215-219. 被引量：7

引证文献4

1贾霄,郭顺心,赵红.基于图像属性的零样本分类方法综述[J].南京大学学报（自然科学版）,2021,57(4):531-543. 被引量：2
2夏弘睿,赵静.基于人眼视觉特性的景观图像高频细节增强方法[J].宜春学院学报,2022,44(12):39-43. 被引量：1
3倪伟,王展旭,卞悦旭.基于卷积神经网络的零样本细粒度特征识别[J].信息技术,2023,47(2):86-90.
4白万荣,张驯,张蕾,杨凡,邵洁.基于类混合高斯映射的归纳式广义零样本识别[J].计算机应用与软件,2024,41(11):206-212.

二级引证文献3

1申海锋,石颉,李莎莎,柴梓嘉.特征属性描述下设备的新故障零样本识别[J].微电子学与计算机,2023,40(6):77-84. 被引量：1
2张方泽,龚循强,周秀芳,刘卓涛.基于自训练卷积神经网络的遥感场景图像异常探测方法[J].时空信息学报,2023,30(4):482-490. 被引量：2
3杨碧香.大规模景观图像斑块特征增强算法仿真[J].现代电子技术,2024,47(12):86-90.

1冶忠林,赵海兴,张科,朱宇.基于多源信息融合的分布式词表示学习[J].中文信息学报,2019,33(10):18-30. 被引量：4
2邵一鸣,孙红星,陈虹羊.基于深度学习的人脸遮挡检测方法[J].辽宁科技大学学报,2019,42(6):454-461.
3Hamayun A. Khan.DM-L Based Feature Extraction and Classifier Ensemble for Object Recognition[J].Journal of Signal and Information Processing,2018,9(2):92-110.
4郑志蕴,吴建萍,李钝,刘允,米高扬.一种基于短文本相似度计算的知识子图融合方法[J].小型微型计算机系统,2020,41(1):6-11. 被引量：8
5李国瑞,何小海,吴晓红,卿粼波,滕奇志.基于语义信息跨层特征融合的细粒度鸟类识别[J].计算机应用与软件,2020,37(4):132-136. 被引量：4
6郑可菜.当我们对比美国教育时,我们借鉴什么?[J].师道（人文）,2020,0(3):15-17.
7陶飞,成科扬,张建明,汤宇豪.基于姿态与并行化属性学习的行人再识别方法[J].计算机工程,2020,46(3):246-253. 被引量：2
8李象远,姚晓霞,申屠江涛,孙晓慧,李娟琴,刘明夏,许诗敏.燃烧反应机理构建的双参数速率常数方法[J].高等学校化学学报,2020,41(3):512-520. 被引量：5
9彭玉海.动词多义义位的组合特征分析[J].外国语言文学,2019,36(6):633-646.
10李创,张涛,张祥伍.矢量线约束下的局部地形实时修正[J].指挥信息系统与技术,2020,11(1):74-79.

计算机应用

2020年第4期

浏览历史

内容加载中请稍等...

基于视觉误差与语义属性的零样本图像分类被引量：4

参考文献1

二级参考文献20

共引文献6

同被引文献43

引证文献4

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

基于视觉误差与语义属性的零样本图像分类 被引量：4

参考文献1

二级参考文献20

共引文献6

同被引文献43

引证文献4

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

基于视觉误差与语义属性的零样本图像分类被引量：4