基于深度互学习的多标记零样本分类

Multi-Label Zero-Shot Classification Based on Deep Mutual Learning

下载PDF

导出

摘要目前已有大量方案解决零样本图像分类问题,但对多标记零样本图像分类问题的研究很少,在现有的解决方案中,模型在训练时除了利用已标注的数据集和给定的先验知识外,只利用图像区域信息或只利用标签语义信息。基于深度互学习技术,提出一种能同时利用图像区域和标签语义两种信息的解决方法。设计两个子网络,将子网络1用于增强图像视觉特征,通过多头自注意机制关联图像中不同区域的特征信息,得到基于区域的视觉特征表示,再将该特征表示映射到语义空间中,并输出预测概率分布;使子网络2用于融合标签语义信息与图像视觉特征,通过计算标签和图像区域特征的相关性,得到基于语义的视觉特征表示,将特征表示映射到语义空间中输出概率分布。最后引入深度互学习技术,利用两个子网络得到的概率分布为对方提供训练经验以进行互相学习,该过程中子网络在训练自身分类性能的同时也学习对方的训练经验,有效提升多标记零样本图像分类的性能。实验结果表明,所提方法在MS COCO数据集上的F1值相比Deep0Tag方法提升了5.2个百分点。 Numerous methods have been proposed to solve the zero-shot image classification problem;however,there are limited studies on the multi-label zero-shot image classification problem.In the existing solutions,in addition to the use of the basic settings of the labeled dataset and the given prior knowledge,the model either only uses the image region information or only the label semantic information.Based on deep mutual learning technology,this study proposes a solution that utilizes both the image region and label semantic information.Two sub-networks are designed.Sub-network 1 is used to enhance the visual features of the image,whereas the multi-head self-attention mechanism is used to associate the feature information of different regions in the image to obtain a region-based visual feature representation and then map the feature representation to the semantic space to output the predicted probability.Sub-network 2 is used to fuse the label semantic information and image visual features by calculating the correlation between the labels and image region features to obtain a semantic-based visual feature representation,and then map the feature representation to the semantic space to output a probability distribution.Finally,the deep mutual learning technology is introduced,and the probability distribution obtained by the two sub-networks is used to provide training experience for mutual learning.In this process,the sub-network refers to the training experience of the other sub-network while training its own classification performance,which effectively improves the performance of multi-label zero-shot image classification.The experimental results show that the F1 value of the proposed method on the MS COCO dataset increased by 5.2 percentage points compared to the Deep0Tag method.

作者袁志祥王雅卿黄俊 YUAN Zhixiang;WANG Yaqing;HUANG Jun(School of Computer Science and Technology,Anhui University of Technology,Maanshan 243032,Anhui,China)

机构地区安徽工业大学计算机科学与技术学院

出处《计算机工程》 CAS CSCD 北大核心 2023年第10期64-71,共8页 Computer Engineering

基金国家自然科学基金(61806005) 安徽省高校科学研究重点项目(KJ2021A0372,KJ2021A0373) 安徽省高校优秀青年人才支持计划项目(gxyqZD2022032)。

关键词深度学习图像分类多标记学习零样本学习互学习 deep learning image classification multi-label learning zero-shot learning mutual learning

分类号 TP18 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献3

1蔡亚萍,杨明.一种利用局部标记相关性的多标记特征选择算法[J].南京大学学报（自然科学版）,2016,52(4):693-704. 被引量：9
2朱赛赛,贾修一,李泽超.一种基于全局和局部标记相关性的多标记分类算法[J].电子学报,2020,48(12):2345-2351. 被引量：3
3魏宏喜,张越.基于生成对抗网络的零样本图像分类[J].北京航空航天大学学报,2019,45(12):2345-2350. 被引量：7

二级参考文献27

1Tsoumakas G, Katakis I. Multi-label classification: An overview. International Journal of Data Warehousing and Mining, 2007,3 (3) : 1 - 13.
2Tsoumakas G, Katakis, Vlahavas L Mining multi- label data. In: Maimon O, Rokach L. Data Mining and Knowledge Discovery Handbook, Part 6. The 2^nd Edition. US : Springer, 2010,67 - 685.
3Schapire R E, Singer Y. Boostexter: A boosting- based system for text categorization. Machine Learning, 2000,39 (2/3) :135-168.
4Godbole S, Sarawagi S. Discriminative methods for multi-labeled classification. In: PAKDD'04: The 8^th Pacific-Asia Conferenee on Knowledge Discovery and Data Mining. Berlin: Springer, 2004,22-30.
5Ftirnkranz J, Htillermeier E, Mencia E L, et al. Multilabel classification via calibrated label ranking. Machine Learning, 2008, 73 (2): 133-153.
6Clare A, King R D. Knowledge discovery in multi-label phenotype data. In:De Raedt L, Siebes A. Leeture Notes in Computer Science 2168. Berlin: Springer, 2001,42 - 53.
7Elisseeff A,Weston J. A kernel method for multi- labelled classification. In: Dietteroch T G, Bercker S,Ghahramani Z. Advances in Neural Information Processing Systems 14. Cambridge, MA: MIT Press, 2002,681- 687.
8Barutcuoglu Z,Schapire R E, Troyanskaya O G. Hierarchical multi-label prediction of gene function. Bioinformatics, 2006,22 (7) : 830 - 836.
9Boutell M R, Luo J, Shen X, et al. Learning multi- label scene classification. Pattern Recognition. 2004,37(9) :1757-1771.
10Qi G J, Hua X S, Rui Y, et al. Correlative multi- label video annotation. In: Proceedings of the 15th ACM International Conference on Multimedia. New York, NY: ACM Press, 2007, 17-26.

共引文献16

1王一宾,程玉胜,裴根生.结合均值漂移的多示例多标记学习改进算法[J].南京大学学报（自然科学版）,2018,54(2):422-435. 被引量：4
2徐洪峰,孙振强.多标签学习中基于互信息的快速特征选择方法[J].计算机应用,2019,39(10):2815-2821. 被引量：13
3程玉胜,李志伟,庞淑芳.特征标记依赖自编码器的多标记特征提取方法[J].计算机科学与探索,2020,14(3):470-481. 被引量：4
4王一宾,吴陈,程玉胜,江健生.不平衡标记差异性多标记特征选择算法[J].深圳大学学报（理工版）,2020,37(3):234-242. 被引量：2
5贾霄,郭顺心,赵红.基于图像属性的零样本分类方法综述[J].南京大学学报（自然科学版）,2021,57(4):531-543. 被引量：2
6吕露露,黄毅,高君宇,杨小汕,徐常胜.多模态零样本人体动作识别[J].中国图象图形学报,2021,26(7):1658-1667. 被引量：4
7李田力,陈飞,江家宝.标记不平衡性的多标记粗糙互信息特征选择[J].忻州师范学院学报,2021,37(5):42-48. 被引量：2
8吴韵怡.新媒体背景下的视频广告分类系统设计[J].微型电脑应用,2022,38(4):65-68. 被引量：1
9张冀,曹艺,王亚茹,赵文清,翟永杰.融合VAE和StackGAN的零样本图像分类方法[J].智能系统学报,2022,17(3):593-601. 被引量：9
10孙林,陈雨生,徐久成.基于改进ReliefF的多标记特征选择算法[J].山东大学学报（理学版）,2022,57(4):1-11. 被引量：9

1唐义承,纪惠芬.基于嵌入对比学习的广义零样本预分类模型[J].计算机时代,2023(10):75-79.
2Ruth Devlin.凤头鹦鹉[J].空中英语教室（中级版）,2023(10):40-41.
3王芳.图像视觉特征的机械臂末端位姿监测方法研究[J].机械设计与制造,2023(10):281-284.
4孙林,徐枫,李硕,王振.基于ReliefF和最大相关最小冗余的多标记特征选择[J].河南师范大学学报（自然科学版）,2023,51(6):21-29. 被引量：7
5陈跃鹏,任博博,靳佳澍,吴明希.基于并联卷积神经网络的运动模糊去除模型[J].华中科技大学学报（自然科学版）,2023,51(9):140-145.
6张辉宜,夏媛龙,周克武,包向华,陶陶.一种融合标签间强相关性的多标签图像分类方法[J].重庆工商大学学报（自然科学版）,2023,40(5):8-15. 被引量：1
7冉宁,张家明,杨宏飞,郝真鸣,郝晋渊.基于语义分割网络的AGV路径规划算法[J].电子测量与仪器学报,2023,37(7):121-130.
8高国建,赵玉凤,刘颖,邹雯,王健.中医药防治艾滋病临床疗效与评价方法研究现状[J].中华中医药杂志,2023,38(10):4814-4818. 被引量：3
9麦克斯韦·芬列森.踏上湿地之旅发现烟火中国[J].中国新闻发布（实务版）,2023(7):71-73.
10翟梦.多源多尺度天地图DLG数据融合技术方法研究[J].测绘与空间地理信息,2023,46(10):125-127. 被引量：1

计算机工程

2023年第10期

浏览历史

内容加载中请稍等...

基于深度互学习的多标记零样本分类

参考文献3

二级参考文献27

共引文献16

相关作者

相关机构

相关主题

浏览历史