期刊文献+

基于Swin Transformer的嵌入式零样本学习算法

Embedded Zero-shot Learning Algorithm Based on Swin Transformer
下载PDF
导出
摘要 零样本学习旨在解决样本缺失情况下的分类问题.以往嵌入式零样本学习算法通常只利用可见类构建嵌入空间,在测试时不可避免会出现过拟合可见类的问题.基于此本文提出了一种基于类别语义相似度的多标签分类损失,该损失可在构建嵌入空间的过程中引导模型同时考虑与当前可见类语义上相似的未见类,进而将语义空间的相似性迁移到最终执行分类的嵌入空间.同时现有零样本学习算法大部分直接使用图像深度特征作为输入,特征提取过程没有考虑语义信息,基于此本文采用Swin Transformer作为骨干网络,输入原始图片利用自注意力机制得到基于语义信息的视觉特征.本文在3个零样本学习基准数据集上进行了大量实验,与目前最先进的算法相比取得了最佳的调和平均精度. Zero-shot learning aims to solve the classification problem in the absence of samples.Previous embedding-based zero-shot learning algorithms usually construct an embedding space using only seen classes,which inevitably leads to overfitting problems for seen classes during testing phase.Based on this,this paper proposes a multi-label classification loss based on the semantic similarity of classes.This loss guides the model to simultaneously consider the unseen classes that are semantically similar to the current seen classes in the process of constructing the embedding space,and then migrates the similarity of the semantic space to the embedding space where classification is finally performed.Meanwhile,most of the existing zero-shot learning algorithms directly use the deep features of images as input,and the feature extraction process does not consider semantic information.Based on this,this paper adopts Swin Transformer as the backbone and inputs the original images to obtain the visual features based on semantic information using the self-attention mechanism.In this paper,extensive experiments are conducted on three zero-shot learning benchmark datasets,and the best harmonic mean is achieved compared with the current state-of-the-art methods,which validates the effectiveness of the method in this paper.
作者 郜佳琪 魏巍 岳琴 GAO Jiaqi;WEI Wei;YUE Qin(College of Computer Science and Technology,Shanxi University,Taiyuan 030006,China;Key Laboratory of Computer Intelligence and Chinese Information Processing of Ministry of Education,Shanxi University,Taiyuan 030006,China)
出处 《小型微型计算机系统》 CSCD 北大核心 2024年第4期784-791,共8页 Journal of Chinese Computer Systems
基金 国家自然科学基金项目(62276160,61976184,61772323)资助 山西省1331工程项目资助.
关键词 零样本学习 深度学习 图像分类 注意力 Swin Transformer zero-shot learning deep learning image classification attention Swin Transformer
  • 相关文献

参考文献2

二级参考文献1

共引文献30

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部