Abstract
Objective: Few-shot learning aims to correctly classify test samples when only a small number of labeled samples are available. Metric-learning-based few-shot methods map samples into an embedding space and compute distances as similarity measures to predict categories, but they fail to summarize representative features from the multiple support vectors within a class to characterize the class concept, which limits further improvement of classification accuracy. To address this problem, this paper proposes the representative feature network, which significantly improves classification performance.
Method: The representative feature network adopts a metric-learning strategy based on class representative features: it uses representative features learned from the set of support vectors in each class to express the class concept effectively and thus classify test samples correctly. Specifically, the network consists of two modules. The embedding module first extracts highly abstract embedding vectors; the stacked embedding vectors then pass through the representative feature module to yield one representative feature per class. The class of a test sample is predicted by computing the distances between its embedding vector and each class representative feature. Finally, the proposed mixture loss function is used to enlarge inter-class distances in the embedding space and reduce the misclassification of similar classes.
Result: Extensive experiments on the Omniglot, miniImageNet, and CIFAR-100 datasets verify that the model not only achieves the best classification accuracy reported to date but also maintains high training efficiency.
Conclusion: The representative feature network can effectively summarize representative features from the multiple support vectors of a class for classifying test samples. Compared with using support vectors directly, classification with representative features is more robust, and classification accuracy under few-shot conditions is further improved.
Objective: Few-shot learning aims to build a classifier that recognizes new, unseen classes from only a few labeled samples. Existing solutions fall mainly into three categories: data augmentation, meta-learning, and metric learning. Data augmentation can reduce over-fitting in the limited data regime of a new class, typically by augmenting data in the feature domain, e.g., by hallucinating features. These methods help few-shot classification to some extent, but because the data space is extremely small, the available transformations are severely limited and cannot fully resolve over-fitting. Meta-learning suits few-shot learning because it learns a high-level strategy across similar tasks: some methods learn good initial parameters, some learn task-level update strategies, and others build external memory storage to remember past information for comparison during testing. These methods classify well, but their network structures become increasingly complicated because of their reliance on recurrent neural networks (RNNs), and their efficiency is low. Metric learning is simple and efficient: it maps a sample into an embedding space and computes distances as similarity measures to predict the category. Some approaches improve the feature representation in the embedding space, some use learnable distance metrics to compute the loss, and others combine meta-learning methods to improve accuracy. However, methods of this type fail to summarize representative features from the multiple support vectors in a class to represent the class concept effectively, which limits further improvement of few-shot classification accuracy. To address this problem, this study proposes the representative feature network.
Method: The representative feature network is a metric-learning strategy built on class representative features. It uses representative features learned from the set of support vectors in each class to express the class concept effectively, and it uses a mixture loss to reduce the misclassification of similar classes, thereby achieving strong classification results. Specifically, the network comprises two modules. The embedding module extracts embedding vectors at a high level of abstraction, and the representative feature module then takes the stacked support vector set as input and outputs one representative feature per class. The class representative feature fully accounts for the varying influence of the embedded support samples, whose targets may or may not be visually obvious; letting the network learn a different weight for each embedded support vector effectively avoids the misclassification caused by representative features biased toward samples with unobvious targets. The distances from each embedded query vector to the class representative features are then computed to predict the class. In addition, a mixture loss function is proposed to counter the misclassification of similar classes in the embedding space: cross-entropy loss is combined with a relative-error loss to enlarge inter-class distances and reduce the error rate on similar classes.
Result: Extensive experiments on the Omniglot, miniImageNet, and CIFAR-100 datasets verify that the model achieves state-of-the-art results. On the simple Omniglot dataset, the five-way, five-shot classification accuracy is 99.7%, which is 1% higher than that of the original matching network. On the more complex miniImageNet dataset, the five-way, five-shot accuracy is 75.83%, approximately 18% higher than that of the original matching network. Representative features contribute approximately 8% of this improvement, indicating that they express the class prototype effectively by distinguishing the contributions of support vectors whose targets may or may not be obvious. The mixture loss contributes approximately 1%, indicating that it reduces some misclassification of similar classes in the testing set; the gain is modest because similar samples are uncommon in the dataset. The remaining 9% comes from fine-tuning on the test set, showing that the skip-connection structure benefits loss propagation compared with the original connections between network modules. On CIFAR-100, the five-way, five-shot accuracy is 87.99%, 20% higher than that of the original matching network. Moreover, high training efficiency is maintained while performance improves significantly.
Conclusion: To address the problem that the original embedding networks are too simple to extract high-level features from samples, the improved embedding network in the representative feature network uses a skip-connection structure to deepen the network and extract advanced features. To address the problem of noisy support vectors disturbing the classification of a testing sample, the representative feature network effectively summarizes representative features from the multiple support vectors in a class for classifying query samples. Compared with using support vectors directly, classification with representative features is more robust, and classification accuracy under few-shot conditions is further improved. In addition, the proposed mixture loss function enlarges inter-class distances in the embedding space and reduces the misclassification of similar classes. Detailed experiments verify that these improvements achieve strong performance on few-shot learning tasks on the Omniglot, miniImageNet, and CIFAR-100 datasets. For the embedding network, advanced structures such as dense connections or SE (squeeze-and-excitation) modules could be incorporated in future work to further improve the results.
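The paper itself provides no code here; the following is a minimal PyTorch sketch of the two-module design described in the Method section: an embedding module, a representative feature module that learns a weight per support vector before aggregation, and distance-based prediction. The module names, the shallow CNN, the linear scoring head, and the softmax weighting are all illustrative assumptions, not the authors' exact implementation (the paper's embedding network is deeper and uses skip connections).

```python
import torch
import torch.nn as nn

class EmbeddingNet(nn.Module):
    """Stand-in embedding module: maps images to embedding vectors.
    (A plain two-block CNN is used here only to keep the sketch short.)"""
    def __init__(self, out_dim: int = 64):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(3, 64, 3, padding=1), nn.BatchNorm2d(64), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(64, out_dim, 3, padding=1), nn.BatchNorm2d(out_dim), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )

    def forward(self, x):                      # x: (N, 3, H, W)
        return self.conv(x).flatten(1)         # (N, out_dim)

class RepresentativeFeatureModule(nn.Module):
    """Aggregates the stacked support embeddings of each class into one
    representative feature, learning a weight per support vector so that
    samples with unobvious targets do not bias the class feature."""
    def __init__(self, dim: int = 64):
        super().__init__()
        self.score = nn.Linear(dim, 1)         # hypothetical scoring head

    def forward(self, support):                # support: (n_way, k_shot, dim)
        w = torch.softmax(self.score(support), dim=1)  # weights over the k shots
        return (w * support).sum(dim=1)        # (n_way, dim) representatives

def predict(query_emb, reps):
    """Classify queries by Euclidean distance to the class representatives."""
    dists = torch.cdist(query_emb, reps)       # (n_query, n_way)
    return dists.argmin(dim=1), dists

# Usage on a toy 5-way, 5-shot episode with random images:
emb, rfm = EmbeddingNet(), RepresentativeFeatureModule()
support = emb(torch.randn(5 * 5, 3, 32, 32)).view(5, 5, -1)
labels, dists = predict(emb(torch.randn(10, 3, 32, 32)), rfm(support))
```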
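The abstract states only that the mixture loss combines cross-entropy with a relative-error loss to enlarge inter-class distances; the exact form of the relative-error term is not given, so the ratio used below is an assumption chosen to match that description, and `lam` is a hypothetical mixing weight.

```python
import torch
import torch.nn.functional as F

def mixture_loss(dists, labels, lam: float = 0.5):
    """Cross-entropy on distance-based logits plus a relative-error term.

    dists:  (n_query, n_way) distances to the class representatives
    labels: (n_query,) ground-truth class indices
    lam:    mixing weight between the two terms (assumed hyperparameter)
    """
    logits = -dists                                   # closer => larger logit
    ce = F.cross_entropy(logits, labels)

    # Assumed relative-error term: shrink the distance to the true class
    # *relative to* the mean distance over all classes, which pushes the
    # classes apart in the embedding space.
    d_true = dists.gather(1, labels.unsqueeze(1)).squeeze(1)
    rel = (d_true / (dists.mean(dim=1) + 1e-8)).mean()
    return ce + lam * rel
```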
Authors
Wang Ronggui (汪荣贵), Zheng Yan (郑岩), Yang Juan (杨娟), Xue Lixia (薛丽霞)
School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601, China
Source
《中国图象图形学报》 (Journal of Image and Graphics), 2019, No. 9, pp. 1514-1527 (14 pages)
Indexed in: CSCD; Peking University Core Journals (北大核心)
Keywords
few-shot learning
metric learning
representative feature network
mixture loss function
fine-tuning