Few-shot semantic segmentation based on multiscale 4D feature fusion


Abstract: Existing semantic segmentation methods rely on abundant pixel-level image labels, so training a segmentation model on novel classes incurs a heavy manual annotation cost. Few-shot semantic segmentation has been proposed to address this problem. Current few-shot segmentation methods mainly adopt prototype learning, which lacks pixel-level support features to guide segmentation of the query image and therefore yields limited accuracy. To address this, a few-shot segmentation network with four-dimensional feature fusion and attention enhancement is designed. To capture the pixel-level representation information that the support features carry for the query image, four-dimensional convolution is used within a feature pyramid structure to progressively compress high-level and mid-level semantic features into hypercorrelation features, which are then used to segment the query image. Experiments were conducted on two standard few-shot segmentation benchmarks; on the PASCAL-5i dataset under the 1-shot setting, the mIoU is 0.6% and 2.3% higher than that of HSNet and PFENet, respectively.
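As a rough illustration of the pixel-level correlation the abstract refers to, the NumPy sketch below builds a 4D tensor of cosine similarities between every query position and every support position. It is not the authors' implementation: the function name `correlation_4d`, the tensor shapes, the support-mask step, and the clamping of negative similarities to zero are all assumptions made for illustration, loosely following the general hypercorrelation idea. The 4D convolutions that would compress such tensors across pyramid levels are not shown.

```python
import numpy as np

def correlation_4d(query_feat, support_feat, support_mask=None, eps=1e-8):
    """Cosine similarity between every query and every support position.

    query_feat:   (C, Hq, Wq) feature map of the query image
    support_feat: (C, Hs, Ws) feature map of the support image
    support_mask: optional (Hs, Ws) binary foreground mask of the support image
    returns:      (Hq, Wq, Hs, Ws) non-negative correlation tensor
    """
    if support_mask is not None:
        # Assumption: background support positions are zeroed out before matching.
        support_feat = support_feat * support_mask[None, :, :]
    # L2-normalise along the channel axis so the dot product is a cosine similarity.
    q = query_feat / (np.linalg.norm(query_feat, axis=0, keepdims=True) + eps)
    s = support_feat / (np.linalg.norm(support_feat, axis=0, keepdims=True) + eps)
    # Contract the channel axis: one similarity per (query pixel, support pixel) pair.
    corr = np.einsum("chw,cij->hwij", q, s)
    # Assumption: negative similarities are clamped to zero.
    return np.maximum(corr, 0.0)

# Toy inputs standing in for backbone features (hypothetical sizes).
rng = np.random.default_rng(0)
query = rng.standard_normal((16, 8, 8))
support = rng.standard_normal((16, 8, 8))
mask = np.zeros((8, 8))
mask[2:6, 2:6] = 1.0

corr = correlation_4d(query, support, mask)
print(corr.shape)  # (8, 8, 8, 8)
```

Each entry `corr[h, w, i, j]` scores how well query position `(h, w)` matches support position `(i, j)`, which is the kind of pixel-level support information that a single class prototype averages away.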

Authors: DING Yue; CHEN Shaobo; YIN Zuoxuan (College of Electronics and Information Engineering, South-Central Minzu University, Wuhan 430074, China)

Affiliation: College of Electronics and Information Engineering, South-Central Minzu University

Source: Journal of South-Central Minzu University (Natural Science Edition), CAS, 2024, No. 6, pp. 772-780 (9 pages)

Funding: Fundamental Research Funds for the Central Universities (CZY22012).

Keywords: few-shot semantic segmentation; multi-scale features; hypercorrelation features; cross-attention

Classification code: TP181 [Automation and Computer Technology: Control Theory and Control Engineering]

