Few-shot Semantic Segmentation Based on Contrastive Learning and Background Mining

doi:10.15888/j.cnki.csa.009617

Author: WANG Shan-Jie
Affiliation: School of Software, Nanjing University of Information Science & Technology, Nanjing 210044, China

Abstract:

Few-shot semantic segmentation is a computer vision task that segments potential object categories in query images given only a small number of annotated samples. Existing methods still face two problems. The first is prototype bias: prototypes carry too little foreground object information, which makes it difficult for them to model the true category statistics. The second is feature degradation: the model attends only to the current category and ignores potential categories. This study proposes a new network based on contrastive prototypes and background mining. Its main idea is to make the model learn more representative prototypes and identify potential categories in the background. Specifically, a class-specific learning branch constructs a large and consistent prototype dictionary and then applies the InfoNCE loss to make the prototypes more discriminative. The background mining branch initializes background prototypes and uses an attention mechanism between these background prototypes and the dictionary to mine potential categories. Experiments on the PASCAL-5ⁱ and COCO-20ⁱ datasets demonstrate the model's strong performance: in the 1-shot setting with a ResNet-50 backbone, it achieves 64.9% and 44.2%, improvements of 4.0% and 1.9%, respectively, over the baseline model.

Key words: image segmentation; few-shot semantic segmentation; contrastive learning; background mining
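The abstract names three building blocks: class prototypes, an InfoNCE loss over a prototype dictionary, and attention between background prototypes and that dictionary. The following is a minimal NumPy sketch of these generic operations, not the paper's implementation. All function names, tensor shapes, the temperature value, the use of masked average pooling to form prototypes, and the averaging over multiple positives are assumptions made for illustration; the abstract does not specify any of them.

```python
import numpy as np


def masked_average_pooling(features, mask):
    """Average the feature vectors inside a binary mask (assumed prototype rule).

    features: (C, H, W) feature map; mask: (H, W), nonzero for selected pixels.
    Returns a (C,) prototype vector.
    """
    weights = mask.astype(features.dtype)
    total = weights.sum()
    if total == 0:
        raise ValueError("mask selects no pixels")
    return (features * weights[None]).sum(axis=(1, 2)) / total


def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors to unit length so dot products become cosine similarities."""
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)


def info_nce(query, dictionary, positive_mask, temperature=0.1):
    """InfoNCE loss for one query prototype against a prototype dictionary.

    query: (C,); dictionary: (K, C); positive_mask: (K,) bool, True where the
    dictionary entry shares the query's class. With exactly one positive this
    is the usual InfoNCE; averaging over several positives is an assumption.
    """
    q = l2_normalize(query)
    d = l2_normalize(dictionary)
    logits = d @ q / temperature           # (K,) cosine similarity / tau
    logits = logits - logits.max()         # shift for numerical stability
    log_prob = logits - np.log(np.exp(logits).sum())
    return -log_prob[positive_mask].mean()


def attend_to_dictionary(background_protos, dictionary):
    """Scaled dot-product attention with background prototypes as queries.

    background_protos: (M, C); dictionary: (K, C).
    Returns the (M, C) attended prototypes and the (M, K) attention weights.
    """
    scale = np.sqrt(dictionary.shape[1])
    scores = background_protos @ dictionary.T / scale
    scores = scores - scores.max(axis=1, keepdims=True)
    weights = np.exp(scores)
    weights /= weights.sum(axis=1, keepdims=True)
    return weights @ dictionary, weights
```

In this sketch, a low InfoNCE value means the query prototype is closer to same-class dictionary entries than to the others, which is the sense in which the loss makes prototypes more discriminative. The attention weights indicate which dictionary classes each background prototype resembles, one plausible reading of how potential categories could be mined from the background.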

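Cite this article

WANG Shan-Jie. Few-shot Semantic Segmentation Based on Contrastive Learning and Background Mining. Computer Systems & Applications, 2024, 33(9): 261-268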

History

Received: 2024-03-22
Revised: 2024-04-16
Published online: 2024-07-26
