Abstract: Few-shot semantic segmentation is a computer vision task that segments novel object categories in query images given only a few annotated samples. Existing methods still face two challenges. The first is prototype bias: prototypes carry too little foreground object information, making it difficult to approximate the true category statistics. The second is feature degradation: the model focuses only on the current category and ignores potential categories. This study proposes a new network based on contrastive prototypes and background mining, whose main idea is to learn more representative prototypes and to identify potential categories from the background. Specifically, a class-specific learning branch constructs a large and consistent prototype dictionary and applies an InfoNCE loss to make the prototypes more discriminative, while a background mining branch initializes background prototypes and applies an attention mechanism between these prototypes and the dictionary to mine potential categories. Experimental results on the PASCAL-5i and COCO-20i datasets demonstrate the strong performance of the model: under the 1-shot setting with a ResNet-50 backbone, it achieves 64.9% and 44.2%, improvements of 4.0% and 1.9%, respectively, over the baseline model.
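The InfoNCE loss mentioned above contrasts each query prototype against one positive and many negative entries from the dictionary. As a minimal sketch (not the paper's implementation; the cosine similarity, temperature value, and function names here are illustrative assumptions):

```python
import math

def cosine(u, v):
    # Cosine similarity between two vectors represented as lists of floats.
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def info_nce(query, positive, negatives, tau=0.07):
    # InfoNCE: negative log-probability of the positive under a softmax
    # over the positive and negative similarities, scaled by temperature tau.
    sims = [cosine(query, positive)] + [cosine(query, n) for n in negatives]
    logits = [s / tau for s in sims]
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(l - m) for l in logits]
    return -math.log(exps[0] / sum(exps))
```

Pulling the positive prototype close while pushing dictionary negatives apart is what makes the learned prototypes more discriminative; a lower temperature sharpens the penalty on hard negatives.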