基于语义对齐的小样本语义分割模型

doi:10.15888/j.cnki.csa.008830

AIPUB归智期刊联盟

微信公众号

网站二维码

2025年4月6日 5:33 星期日

首页 > 过刊浏览>2022年第31卷第12期 >203-210. DOI:10.15888/j.cnki.csa.008830

PDF HTML阅读 XML下载导出引用引用提醒

基于语义对齐的小样本语义分割模型
DOI:
                        10.15888/j.cnki.csa.008830
                    
CSTR:
                        
                    
作者:
                        张珉张珉
合肥工业大学 计算机与信息学院, 合肥 230009
在期刊界中查找
在百度中查找
在本站中查找
杨娟杨娟
合肥工业大学 计算机与信息学院, 合肥 230009
在期刊界中查找
在百度中查找
在本站中查找
汪荣贵汪荣贵
合肥工业大学 计算机与信息学院, 合肥 230009
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家自然科学基金联合基金 (U20B2044)

Few-shot Semantic Segmentation Model Based on Semantic Alignment

Author:

ZHANG Min
ZHANG Min
School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230009, China
在期刊界中查找
在百度中查找
在本站中查找
YANG Juan
YANG Juan
School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230009, China
在期刊界中查找
在百度中查找
在本站中查找
WANG Rong-Gui
WANG Rong-Gui
School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230009, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

现实世界的物体图像往往存在较大的类内变化, 使用单一原型描述整个类别会导致语义模糊问题, 为此提出一种基于超像素的多原型生成模块, 利用多个原型分别表示物体的不同语义区域, 通过图神经网络在生成的多个原型间利用上下文信息执行原型校正以保证子原型的正交性. 为了获取到更准确的原型表示, 设计了一种基于Transformer的语义对齐模块, 以挖掘查询图像特征和支持图像的背景特征中蕴含的语义信息, 此外还提出了一种多尺度特征融合结构, 引导模型关注同时出现在支持图像和查询图像中的特征, 提高对物体尺度变化的鲁棒性. 所提出的模型在PASCAL-5ⁱ数据集上进行了实验, 与基线模型相比平均交并比提高了6%.

关键词:小样本语义分割;度量学习;原型学习;Transformer;注意力机制;语义对齐

Abstract:

Object images in the real world often have large intra-class variations, and thus using a single prototype to describe an entire category will lead to semantic ambiguity. Considering this, a multi-prototype generation module based on superpixels is proposed, which uses multiple prototypes to represent different semantic regions of objects and employs the context to correct prototypes among the generated prototypes by a graph neural network to ensure the orthogonality of the sub-prototypes. To obtain a more accurate prototype representation, a Transformer-based semantic alignment module is designed to mine the semantic information contained in the features of the query images and the background features of the supporting images. In addition, a multi-scale feature fusion structure is proposed to instruct the model to focus on features that appear in both the supporting images and the query images, which can improve the robustness to changes in object scales. The proposed model is tested on the PASCAL-5ⁱ dataset, and the mean intersection over union (mIoU) is improved by 6% compared with that of the baseline model.

Key words:few-shot semantic segmentation;metric learning;prototype learning;Transformer;attention mechanism;semantic alignment

引用本文

张珉,杨娟,汪荣贵.基于语义对齐的小样本语义分割模型.计算机系统应用,2022,31(12):203-210

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2022-03-20
最后修改日期:2022-04-14
录用日期:
在线发布日期: 2022-08-19
出版日期:

微信公众号

网站二维码

引用本文

分享

文章指标

历史

文章二维码

微信公众号

网站二维码

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码