融合注意力与多尺度特征的城市街景实例分割
Author:
Funding: National Natural Science Foundation of China (41975183)

Instance Segmentation of Urban Streetscape Incorporating Attention and Multi-scale Feature
    摘要:

    城市街道场景实例分割算法可以显著提升城市环境感知和智能交通系统的准确性与效率, 针对城市街景行人和车辆之间相互遮挡和背景干扰严重等问题, 提出一种基于频率注意力机制和多尺度特征融合的实例分割模型FMInst. 首先, 构建一种高低频注意力机制进行交互编码从而增加高分辨率细节信息. 其次, 在Swin Transformer主干网络的Patch Merging层引入软池化操作, 减少特征信息损失, 有效提高小尺度目标分割结果. 最后, 结合MLP层构建多尺度的深度卷积, 有效增强目标局部信息提取, 提升实例分割精度. 在Cityscapes公共数据集进行对比实验, 结果表明FMInst的mAP提高1.2%, 达35.6%, 同时AP50提高2.2%, 达61.4%, 极大地改善实例分割的掩码质量和分割效果.
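The soft pooling mentioned in the abstract follows SoftPool (Stergiou et al., 2021, reference [32] of the paper): each pooling window is collapsed to a softmax-weighted average rather than a hard maximum. The sketch below is an illustrative stand-alone implementation of that formula, not FMInst's actual Patch Merging code; the window size and input are arbitrary examples.

```python
import math

def softpool2d(x, k=2):
    """SoftPool: reduce each k x k window to a softmax-weighted average,
    so large activations dominate but smaller ones still contribute,
    unlike hard max pooling which discards them entirely."""
    h, w = len(x), len(x[0])
    out = []
    for i in range(0, h, k):
        row = []
        for j in range(0, w, k):
            region = [x[i + di][j + dj] for di in range(k) for dj in range(k)]
            m = max(region)                          # subtract max for numerical stability
            wts = [math.exp(v - m) for v in region]  # softmax weights over the window
            row.append(sum(wt * v for wt, v in zip(wts, region)) / sum(wts))
        out.append(row)
    return out

feat = [[1.0, 3.0],
        [0.0, 2.0]]
pooled = softpool2d(feat, k=2)   # one 2x2 window -> one value
```

A useful property to check: the SoftPool output always lies between the window's average (1.5 here) and its maximum (3.0), which is why it loses less feature information than max pooling while still emphasizing strong responses.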

    Abstract:

    Algorithms for the instance segmentation of urban street scenes can significantly improve the accuracy and efficiency of urban environment perception and intelligent transportation systems. To address mutual occlusion between pedestrians and vehicles and severe background interference in urban street scenes, this study proposes an instance segmentation model, FMInst, based on a frequency attention mechanism and multi-scale feature fusion. First, a high/low-frequency attention mechanism is constructed for interactive encoding to enrich high-resolution detail information. Second, a soft pooling operation is introduced into the Patch Merging layer of the Swin Transformer backbone to reduce the loss of feature information and effectively improve the segmentation of small-scale targets. Finally, multi-scale depthwise convolutions are constructed in combination with the MLP layer, which effectively enhances the extraction of local target information and improves segmentation accuracy. Comparative experiments on the public Cityscapes dataset show that FMInst reaches an mAP of 35.6%, an improvement of 1.2%, and an AP50 of 61.4%, an improvement of 2.2%, greatly improving the mask quality and segmentation performance of instance segmentation.
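The multi-scale depthwise convolution described above can be sketched in minimal form: each channel is filtered independently (the defining property of a depthwise convolution) at several kernel sizes, and the branches are fused by summation. This is a hypothetical illustration, not the paper's implementation; the kernel sizes (3, 5, 7), uniform averaging weights, 1-D sequence layout, and summation fusion are all assumptions made for clarity.

```python
def depthwise_conv1d(channel, k):
    """Depthwise 1D convolution for a single channel with a uniform
    averaging kernel of width k (a stand-in for learned weights),
    zero-padded so the output length matches the input length."""
    pad = k // 2
    padded = [0.0] * pad + channel + [0.0] * pad
    return [sum(padded[i + t] for t in range(k)) / k for i in range(len(channel))]

def multiscale_depthwise(x, sizes=(3, 5, 7)):
    """Fuse depthwise convolutions at several kernel sizes so that both
    narrow and wide local neighborhoods contribute to every position,
    strengthening local feature extraction across object scales."""
    out = []
    for channel in x:                 # each channel filtered independently
        branches = [depthwise_conv1d(channel, k) for k in sizes]
        out.append([sum(vals) for vals in zip(*branches)])
    return out

tokens = [[1.0] * 16 for _ in range(4)]   # (channels, sequence length)
fused = multiscale_depthwise(tokens)
```

Because channels never mix, the parameter and compute cost grows with the number of scales rather than with the channel count squared, which is what makes multi-scale depthwise branches cheap enough to attach to an MLP layer.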

Cite this article:

WANG Jun, LYU Jia, CHENG Yong. Instance segmentation of urban streetscape incorporating attention and multi-scale feature. Computer Systems & Applications, 2025, 34(1): 90-99. (in Chinese)
History
  • Received: 2024-06-24
  • Revised: 2024-07-18
  • Published online: 2024-11-28