Instances Segmentation of Urban Streetscape Incorporating Attention and Multi-scale Feature

doi:10.15888/j.cnki.csa.009740

Volume 34, Issue 1, 2025, pp. 90-99

Authors:

WANG Jun
School of Software, Nanjing University of Information Science & Technology, Nanjing 210044, China; Science and Technology Industries Division, Nanjing University of Information Science & Technology, Nanjing 210044, China

LYU Jia
School of Software, Nanjing University of Information Science & Technology, Nanjing 210044, China

CHENG Yong
School of Software, Nanjing University of Information Science & Technology, Nanjing 210044, China; Science and Technology Industries Division, Nanjing University of Information Science & Technology, Nanjing 210044, China

Abstract:

Instance segmentation algorithms for urban street scenes can significantly improve the accuracy and efficiency of urban environment perception and intelligent transportation systems. To address mutual occlusion between pedestrians and vehicles and strong background interference in urban street scenes, this study proposes FMInst, an instance segmentation model based on a frequency attention mechanism and multi-scale feature fusion. First, a high- and low-frequency attention mechanism is constructed for interactive encoding, which enriches high-resolution detail information. Second, a soft pooling operation is introduced into the Patch Merging layer of the Swin Transformer backbone network to reduce the loss of feature information and improve the segmentation of small-scale targets. Finally, multi-scale deep convolution is constructed in combination with an MLP layer, which strengthens the extraction of local information and improves segmentation accuracy. Comparison experiments on the public Cityscapes dataset show that FMInst reaches an mAP of 35.6%, an improvement of 1.2%, and an AP50 of 61.4%, an improvement of 2.2%. Both the mask quality and the segmentation results are greatly improved.
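
The abstract does not spell out how soft pooling is formulated or wired into the Patch Merging layer. As a rough illustration only, the sketch below assumes the commonly used softmax-weighted form of soft pooling over non-overlapping 2x2 windows; the function name `soft_pool_2x2` and the nested-list input are hypothetical and are not taken from FMInst.

```python
import math


def soft_pool_2x2(feature_map):
    """Downsample a 2D feature map by 2 using softmax-weighted 2x2 pooling.

    Assumed formulation (not confirmed by the paper): each output value is
    sum(w_i * x_i) over the window, with w_i = exp(x_i) / sum_j exp(x_j).
    Height and width must be even.
    """
    height, width = len(feature_map), len(feature_map[0])
    if height % 2 or width % 2:
        raise ValueError("height and width must be even")
    pooled = []
    for r in range(0, height, 2):
        row = []
        for c in range(0, width, 2):
            # Collect the four activations in the current 2x2 window.
            window = [feature_map[r + dr][c + dc] for dr in (0, 1) for dc in (0, 1)]
            peak = max(window)  # subtract the max for numerical stability
            weights = [math.exp(v - peak) for v in window]
            total = sum(weights)
            row.append(sum(w * v for w, v in zip(weights, window)) / total)
        pooled.append(row)
    return pooled
```

Under this assumed form, each output lies between the window's mean and its maximum, so strong activations dominate while weaker ones still contribute. That is one plausible reading of how the operation could reduce information loss for small targets during downsampling.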

Key words: urban streetscape; instance segmentation; frequency attention mechanism; multi-scale feature fusion; small target

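Citation:

WANG Jun, LYU Jia, CHENG Yong. Instances Segmentation of Urban Streetscape Incorporating Attention and Multi-scale Feature. Computer Systems & Applications, 2025, 34(1): 90-99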

History

Received: June 24, 2024
Revised: July 18, 2024
Online: November 28, 2024
