###

计算机系统应用英文版:2024,33(1):76-86

View/Add Comment 过刊浏览高级检索 HTML

←前一篇 | 后一篇→

码上扫一扫！

下载全文

交叉特征融合和RASPP驱动的场景分割方法

朱新杰¹, 熊风光^1,2,3, 谢帅康¹, 宋宁栋¹, 李文清¹

(1.中北大学计算机科学与技术学院, 太原 030051;2.中北大学山西省视觉信息处理及智能机器人工程研究中心, 太原 030051;3.中北大学机器视觉与虚拟现实山西省重点实验室, 太原 030051)

Cross Feature Fusion and RASPP Driven Scene Segmentation Method

ZHU Xin-Jie¹, XIONG Feng-Guang^1,2,3, XIE Shuai-Kang¹, SONG Ning-Dong¹, LI Wen-Qing¹

(1.School of Data Science and Technology, North University of China, Taiyuan 030051, China;2.Shanxi Province’s Vision Information Processing and Intelligent Robot Engineering Research Center, North University of China, Taiyuan 030051, China;3.Shanxi Key Laboratory of Machine Vision and Virtual Reality, North University of China, Taiyuan 030051, China)

摘要

图/表

参考文献

相似文献

本文已被：浏览 219次下载 561次
Received:June 28, 2023 Revised:August 08, 2023

中文摘要: 本文针对场景中目标多样性和尺度不统一等现象造成的边缘分割错误、特征不连续问题, 提出了一种交叉特征融合和RASPP驱动的场景分割方法. 该方法以交叉特征融合的方式合并编码器输出的多尺度特征, 在融合高层语义信息时使用复合卷积注意力模块进行处理, 避免上采样操作造成的特征信息丢失以及引入噪声的影响, 细化目标边缘分割效果. 同时提出了深度可分离残差卷积, 在此基础上设计并实现了结合残差的金字塔池化模块——RASPP, 对交叉融合后的特征进行处理, 获得不同尺度的上下文信息, 增强特征语义表达. 最后, 将RASPP模块处理后的特征进行合并, 提升分割效果. 在Cityscapes和CamVid数据集上的实验结果表明, 本文提出方法相比现有方法具有更好的表现, 并且对场景中的目标边缘有更好的分割效果.

中文关键词: 语义分割交叉特征融合金字塔池化注意力机制深度可分离卷积

Abstract:This study proposes a cross feature fusion and RASPP-driven scene segmentation method to address the edge segmentation errors and feature discontinuity caused by target diversity and scale inconsistency in the scenes. This method combines the multi-scale features output by the encoder in the way of cross feature fusion and employs the compound convolution attention module to process high-level semantic information fusion. As a result, this avoids the feature information loss caused by the upsampling operation and the influence of noise and refines the segmentation effect of target edges. Meanwhile, this study proposes a depthwise separable convolution combining residual connections. Based on this, a pyramid pooling module RASPP combining residuals is designed and implemented to process the features after cross fusion, obtain contextual information at different scales, and enhance feature semantic expression. Finally, the features processed by the RASPP module are merged to improve the segmentation effect. The experimental results on the Cityscapes and CamVid datasets show that the proposed method outperforms existing methods and has better segmentation performance on target edges in the scenes.

keywords: semantic segmentation cross feature fusion pyramid pooling attention mechanism depthwise separable convolution

文章编号： 中图分类号： 文献标志码：

基金项目:国家自然科学基金(62272426); 山西省回国留学人员科研基金(2020-113); 山西省科技成果转化引导专项基金(202104021301055); 山西省科技重大专项计划“揭榜挂帅”基金(202201150401021); 山西省自然科学基金(202203021212138, 202303021211153, 202203021222027)

Author Name	Affiliation	E-mail
ZHU Xin-Jie	School of Data Science and Technology, North University of China, Taiyuan 030051, China
XIONG Feng-Guang	School of Data Science and Technology, North University of China, Taiyuan 030051, China Shanxi Province’s Vision Information Processing and Intelligent Robot Engineering Research Center, North University of China, Taiyuan 030051, China Shanxi Key Laboratory of Machine Vision and Virtual Reality, North University of China, Taiyuan 030051, China	hopenxfg@nuc.edu.cn
XIE Shuai-Kang	School of Data Science and Technology, North University of China, Taiyuan 030051, China
SONG Ning-Dong	School of Data Science and Technology, North University of China, Taiyuan 030051, China
LI Wen-Qing	School of Data Science and Technology, North University of China, Taiyuan 030051, China

Author Name	Affiliation	E-mail
ZHU Xin-Jie	School of Data Science and Technology, North University of China, Taiyuan 030051, China
XIONG Feng-Guang	School of Data Science and Technology, North University of China, Taiyuan 030051, China Shanxi Province’s Vision Information Processing and Intelligent Robot Engineering Research Center, North University of China, Taiyuan 030051, China Shanxi Key Laboratory of Machine Vision and Virtual Reality, North University of China, Taiyuan 030051, China	hopenxfg@nuc.edu.cn
XIE Shuai-Kang	School of Data Science and Technology, North University of China, Taiyuan 030051, China
SONG Ning-Dong	School of Data Science and Technology, North University of China, Taiyuan 030051, China
LI Wen-Qing	School of Data Science and Technology, North University of China, Taiyuan 030051, China

引用文本：
朱新杰,熊风光,谢帅康,宋宁栋,李文清.交叉特征融合和RASPP驱动的场景分割方法.计算机系统应用,2024,33(1):76-86
ZHU Xin-Jie,XIONG Feng-Guang,XIE Shuai-Kang,SONG Ning-Dong,LI Wen-Qing.Cross Feature Fusion and RASPP Driven Scene Segmentation Method.COMPUTER SYSTEMS APPLICATIONS,2024,33(1):76-86