Abstract: Semantic segmentation is a challenging task owing to the complexity of scene parsing, the diversity of objects to be segmented, and the variation in objects' spatial positions. To address these challenges, this paper proposes a novel architecture, the double-branch and multi-stage network (DBMSNet), based on dense dilated convolution. First, four feature maps (De1, De2, De3, and De4) with different resolutions are extracted by the backbone network, and the feature refinement (FR) module then outputs refined maps of De1 and De3. Second, the FR output branch is processed by the mixed dilation module (MDM) to extract rich spatial location features, while the De4 branch is processed by the pyramid pooling module (PPM) to extract multi-scale semantic information. Finally, the two branches are fused and the segmentation result is output. Comprehensive experiments are conducted on two public datasets, CelebAMask-HQ and Cityscapes, on which our model achieves mean intersection-over-union (mIoU) scores of 74.64% and 78.29%, respectively. The results show that the proposed method achieves higher segmentation accuracy than counterpart methods while using fewer parameters.