基于改进Deeplab V3+网络的语义分割

doi:10.15888/j.cnki.csa.007541

AIPUB归智期刊联盟

微信公众号

网站二维码

2025年7月27日 22:05 星期日

首页 > 过刊浏览>2020年第29卷第9期 >178-183. DOI:10.15888/j.cnki.csa.007541

PDF HTML阅读 XML下载导出引用引用提醒

基于改进Deeplab V3+网络的语义分割
DOI:
                        10.15888/j.cnki.csa.007541
                    
CSTR:
                        
                    
作者:
                        席一帆席一帆
长安大学 信息工程学院, 西安 710064
在期刊界中查找
在百度中查找
在本站中查找
孙乐乐孙乐乐
长安大学 信息工程学院, 西安 710064
在期刊界中查找
在百度中查找
在本站中查找
何立明何立明
长安大学 信息工程学院, 西安 710064
在期刊界中查找
在百度中查找
在本站中查找
吕悦吕悦
长安大学 信息工程学院, 西安 710064
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:

Semantic Segmentation Based on Improved Deeplab V3+ Network

Author:

XI Yi-Fan
XI Yi-Fan
School of Information Engineering, Chang’an University, Xi’an 710064, China
在期刊界中查找
在百度中查找
在本站中查找
SUN Le-Le
SUN Le-Le
School of Information Engineering, Chang’an University, Xi’an 710064, China
在期刊界中查找
在百度中查找
在本站中查找
HE Li-Ming
HE Li-Ming
School of Information Engineering, Chang’an University, Xi’an 710064, China
在期刊界中查找
在百度中查找
在本站中查找
LYU Yue
LYU Yue
School of Information Engineering, Chang’an University, Xi’an 710064, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献 [17]

相似文献 [20]

引证文献

资源附件

文章评论

摘要:

深度学习的语义分割在计算机视觉领域中有非常广阔的发展前景，但许多分割效果较好网络模型占用内存大和处理单张图片耗时长.针对这个问题，把Deeplab V3+模型的骨干网（ResNet101）的瓶颈单元设计为1D非瓶颈单元，且对空洞空间金字塔池化模块（Atrous Spatial Pyramid Pooling，ASPP）的卷积层进行分解.该算法能大幅度降低Deeplab V3+网络的参数量，提高网络推理速度.基于PASCAL VOC 2012数据集进行对比实验，实验结果显示改进网络模型拥有更快的处理速度和更优的分割效果，且消耗更少的内存.

关键词:语义分割;Deeplab V3+模型;骨干网(ResNet101);1D非瓶颈单元;空洞空间金字塔池化(ASPP)

Abstract:

Semantic segmentation of deep learning has a very broad development prospect in the field of computer vision, but many network models with better segmentation effects take up a lot of memory and take a long time to process a single picture. In response to this problem, we replace the bottleneck unit of the Deeplab V3+ model backbone network (ResNet101) with a 1D non-bottleneck unit, and decompose the convolutional layer of the Atrous Spatial Pyramid Pooling (ASPP) module. The algorithm can greatly reduce the parameter amount of Deeplab V3+ network and accelerate the speed of network inference. Based on the PASCAL VOC 2012 dataset, the experimental results show that the improved network model has faster speed and better segmentation, and takes up less memory space.

Key words:semantic segmentation;Deeplab V3+ model;backbone network (ResNet101);1D non-bottleneck unit;Atrous Spatial Pyramid Pooling (ASPP)

参考文献

[1] 计梦予, 袭肖明, 于治楼. 基于深度学习的语义分割方法综述. 信息技术与信息化, 2017, (10): 137-140. [doi: 10.3969/j.issn.1672-9528.2017.10.037

[2] 肖朝霞, 陈胜. 图像语义分割问题研究综述. 软件导刊, 2018, 17(8): 6-8, 12

[3] Arbeláez P, Hariharan B, Gu CH, et al. Semantic segmentation using regions and parts. Proceedings of 2012 IEEE Conference on Computer Vision and Pattern Recognition. Providence, RI, USA. 2012. 3378-3385.

[4] Lu ZW, Fu ZY, Xiang T, et al. Learning from weak and noisy labels for semantic segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(3): 486-500. [doi: 10.1109/TPAMI.2016.2552172

[5] Shelhamer E, Long J, Darrell T. Fully convolutional networks for semantic segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(4): 640-651. [doi: 10.1109/TPAMI.2016.2572683

[6] Ronneberger O, Fischer P, Brox T. U-Net: Convolutional networks for biomedical image segmentation. Proceedings of the 18th International Conference on Medical Image Computing and Computer-assisted Intervention. Munich, Germany. 2015. 234-241.

[7] Badrinarayanan V, Kendall A, Cipolla R. SegNet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12): 2481-2495. [doi: 10.1109/TPAMI.2016.2644615

[8] de Oliveira Junior LA, Medeiros HR, Macêdo D, et al. SegNetRes-CRF: A deep convolutional encoder-decoder architecture for semantic image segmentation. Proceedings of 2018 International Joint Conference on Neural Networks. Rio de Janeiro, Brazil. 2018. 1-6.

[9] Zhao HS, Shi JP, Qi XJ, et al. Pyramid scene parsing network. Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition. Honolulu, HI, USA. 2017. 6230-6239.

[10] Lin GS, Milan A, Shen CH, et al. RefineNet: Multi-path refinement networks for high-resolution semantic segmentation. Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition. Honolulu, HI, USA. 2017. 5168-5177.

[11] Chen LC, Papandreou G, Kokkinos I, et al. Semantic image segmentation with deep convolutional nets and fully connected CRFs. Proceedings of the 3rd International Conference on Learning Representations. San Diego, CA, USA. 2014. 357-361.

[12] Chen LC, Papandreou G, Kokkinos I, et al. DeepLab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 40(4): 834-848. [doi: 10.1109/TPAMI.2017.2699184

[13] Chen LC, Papandreou G, Schroff F, et al. Rethinking atrous convolution for semantic image segmentation. arXiv: 1706.05587, 2017.

[14] Chen LC, Zhu YK, Papandreou G, et al. Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Ferrari V, Hebert M, Sminchisescu C, et al, eds. Computer Vision (ECCV 2018). Cham: Springer, 2018. 833-851.

[15] Alvarez J, Petersson L. DecomposeMe: Simplifying ConvNets for end-to-end learning. arXiv: 1606.05426, 2016.

[16] Na T, Mukhopadhyay S. Speeding up convolutional neural network training with dynamic precision scaling and flexible multiplier-accumulator. Proceedings of 2016 International Symposium on Low Power Electronics and Design. San Francisco, CA, USA. 2016. 58-63.

[17] Sironi A, Tekin B, Rigamonti R, et al. Learning separable filters. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(1): 94-106. [doi: 10.1109/TPAMI.2014.2343229

引用本文

席一帆,孙乐乐,何立明,吕悦.基于改进Deeplab V3+网络的语义分割.计算机系统应用,2020,29(9):178-183

复制

文章指标

点击次数:1689
下载次数: 5087
HTML阅读次数: 3356
引用次数: 0

历史

收稿日期:2019-12-12
最后修改日期:2020-02-08
录用日期:
在线发布日期: 2020-09-07
出版日期: 2020-09-15

微信公众号

网站二维码

引用本文

相关视频

分享

文章指标

历史

文章二维码

微信公众号

网站二维码

引用本文

相关视频

分享

微信扫一扫：分享

文章指标

历史

文章二维码