Bi-branch Remote Sensing Cloud and Shadow Detection Network Based on ViT-D-UNet
    Abstract:

    Effective segmentation of clouds and their shadows is a critical problem in remote sensing image processing, playing a significant role in surface feature extraction, climate monitoring, atmospheric correction, and related tasks. However, clouds and cloud shadows in remote sensing images exhibit complex characteristics: diverse, irregular distributions and fuzzy boundaries that are easily confused with the background, which makes accurate feature extraction challenging. Moreover, few networks are designed specifically for this task. To address these issues, this study proposes a dual-branch network combining a vision Transformer (ViT) and D-UNet. The network is divided into two branches: one is a convolutional local-feature extraction branch built on the dilated convolution module of D-UNet, which introduces multi-scale atrous spatial pyramid pooling (ASPP) to extract multi-dimensional features; the other captures global contextual semantics through the vision Transformer, enhancing feature extraction. Finally, a feature fusion decoder performs the upsampling. The model achieves superior performance on both a self-built cloud and cloud shadow dataset and the publicly available HRC_WHU dataset, leading the second-best model by 0.52% and 0.44% in MIoU and reaching 92.05% and 85.37%, respectively.
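    The ASPP idea used in the convolutional branch can be sketched in plain NumPy: parallel 3×3 convolutions whose taps are sampled at different dilation rates over the same feature map, stacked as output channels. This is a minimal illustrative sketch, not the paper's implementation; the function names and padding scheme are assumptions.

```python
import numpy as np

def dilated_conv2d(x, kernel, rate):
    """Valid-mode 2D convolution with a dilated 3x3 kernel.
    Effective receptive field is (1 + 2*rate) in each dimension."""
    H, W = x.shape
    eff = 1 + 2 * rate  # extent of the dilated 3x3 kernel
    out = np.zeros((H - eff + 1, W - eff + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            patch = x[i:i + eff:rate, j:j + eff:rate]  # 3x3 taps with gaps
            out[i, j] = np.sum(patch * kernel)
    return out

def aspp(x, kernels, rates):
    """Run parallel dilated convolutions at several rates and
    stack the results as channels: (num_rates, H, W)."""
    branches = []
    for k, r in zip(kernels, rates):
        xp = np.pad(x, r, mode="constant")  # pad so each branch keeps H x W
        branches.append(dilated_conv2d(xp, k, r))
    return np.stack(branches, axis=0)
```

    Because the padding equals the dilation rate, every branch preserves the spatial size, so the multi-rate responses can be concatenated directly, which is the property the fusion decoder relies on.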

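    The global branch's core mechanism, self-attention over image patches, can be sketched as a single-head toy version in NumPy. This assumes a square single-channel input and uses plain matrices `Wq`, `Wk`, `Wv` to stand in for learned projections; it is a conceptual sketch, not the network's actual ViT.

```python
import numpy as np

def patchify(img, p):
    """Split an HxW image into non-overlapping p x p patches,
    flattened row-major into a token matrix (num_patches, p*p)."""
    H, W = img.shape
    return (img.reshape(H // p, p, W // p, p)
               .transpose(0, 2, 1, 3)
               .reshape(-1, p * p))

def self_attention(tokens, Wq, Wk, Wv):
    """Single-head scaled dot-product attention: every patch token
    attends to every other token, giving each output global context."""
    Q, K, V = tokens @ Wq, tokens @ Wk, tokens @ Wv
    scores = Q @ K.T / np.sqrt(K.shape[1])
    w = np.exp(scores - scores.max(axis=1, keepdims=True))
    w /= w.sum(axis=1, keepdims=True)   # softmax over tokens
    return w @ V
```

    As a sanity check, with zero query/key projections the attention weights are uniform, so every output token becomes the mean of all value tokens; even this degenerate case shows how attention mixes information across the whole image, which is what lets the ViT branch model long-range cloud structure that local convolutions miss.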
Get Citation

Li YL, Wang JX, Fan XT, Zhou X, Wu MX. Bi-branch remote sensing cloud and shadow detection network based on ViT-D-UNet. Computer Systems & Applications, 2024, 33(8): 68–77. (in Chinese)
History
  • Received: February 28, 2024
  • Revised: March 28, 2024
  • Online: June 28, 2024
Copyright: Institute of Software, Chinese Academy of Sciences