###

计算机系统应用英文版:2024,33(8):68-77

View/Add Comment 过刊浏览高级检索 HTML

←前一篇 | 后一篇→

码上扫一扫！

下载全文

基于ViT-D-UNet的双分支遥感云影检测网络

李远禄^1,2, 王键翔¹, 范小婷¹, 周昕¹, 吴明轩¹

(1.南京信息工程大学自动化学院, 南京 210044;2.江苏省大气环境与装备技术协同创新中心, 南京 210044)

Bi-branch Remote Sensing Cloud and Shadow Detection Network Based on ViT-D-UNet

LI Yuan-Lu^1,2, WANG Jian-Xiang¹, FAN Xiao-Ting¹, ZHOU Xin¹, WU Ming-Xuan¹

(1.School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044, China;2.Jiangsu Collaborative Innovation Center of Atmospheric Environment and Equipment Technology, Nanjing 210044, China)

摘要

图/表

参考文献

相似文献

本文已被：浏览 196次下载 598次
Received:February 28, 2024 Revised:March 28, 2024

中文摘要: 云及其阴影的有效分割是遥感图像处理领域中重要的问题, 它对于地表特征提取、气候检测、大气校正等有很大帮助. 然而云和云影遥感图像特征复杂, 云分布多样不规则, 且边界信息模糊易受背景干扰等特点, 导致其特征难以准确提取, 也少有专门为其设计的网络. 针对以上问题, 本文提出一种ViT (vision Transformer)和D-UNet双路网络. 本文网络分为两个分支: 一路是基于卷积的局部特征提取模块, 在D-UNet的膨胀卷积模块基础上, 引入深度可分离卷积, 提取多尺度特征的同时, 减少参数; 另一路通过ViT在全局上理解上下文语义, 加深对整体特征提取. 两支路间存在信息交互, 完善提取的特征信息. 最后通过独特设计的融合特征解码器, 进行上采样, 减少信息丢失. 模型在自建的云和云影数据集以及HRC_WHU公开数据集上取得优越的性能, 在MIoU指标上分别领先次优模型0.52%和0.44%, 达到了92.05%和85.37%.

中文关键词: 遥感云检测语义分割特征融合

Abstract:Effective segmentation of clouds and their shadows is a critical issue in the field of remote sensing image processing. It plays a significant role in surface feature extraction, climate detection, atmospheric correction, and more. However, the complex features of clouds and cloud shadows in remote sensing images, characterized by their diverse, irregular distributions and fuzzy boundary information that is easily disturbed by the background, make accurate feature extraction challenging. Moreover, there are few networks specifically designed for this task. To address these issues, this study proposes a dual-path network combining vision Transformer (ViT) and D-UNet. The network is divided into two branches: one is a convolutional local feature extraction module based on the dilated convolution module of D-UNet, which introduces a multi-scale atrous spatial pyramid pooling (ASPP) to extract multi-dimensional features; the other branch comprehends the context semantics globally through the vision Transformer, enhancing feature extraction. Finally, the study performs an upsampling through a feature fusion decoder. The model achieves superior performance on both a self-built dataset of clouds and cloud shadows and the publicly available HRC_WHU dataset, leading the second-best model by 0.52% and 0.44% in the MIoU metric, achieving 92.05% and 85.37%, respectively.

keywords: remote sensing cloud detection semantic segmentation feature fusion

文章编号： 中图分类号： 文献标志码：

基金项目:国家自然科学基金(61671010)

引用文本：
李远禄,王键翔,范小婷,周昕,吴明轩.基于ViT-D-UNet的双分支遥感云影检测网络.计算机系统应用,2024,33(8):68-77
LI Yuan-Lu,WANG Jian-Xiang,FAN Xiao-Ting,ZHOU Xin,WU Ming-Xuan.Bi-branch Remote Sensing Cloud and Shadow Detection Network Based on ViT-D-UNet.COMPUTER SYSTEMS APPLICATIONS,2024,33(8):68-77

Author Name	Affiliation	E-mail
LI Yuan-Lu	School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044, China Jiangsu Collaborative Innovation Center of Atmospheric Environment and Equipment Technology, Nanjing 210044, China
WANG Jian-Xiang	School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044, China	202212490003@nuist.edu.cn
FAN Xiao-Ting	School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044, China
ZHOU Xin	School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044, China
WU Ming-Xuan	School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044, China