用于3D器官图像分割的波随机自注意力编码器

doi:10.15888/j.cnki.csa.009768

AIPUB归智期刊联盟

微信公众号

网站二维码

2025年4月9日 21:53 星期三

首页 > 过刊浏览>2025年第34卷第2期 >84-91. DOI:10.15888/j.cnki.csa.009768

PDF HTML阅读 XML下载导出引用引用提醒

用于3D器官图像分割的波随机自注意力编码器
DOI:
                        10.15888/j.cnki.csa.009768
                    
CSTR:
                        
                    
作者:
                        周迪周迪
青岛科技大学 数据科学学院, 青岛 266061
在期刊界中查找
在百度中查找
在本站中查找
刘豪刘豪
青岛科技大学 信息科学技术学院, 青岛 266061
在期刊界中查找
在百度中查找
在本站中查找
程远志程远志
青岛科技大学 信息科学技术学院, 青岛 266061;哈尔滨工业大学 计算机科学与技术学院, 哈尔滨 150001
在期刊界中查找
在百度中查找
在本站中查找
李辉李辉
青岛科技大学 数据科学学院, 青岛 266061
在期刊界中查找
在百度中查找
在本站中查找
刘晓亚刘晓亚
青岛科技大学 数据科学学院, 青岛 266061
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家重点研发计划(2023YFF0612102); 青岛市重点科技攻关及产业化示范项目(23-7-2-qljh-4-gx, 24-1-2-qljh-19-gx)

Wave Random Self-attention Encoder for 3D Organ Image Segmentation

Author:

ZHOU Di
ZHOU Di
School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China
在期刊界中查找
在百度中查找
在本站中查找
LIU Hao
LIU Hao
School of Information Science and Technology, Qingdao University of Science and Technology, Qingdao 266061, China
在期刊界中查找
在百度中查找
在本站中查找
CHENG Yuan-Zhi
CHENG Yuan-Zhi
School of Information Science and Technology, Qingdao University of Science and Technology, Qingdao 266061, China;School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
在期刊界中查找
在百度中查找
在本站中查找
LI Hui
LI Hui
School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China
在期刊界中查找
在百度中查找
在本站中查找
LIU Xiao-Ya
LIU Xiao-Ya
School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

在光谱三维CT数据中, 传统卷积的全局特征捕捉能力不足, 而全尺度的自注意力机制则需要大量的计算资源. 为了解决这一问题, 本文引入一种新视觉注意力范式(wave self-attention, WSA). 相比于ViT技术, 该机制使用更少的资源获得同等的自注意力信息. 此外, 为更充分地提取器官间的相对依赖关系并提高模型的鲁棒性和执行速度, 本文为WSA机制设计了一种即插即用的模块——波随机编码器(wave random encoder, WRE). 该编码器能够生成一对互逆的非对称全局(局部)位置信息矩阵. 其中, 全局位置矩阵用来对波特征进行全局性的随机取样, 局部位置矩阵则用于补充因随机取样而丢失的局部相对依赖. 本文在标准数据集Synapse和COVID-19的肾脏和肺实质的分割任务上进行实验. 结果表明, 本文方法在精度、参数量和推理速率方面均超越了nnFormer、Swin-UNETR等现有模型, 达到了SOTA水平.

关键词:医学影像;图像分割;波自注意力机制;波随机编码器

Abstract:

In spectral 3D CT data, the traditional convolution has a poor ability to capture global features, and the full-scale self-attention mechanism consumes large resources. To solve this problem, this study introduces a new visual attention paradigm, the wave self-attention (WSA). Compared with the ViT technology, this mechanism uses fewer resources to obtain the same amount of self-attention information. In addition, to more adequately extract the relative dependency among organs and to improve the robustness and execution speed of the model, a plug-and-play module, the wave random-encoder (WRE), is designed for the WSA mechanism. The encoder is capable of generating a pair of mutually inverse asymmetric global (local) position information matrices. The global position matrix is used to globally conduct random sampling of the wave features, and the local position matrix is used to complement the local relative dependency lost due to random sampling. In this study, experiments are performed on the task of segmenting the kidney and lung parenchyma in the standard datasets Synapse and COVID-19. The results show that this method outperforms existing models such as nnFormer and Swin-UNETR in terms of accuracy, the number of parameters, and inference rate, arriving at the SOTA level.

Key words:medical imaging;image segmentation;wave self-attention (WSA) mechanism;wave random encoder (WRE)

引用本文

周迪,刘豪,程远志,李辉,刘晓亚.用于3D器官图像分割的波随机自注意力编码器.计算机系统应用,2025,34(2):84-91

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2024-07-13
最后修改日期:2024-08-13
录用日期:
在线发布日期: 2024-12-19
出版日期:

微信公众号

网站二维码

引用本文

分享

文章指标

历史

文章二维码

微信公众号

网站二维码

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码