MCCNET: Feature-enhanced Dual-branch Multi-organ Segmentation Image Model
Author:
  • Article
  • | |
  • Metrics
  • |
  • Reference [27]
  • | |
  • Cited by
  • | |
  • Comments
    Abstract:

    To address the challenges in multi-organ segmentation of abdominal CT images, such as varying organ sizes and shapes, difficulties in distinguishing boundaries between adjacent organs, and low contrast, this study proposes a feature-enhanced dual-branch multi-organ segmentation model. The model adopts an encoder-decoder architecture, with a master-slave dual-branch structure in the encoder. The master branch leverages Mamba to capture global dependencies among organs, while the slave branch employs CNN to hierarchically extract local features of multiple organs. A cascade context module is introduced to transfer detailed local features from the slave branch to the master branch. In the decoder, a multi-scale feature fusion module integrates cross-level feature information to enhance boundary sharpness in multi-organ segmentation, and a deep feature enhancement module applies a cross-attention mechanism to improve the contrast between organ foregrounds and backgrounds, mitigating the interference of background noise. Experimental results on two public datasets, Synapse and ACDC, demonstrate that the proposed model achieves notable improvements in Dice and HD95 indexes compared to recent baseline models.

    Reference
    [1] Fu YB, Lei Y, Wang TH, et al. A review of deep learning based methods for medical image multi-organ segmentation. Physica Medica, 2021, 85: 107–122.
    [2] Zhou HY, Guo J, Zhang Y, et al. nnFormer: Volumetric medical image segmentation via a 3D Transformer. IEEE Transactions on Image Processing, 2023, 32: 4036–4045.
    [3] Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need. Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach: Curran Associates Inc., 2017. 6000–6010.
    [4] Gu A, Dao T. Mamba: Linear-time sequence modeling with selective state spaces. arXiv:2312.00752, 2023.
    [5] Xu R, Yang S, Wang YH, et al. Visual Mamba: A survey and new outlooks. arXiv:2404.18861, 2024.
    [6] Zhang DW, Huang GH, Zhang Q, et al. Cross-modality deep feature learning for brain tumor segmentation. Pattern Recognition, 2021, 110: 107562.
    [7] Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. Boston: IEEE, 2015. 3431–3440.
    [8] Ronneberger O, Fischer P, Brox T. U-Net: Convolutional networks for biomedical image segmentation. Proceedings of the 18th International Conference. Munich: Springer, 2015. 234–241.
    [9] Zhou ZW, Siddiquee MMR, Tajbakhsh N, et al. UNet++: A nested U-Net architecture for medical image segmentation. Proceedings of the 4th International Workshop. Granada: Springer, 2018. 3–11.
    [10] Oktay O, Schlemper J, Le Folgoc L, et al. Attention U-Net: Learning where to look for the pancreas. arXiv:1804.03999, 2018.
    [11] Ruan JC, Xie MY, Gao JS, et al. Ege-UNet: An efficient group enhanced UNet for skin lesion segmentation. Proceedings of the 26th International Conference. Vancouver: Springer, 2023. 481–490.
    [12] Dosovitskiy A, Beyer L, Kolesnikov A, et al. An image is worth 16x16 words: Transformers for image recognition at scale. Proceedings of the 9th International Conference on Learning Representations. 2021.
    [13] Liu Z, Lin YT, Cao Y, et al. Swin Transformer: Hierarchical vision Transformer using shifted windows. Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision. Montreal: IEEE, 2021. 9992–10002.
    [14] Cao H, Wang YY, Chen J, et al. Swin-Unet: Unet-like pure Transformer for medical image segmentation. Proceedings of the 2022 European Conference on Computer Vision. Tel Aviv: Springer, 2022. 205–218.
    [15] Chen JN, Lu YY, Yu QH, et al. TransUNet: Transformers make strong encoders for medical image segmentation. arXiv:2102.04306, 2021.
    [16] Guo JS, Zhou HY, Wang LS, et al. UNet-2022: Exploring dynamics in non-isomorphic architecture. Proceedings of the 2022 International Conference on Medical Imaging and Computer-aided Diagnosis. Leicester: Springer, 2022. 465–476.
    [17] Zhu LH, Liao BC, Zhang Q, et al. Vision Mamba: Efficient visual representation learning with bidirectional state space model. Proceedings of the 41st International Conference on Machine Learning. Vienna, 2024.
    [18] Liu Y, Tian YJ, Zhao YZ, et al. VMamba: Visual state space model. Proceedings of the 38th Annual Conference on Neural Information Processing Systems. Vancouver, 2024.
    [19] Ma J, Li FF, Wang B. U-Mamba: Enhancing long-range dependency for biomedical image segmentation. arXiv:2401.04722, 2024.
    [20] Xing ZH, Ye T, Yang YJ, et al. Segmamba: Long-range sequential modeling mamba for 3D medical image segmentation. Proceedings of the 27th International Conference. Marrakesh: Springer, 2024. 578–588.
    [21] Ruan JC, Li JC, Xiang SC. VM-UNet: Vision mamba UNet for medical image segmentation. arXiv:2402.02491, 2024.
    [22] Gu A, Goel K, Ré C. Efficiently modeling long sequences with structured state spaces. Proceedings of the 10th International Conference on Learning Representations. OpenReview.net, 2022.
    [23] He KM, Zhang XY, Ren SQ, et al. Deep residual learning for image recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016. 770–778.
    [24] Fu SH, Lu YY, Wang Y, et al. Domain adaptive relational reasoning for 3D multi-organ segmentation. Proceedings of the 23rd International Conference. Lima: Springer, 2020. 656–666.
    [25] Schlemper J, Oktay O, Schaap M, et al. Attention gated networks: Learning to leverage salient regions in medical images. Medical Image Analysis, 2019, 53: 197–207.
    [26] Wang HY, Xie SA, Lin LF, et al. Mixed Transformer U-Net for medical image segmentation. Proceedings of the 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Singapore: IEEE, 2022. 2390–2394.
    [27] Rahman MM, Marculescu R. Medical image segmentation via cascaded attention decoding. Proceedings of the 2023 IEEE/CVF Winter Conference on Applications of Computer Vision. Waikoloa: IEEE, 2023. 6211–6220.
    Related
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

郭俊林,陈平华,陈一嘉,詹晗晖. MCCNET: 特征增强的双分支多器官图像分割模型.计算机系统应用,,():1-12

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:November 27,2024
  • Revised:December 17,2024
  • Online: March 24,2025
Article QR Code
You are the first991371Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address:4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code:100190
Phone:010-62661041 Fax: Email:csa (a) iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063