基于RandLA-CGNet的大规模室内点云语义分割
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

国家重点研发计划 (2024YFD2402205); 河北省高等学校科学技术研究项目 (QN2025371)


RandLA-CGNet for Large-scale Indoor Point Cloud Semantic Segmentation
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    随着数字孪生虚拟现实技术的应用越来越广泛, 针对大规模室内建筑点云语义分割中整体精度有限、小物体识别精度低及边界分割模糊等问题, 提出一种大规模室内点云语义分割的方法RandLA-CGNet. 在编码层中构建局部-全局上下文融合(local-global context fusion, LGCF)模块, 在保留局部邻域信息的同时融入整体上下文语义; 在解码层设计范数门控通道特征(norm-gated channel feature, NGCF)模块, 通过对网络特征图的通道维度进行自适应重标定, 增强有用信息、抑制冗余噪声, 增强对细节和边界的敏感性, 提高模型的精细化识别能力; 最后采用融合型损失函数(focused cross-entropy loss, FCE loss), 在保证模型对大多数样本稳定收敛和整体精度的同时, 增加对难分样本与少数类样本的关注, 从而提升模型在边界区域和稀有类别上的分割性能. 实验结果表明, 本文提出的模型在 S3DIS 数据集上经六折交叉验证OAmAccmIoU分别提升至88.8%、83.4%和71.9%, 较基准模型分别提高0.8%、1.4%和1.9%. 与主流算法相比, 较LG-Net分别提升0.5%、1.0%和1.1%, 总体精度以及平均交并比较FGC-AF提升0.2%和0.7%. RandLA-CGNet 在保持整体性能优势的同时, 对小物体以及边界细节分割的 IoU 提升了1%–6%, 有效提升对低频类别与复杂边界的识别能力, 为点云语义分割任务中少样本类别与细节边界的精准建模提供有效解决方案.

    Abstract:

    As the digital twin VR technology is increasingly widely applied, a method named RandLA-CGNet for large-scale indoor point cloud semantic segmentation is proposed to solve the problems such as the limited overall accuracy, low recognition accuracy for small objects, and blurred boundary segmentation in point cloud semantic segmentation of large-scale indoor buildings. In the encoder layer, a local-global context fusion (LGCF) module is constructed, preserving local neighborhood information while incorporating global contextual semantics. In the decoder layer, a norm-gated channel feature (NGCF) module is designed, which performs the adaptive recalibration of feature maps along the channel dimension to enhance useful information and suppress redundant noise, thereby enhancing sensitivity to details and boundaries, and improving the model’s refined recognition capability. Finally, focused cross-entropy loss (FCE loss), a hybrid loss function, is adopted to ensure stable convergence for the majority of samples and maintain overall accuracy. Additionally, this function increases the focus on hard samples and minority class samples, thereby enhancing the model’s segmentation performance in boundary regions and for rare classes. Experimental results show that the proposed model on the S3DIS dataset by employing 6-fold cross-validation increases OA, mAcc, and mIoU to 88.8%, 83.4%, and 71.9% respectively, an improvement of 0.8%, 1.4%, and 1.9% respectively compared with the baseline models. Compared to mainstream algorithms, it increases LG-Net by 0.5%, 1.0%, and 1.1% respectively, with the overall accuracy and mean intersection of union (IoU) 0.2% and 0.7% higher than FGC-AF respectively. While maintaining overall performance advantages, RandLA-CGNet improves the IoU for small objects and boundary detail segmentation by 1%–6%, significantly enhancing the recognition capability for low-frequency classes and complex boundaries. Finally, an effective solution is provided for the precise modeling of few-sample classes and detail boundaries in point cloud semantic segmentation tasks.

    参考文献
    相似文献
    引证文献
引用本文

王建超,王浩雨,苏鹤,王震洲,张丹.基于RandLA-CGNet的大规模室内点云语义分割.计算机系统应用,2026,35(2):175-186

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-07-28
  • 最后修改日期:2025-09-19
  • 录用日期:
  • 在线发布日期: 2025-12-29
  • 出版日期:
文章二维码
您是第位访问者
版权所有:中国科学院软件研究所 京ICP备05046678号-3
地址:北京市海淀区中关村南四街4号,邮政编码:100190
电话:010-62661041 传真: Email:csa@iscas.ac.cn
技术支持:北京勤云科技发展有限公司

京公网安备 11040202500063号