杂乱场景中多尺度注意力特征融合抓取检测网络

doi:10.15888/j.cnki.csa.009500

AIPUB归智期刊联盟

微信公众号

网站二维码

2025年4月23日 21:14 星期三

首页 > 过刊浏览>2024年第33卷第5期 >76-84. DOI:10.15888/j.cnki.csa.009500

PDF HTML阅读 XML下载导出引用引用提醒

杂乱场景中多尺度注意力特征融合抓取检测网络
DOI:
                        10.15888/j.cnki.csa.009500
                    
CSTR:
                        32024.14.csa.009500
                    
作者:
                        徐衍徐衍
武汉科技大学 计算机科学与技术学院, 武汉 430081;武汉科技大学 智能信息处理与实时工业系统湖北省重点实验室, 武汉 430081
在期刊界中查找
在百度中查找
在本站中查找
林云汉林云汉
武汉科技大学 计算机科学与技术学院, 武汉 430081;武汉科技大学 智能信息处理与实时工业系统湖北省重点实验室, 武汉 430081;武汉科技大学 机器人与智能系统研究院, 武汉 430081
在期刊界中查找
在百度中查找
在本站中查找
闵华松闵华松
武汉科技大学 机器人与智能系统研究院, 武汉 430081
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家重点研发计划(2022YFB4700400); 国家自然科学基金(62073249)

Grasping Detection Network of Multi-scale Attention Feature Fusion in Cluttered Scenes

Author:

XU Yan
XU Yan
School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430081, China;Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System, Wuhan University of Science and Technology, Wuhan 430081, China
在期刊界中查找
在百度中查找
在本站中查找
LIN Yun-Han
LIN Yun-Han
School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430081, China;Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System, Wuhan University of Science and Technology, Wuhan 430081, China;Institute of Robotics and Intelligent Systems (IRIS), Wuhan University of Science and Technology, Wuhan 430081, China
在期刊界中查找
在百度中查找
在本站中查找
MIN Hua-Song
MIN Hua-Song
Institute of Robotics and Intelligent Systems (IRIS), Wuhan University of Science and Technology, Wuhan 430081, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

GSNet使用抓取度区分杂乱场景的可抓取区域, 显著地提高了杂乱场景中机器人抓取位姿检测准确性, 但是GSNet仅使用一个固定大小的圆柱体来确定抓取位姿参数, 而忽略了不同大小尺度的特征对抓取位姿估计的影响. 针对这一问题, 本文提出了一个多尺度圆柱体注意力特征融合模块(Ms-CAFF), 包含注意力融合模块和门控单元两个核心模块, 替代了GSNet中原始的特征提取方法, 使用注意力机制有效地融合4个不同大小圆柱体空间内部的几何特征, 从而增强了网络对不同尺度几何特征的感知能力. 在大规模杂乱场景抓取位姿检测数据集GraspNet-1Billion的实验结果表明, 在引入模块后将网络生成抓取位姿的精度最多提高了10.30%和6.65%. 同时本文将网络应用于实际实验, 验证了方法在真实场景当中的有效性.

关键词:点云;机器人抓取位姿检测;多尺度特征融合;杂乱场景;注意力机制

Abstract:

GSNet relies on graspness to distinguish graspable areas in cluttered scenes, which significantly improves the accuracy of robot grasping pose detection in cluttered scenes. However, GSNet only uses a fixed-size cylinder to determine the grasping pose parameters and ignores the influence of features of different sizes on grasping pose estimation. To address this problem, this study proposes a multi-scale cylinder attention feature fusion module (Ms-CAFF), which contains two core modules: the attention fusion module and the gating unit. It replaces the original feature extraction method in GSNet and uses an attention mechanism to effectively integrate the geometric features inside the four cylinders of different sizes, thereby enhancing the network’s ability to perceive geometric features at different scales. The experimental results on GraspNet-1Billion, a grabbing pose detection dataset for large-scale cluttered scenes, show that after the introduction of the modules, the accuracy of the network’s grasping poses is increased by up to 10.30% and 6.65%. At the same time, this study applies the network to actual experiments to verify the effectiveness of the method in real scenes.

Key words:point cloud;robot grasping pose detection;multi-scale feature fusion;cluttered scene;attention mechanism

引用本文

徐衍,林云汉,闵华松.杂乱场景中多尺度注意力特征融合抓取检测网络.计算机系统应用,2024,33(5):76-84

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2023-11-13
最后修改日期:2023-12-11
录用日期:
在线发布日期: 2024-04-01
出版日期:

微信公众号

网站二维码

引用本文

分享

文章指标

历史

文章二维码

微信公众号

网站二维码

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码