基于场景语义感知与大语言模型推理的行为树生成

doi:10.15888/j.cnki.csa.009742

AIPUB归智期刊联盟

微信公众号

网站二维码

2025年4月9日 21:53 星期三

首页 > 过刊浏览>2025年第34卷第1期 >37-46. DOI:10.15888/j.cnki.csa.009742

PDF HTML阅读 XML下载导出引用引用提醒

基于场景语义感知与大语言模型推理的行为树生成
DOI:
                        10.15888/j.cnki.csa.009742
                    
CSTR:
                        32024.14.csa.009742
                    
作者:
                        鄢龙武鄢龙武
武汉科技大学 计算机科学与技术学院, 武汉 430081;武汉科技大学 智能信息处理与实时工业系统湖北省重点实验室, 武汉 430081
在期刊界中查找
在百度中查找
在本站中查找
郑王里郑王里
国网电力科学研究院有限公司, 南京 211106
在期刊界中查找
在百度中查找
在本站中查找
林云汉林云汉
武汉科技大学 计算机科学与技术学院, 武汉 430081;武汉科技大学 智能信息处理与实时工业系统湖北省重点实验室, 武汉 430081
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家重点研发计划 (2022YFB4700400)

Behavior Tree Generation Based on Scene Semantic Perception and Reasoning with Large Language Models

Author:

YAN Long-Wu
YAN Long-Wu
School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430081, China;Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System, Wuhan University of Science and Technology, Wuhan 430081, China
在期刊界中查找
在百度中查找
在本站中查找
ZHENG Wang-Li
ZHENG Wang-Li
State Grid Electric Power Research Institute Co. Ltd., Nanjing 211106, China
在期刊界中查找
在百度中查找
在本站中查找
LIN Yun-Han
LIN Yun-Han
School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430081, China;Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System, Wuhan University of Science and Technology, Wuhan 430081, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

具身智能(embodied AI)需要能够与环境进行互动和感知, 并具备自主规划、决策和行动等能力. 行为树(BT)由于其模块化和高效控制的特性, 已经成为机器人技术中广泛使用的方法. 然而, 现有的行为树生成技术在处理复杂任务时仍面临一定的挑战. 这些方法通常依赖于领域专业知识, 生成行为树的能力有限. 此外, 许多现有方法在语言理解方面存在不足, 或者在理论上无法保证行为树的成功, 从而导致在机器人上的实际部署难度较大. 本研究提出一种新的行为树自动生成方法, 该方法基于大语言模型(LLM)和场景语义感知, 生成包含任务目标的初始行为树. 本文的方法根据机器人的能力设计机器人动作原语和相关条件节点, 并以此设计提示(prompt)使LLM输出行为规划(generated plan), 然后将行为规划转化为初始行为树. 虽然本文以此为示例, 但该方法具有广泛的适用性, 可以根据不同需求应用于其他类型的机器人任务. 同时, 本文将这种方法应用于机器人任务中, 并给出具体实现方法和示例. 在机器人执行任务过程中, 行为树可以根据机器人操作失误和环境变化动态更新, 对外部环境变化具有一定的鲁棒性. 本文进行了初始行为树生成验证实验, 并在仿真机器人环境中进行了验证, 展示了本文方法的有效性.

关键词:具身智能;大语言模型;机器人操作;行为树;行为树生成

Abstract:

Embodied AI requires the ability to interact with and perceive the environment, and capabilities such as autonomous planning, decision making, and action taking. Behavior trees (BTs) become a widely used approach in robotics due to their modularity and efficient control. However, existing behavior tree generation techniques still face certain challenges when dealing with complex tasks. These methods typically rely on domain expertise and have a limited capacity to generate behavior trees. In addition, many existing methods have language comprehension deficiencies or are theoretically unable to guarantee the success of the behavior tree, leading to difficulties in practical robotic applications. In this study, a new method for automatic behavior tree generation is proposed, which generates an initial behavior tree with task goals based on large language models (LLMs) and scene semantic perception. The method in this study designs robot action primitives and related condition nodes based on the robot’s capabilities. It then uses these to design prompts to make the LLMs output a behavior plan (generated plan), which is then transformed into an initial behavior tree. Although this paper takes this as an example, the method has wide applicability and can be applied to other types of robotic tasks according to different needs. Meanwhile, this study applies this method to robot tasks and gives specific implementation methods and examples. During the process of the robot performing a task, the behavior tree can be dynamically updated in response to the robot’s operation errors and environmental changes and has a certain degree of robustness to changes in the external environment. In this study, the first validation experiments on behavior tree generation are carried out and verified in the simulated robot environment, which demonstrates the effectiveness of the proposed method.

Key words:embodied AI;large language model (LLM);robotic manipulation;behavior tree (BT);behavior tree generation

引用本文

鄢龙武,郑王里,林云汉.基于场景语义感知与大语言模型推理的行为树生成.计算机系统应用,2025,34(1):37-46

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2024-06-14
最后修改日期:2024-07-18
录用日期:
在线发布日期: 2024-11-15
出版日期:

微信公众号

网站二维码

引用本文

分享

文章指标

历史

文章二维码

微信公众号

网站二维码

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码