Abstract
The ability to wield tools was once considered exclusive to human intelligence, but it is now known that many other animals, like crows, possess this capability. Yet, robotic systems still fall short of matching biological dexterity. In this paper, we investigate the use of Large Language Models (LLMs), tool affordances, and object manoeuvrability for non-prehensile tool-based manipulation tasks. Our novel method leverages LLMs based on scene information and natural language instructions to enable symbolic task planning for tool-object manipulation. This approach allows the system to convert a human language sentence into a sequence of feasible motion functions. We have developed a novel manoeuvrability-driven controller using a new tool affordance model derived from visual feedback. This controller helps guide the robot's tool utilization and manipulation actions, even within confined areas, using a stepping incremental approach. The proposed methodology is evaluated with experiments to prove its effectiveness under various manipulation scenarios.
| Original language | English |
|---|---|
| Article number | 103231 |
| Journal | Robotics and Computer-Integrated Manufacturing |
| Volume | 100 |
| DOIs | |
| Publication status | Published - Aug 2026 |
Keywords
- Human–robot collaboration
- Large Language Models (LLMs)
- Symbolic planning
ASJC Scopus subject areas
- Control and Systems Engineering
- Software
- General Mathematics
- Computer Science Applications
- Industrial and Manufacturing Engineering
Fingerprint
Dive into the research topics of 'Non-prehensile tool-object manipulation by integrating LLM-based planning and manoeuvrability-driven controls'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver