Iterative Shaping of Multi-Particle Aggregates Based on Action Trees and VLM

Hoi Yin Lee, Peng Zhou, Anqing Duan, Chenguang Yang, David Navarro-Alarcon

Research output: Journal article publicationJournal articleAcademic researchpeer-review

1 Citation (Scopus)

Abstract

In this letter, we address the problem of manipulating multi-particle aggregates using a bimanual robotic system. Our approach enables the autonomous transport of dispersed particles through a series of shaping and pushing actions using robotically controlled tools. Achieving this advanced manipulation capability presents two key challenges: high-level task planning and trajectory execution. For task planning, we leverage Vision Language Models (VLMs) to enable primitive actions such as tool affordance grasping and non-prehensile particle pushing. For trajectory execution, we represent the evolving particle aggregate's contour using truncated Fourier series, providing efficient parametrization of its closed shape. We adaptively compute trajectory waypoints based on group cohesion and the geometric centroid of the aggregate, accounting for its spatial distribution and collective motion. Through real-world experiments, we demonstrate the effectiveness of our methodology in actively shaping and manipulating multi-particle aggregates while maintaining high system cohesion.

Original languageEnglish
Pages (from-to)7102-7109
Number of pages8
JournalIEEE Robotics and Automation Letters
Volume10
Issue number7
DOIs
Publication statusPublished - Jul 2025

Keywords

  • action trees
  • multi- particle manipulation
  • Robot manipulation
  • shape control
  • VLM

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Biomedical Engineering
  • Human-Computer Interaction
  • Mechanical Engineering
  • Computer Vision and Pattern Recognition
  • Computer Science Applications
  • Control and Optimization
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Iterative Shaping of Multi-Particle Aggregates Based on Action Trees and VLM'. Together they form a unique fingerprint.

Cite this