TY - JOUR
T1 - Deep reinforcement learning enabling a BCFbot to learn various undulatory patterns
AU - Hameed, Imran
AU - Chao, Xu
AU - Navarro-Alarcon, David
AU - Jing, Xingjian
N1 - Publisher Copyright:
© 2025 The Author(s)
PY - 2025/3/15
Y1 - 2025/3/15
N2 - In bio-inspired marine robots, one particular motion pattern is generally adopted to achieve benefits of that pattern. However, multiple gait patterns can be utilized together in a single biomimetic design to employ their benefits, as required. However, there is a lack of a unified control scheme that can be used to optimize and mimic undulatory patterns observed among different organisms in the body and/or caudal fin (BCF) category. Thus, central pattern generators (CPGs) were incorporated into a deep reinforcement learning (DRL) architecture to train a robot to develop various swimming gaits. The proposed framework can not only develop and optimize distinct motion patterns but also seamlessly and instantly switch between them. Oscillators integrated into a learning paradigm provide a bioinspired framework to systematically develop a variety of swimming gaits. The prototyped BCFbot has multiple joints, which makes it easy to realize more than one tail undulation patterns. Three different swimming patterns (anguilliform, sub-carangiform, and carangiform) were learned through simulation and then verified on a physical robot. Testing and comparison results show that the claimed benefits of the three benchmark motion patterns can be well realized using the developed robot and can be freely switched and optimized using the developed DRL mechanism. This should be the first attempt for achieving a multimotion pattern optimization and switching within a single BCFbot and demonstrating a successful motion generation regime similar to a real animal.
AB - In bio-inspired marine robots, one particular motion pattern is generally adopted to achieve benefits of that pattern. However, multiple gait patterns can be utilized together in a single biomimetic design to employ their benefits, as required. However, there is a lack of a unified control scheme that can be used to optimize and mimic undulatory patterns observed among different organisms in the body and/or caudal fin (BCF) category. Thus, central pattern generators (CPGs) were incorporated into a deep reinforcement learning (DRL) architecture to train a robot to develop various swimming gaits. The proposed framework can not only develop and optimize distinct motion patterns but also seamlessly and instantly switch between them. Oscillators integrated into a learning paradigm provide a bioinspired framework to systematically develop a variety of swimming gaits. The prototyped BCFbot has multiple joints, which makes it easy to realize more than one tail undulation patterns. Three different swimming patterns (anguilliform, sub-carangiform, and carangiform) were learned through simulation and then verified on a physical robot. Testing and comparison results show that the claimed benefits of the three benchmark motion patterns can be well realized using the developed robot and can be freely switched and optimized using the developed DRL mechanism. This should be the first attempt for achieving a multimotion pattern optimization and switching within a single BCFbot and demonstrating a successful motion generation regime similar to a real animal.
KW - Biomimetic marine robots
KW - Deep reinforcement learning
KW - Swimming gait optimization
UR - http://www.scopus.com/inward/record.url?scp=85214454791&partnerID=8YFLogxK
U2 - 10.1016/j.oceaneng.2025.120322
DO - 10.1016/j.oceaneng.2025.120322
M3 - Journal article
AN - SCOPUS:85214454791
SN - 0029-8018
VL - 320
JO - Ocean Engineering
JF - Ocean Engineering
M1 - 120322
ER -