Recent advances in reinforcement learning-based autonomous driving behavior planning: A survey

Jingda Wu, Chao Huang, Hailong Huang, Chen Lv, Yuntong Wang, Fei Yue Wang

Research output: Journal article publicationReview articleAcademic researchpeer-review

2 Citations (Scopus)

Abstract

Autonomous driving (AD) holds the potential to revolutionize transportation efficiency, but its success hinges on robust behavior planning (BP) mechanisms. Reinforcement learning (RL) emerges as a pivotal tool in crafting these BP strategies. This paper offers a comprehensive review of RL-based BP strategies, spotlighting advancements from 2021 to 2023. We completely organize and distill the relevant literature, emphasizing paradigm shifts in RL-based BP. Introducing a novel categorization, we trace the trajectory of efforts aimed at surmounting practical challenges encountered by autonomous vehicles through innovative RL techniques. To guide readers, we furnish a quantitative analysis that maps the volume and diversity of recent RL configurations, elucidating prevailing trends. Additionally, we delve into the imminent challenges and potential directions for the future of RL-driven BP in AD. These directions encompass addressing safety vulnerabilities, fostering continual learning capabilities, enhancing data efficiency, championing collaborative vehicular cloud networks, integrating large language models, and enhancing ethical considerations.

Original languageEnglish
Article number104654
Number of pages28
JournalTransportation Research Part C: Emerging Technologies
Volume164
DOIs
Publication statusPublished - Jul 2024

Keywords

  • Autonomous driving
  • Autonomous vehicle
  • Behavior planning
  • Decision
  • Reinforcement learning

ASJC Scopus subject areas

  • Civil and Structural Engineering
  • Automotive Engineering
  • Transportation
  • Management Science and Operations Research

Fingerprint

Dive into the research topics of 'Recent advances in reinforcement learning-based autonomous driving behavior planning: A survey'. Together they form a unique fingerprint.

Cite this