Abstract
Adaptive traffic signal control is widely recognized as an effective solution to improve urban mobility and reduce congestion in metropolises. Recently, reinforcement learning has been adopted for this transportation problem. While centralized reinforcement learning inevitably faces action space explosion, decentralized reinforcement learning allows agents to develop policies based on local observations but suffers from unstable training. In this paper, we present CommNetPF, a multi-agent decentralized reinforcement learning model incorporating communication and neighbourhood policy fingerprints for adaptive traffic signal control. With policy fingerprints in communication, agents learn to produce cooperative policies and the model converges faster. Experiments in scenarios of adaptive traffic signal control show that CommNetPF outperforms several strong baselines in terms of control performance and convergence speed.
| Original language | English |
|---|---|
| Title of host publication | 2020 IEEE 16th International Conference on Automation Science and Engineering, CASE 2020 |
| Publisher | IEEE Computer Society |
| Pages | 266-273 |
| Number of pages | 8 |
| ISBN (Electronic) | 9781728169040 |
| DOIs | |
| Publication status | Published - Aug 2020 |
| Externally published | Yes |
| Event | 16th IEEE International Conference on Automation Science and Engineering, CASE 2020 - Hong Kong, Hong Kong Duration: 20 Aug 2020 → 21 Aug 2020 |
Publication series
| Name | IEEE International Conference on Automation Science and Engineering |
|---|---|
| Volume | 2020-August |
| ISSN (Print) | 2161-8070 |
| ISSN (Electronic) | 2161-8089 |
Conference
| Conference | 16th IEEE International Conference on Automation Science and Engineering, CASE 2020 |
|---|---|
| Country/Territory | Hong Kong |
| City | Hong Kong |
| Period | 20/08/20 → 21/08/20 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 11 Sustainable Cities and Communities
Keywords
- adaptive traffic signal control
- multi-agent reinforcement learning
- policy fingerprints
- reinforcement learning
ASJC Scopus subject areas
- Control and Systems Engineering
- Electrical and Electronic Engineering
Fingerprint
Dive into the research topics of 'Learning Multi-Agent Communication with Policy Fingerprints for Adaptive Traffic Signal Control'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver