Abstract
The mobile battery energy storage systems (MBESS) utilize flexibility in temporal and spatial to enhance smart grid resilience and economic benefits. Recently, the high penetration of renewable energy increases the volatility of electricity prices and gives MBESS an opportunity for price difference arbitrage. However, the strong randomness of both the traffic system and renewable energy leads to difficulty in achieving profit with acceptable risk. To address this problem, this paper proposes a risk-sensitive MBESS control framework based on safe deep reinforcement learning, which can constrain the risk under a certain level according to the company's risk preference. The risk-estimation safe deep deterministic policy gradient (RE-SDDPG) algorithm is proposed to learn the optimal policy under the premise of lacking direct risk signals. Moreover, a hybrid risk estimation method is proposed to avoid local convergence caused by inaccurate estimation during the learning process. Last, a parameter-sharing method is applied to increase learning efficiency by sharing the Q networks' parameters. The proposed methods are tested in the IEEE 30-bus system. The results show that the proposed method can keep the profit at a relatively high level while reducing the risk and increase learning efficiency compared with existing methods.
| Original language | English |
|---|---|
| Pages (from-to) | 4143-4158 |
| Number of pages | 16 |
| Journal | IEEE Transactions on Smart Grid |
| Volume | 15 |
| Issue number | 4 |
| DOIs | |
| Publication status | Published - 1 Jul 2024 |
| Externally published | Yes |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
Keywords
- Mobile battery energy storage system
- hybrid risk estimation
- parametersharing method
- safe deep reinforcement learning
ASJC Scopus subject areas
- General Computer Science
Fingerprint
Dive into the research topics of 'Risk-Sensitive Mobile Battery Energy Storage System Control With Deep Reinforcement Learning and Hybrid Risk Estimation Method'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver