Reinforcement learning-based scheduling of multi-battery energy storage system

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

In this paper, a reinforcement learning-based multi-battery energy storage system (MBESS) scheduling policy is proposed to minimize the consumers' electricity cost. The MBESS scheduling problem is modeled as a Markov decision process (MDP) with unknown transition probability. However, the optimal value function is time-dependent and difficult to obtain because of the periodicity of the electricity price and residential load. Therefore, a series of time-independent action-value functions are proposed to describe every period of a day. To approximate every action-value function, a corresponding critic network is established, which is cascaded with other critic networks according to the time sequence. Then, the continuous management strategy is obtained from the related action network. Moreover, a two-stage learning protocol including offline and online learning stages is provided for detailed implementation in real-time battery management. Numerical experimental examples are given to demonstrate the effectiveness of the developed algorithm.

Original languageEnglish
Pages (from-to)117-128
Number of pages12
JournalJournal of Systems Engineering and Electronics
Volume34
Issue number1
DOIs
StatePublished - 1 Feb 2023
Externally publishedYes

Keywords

  • data-driven
  • multi-battery energy storage system (MBESS)
  • periodic value iteration
  • reinforcement learning

Fingerprint

Dive into the research topics of 'Reinforcement learning-based scheduling of multi-battery energy storage system'. Together they form a unique fingerprint.

Cite this