TY - JOUR
T1 - Particle-Assisted Deep Reinforcement Learning for Quantum State Manipulation
AU - Yu, Haixu
AU - Liu, Xiang
AU - Wang, Bohui
AU - Zhao, Xudong
N1 - Publisher Copyright:
© 1997-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - Applying deep reinforcement learning (DRL) to solve quantum control problems has become a popular research direction. However, the exploration capability and reward design for the learning agent, which usually affect the DRL's application performance, has not been sufficiently emphasized. In this article, we propose a particle-assisted DRL (PDRL) method to address the above concern by enhancing exploration capabilities and designing appropriate reward functions for efficient quantum state manipulation. In PDRL, each episode in the quantum learning process is characterized by three kinds of events, i.e., unidentifiable, identifiable, and successful events. To improve exploration, exploration particles and feedback particles are employed in the early learning phase when episodes end in identifiable and successful events, respectively. To assign rewards, three event-based reward functions are provided for the DRL's agent, exploration particles and feedback particles, respectively. Numerical results on single-qubit, two-qubit, and many-qubit systems validate the effectiveness of PDRL. Comparative results with existing DRL methods demonstrate the superior performance of PDRL for quantum state manipulation.
AB - Applying deep reinforcement learning (DRL) to solve quantum control problems has become a popular research direction. However, the exploration capability and reward design for the learning agent, which usually affect the DRL's application performance, has not been sufficiently emphasized. In this article, we propose a particle-assisted DRL (PDRL) method to address the above concern by enhancing exploration capabilities and designing appropriate reward functions for efficient quantum state manipulation. In PDRL, each episode in the quantum learning process is characterized by three kinds of events, i.e., unidentifiable, identifiable, and successful events. To improve exploration, exploration particles and feedback particles are employed in the early learning phase when episodes end in identifiable and successful events, respectively. To assign rewards, three event-based reward functions are provided for the DRL's agent, exploration particles and feedback particles, respectively. Numerical results on single-qubit, two-qubit, and many-qubit systems validate the effectiveness of PDRL. Comparative results with existing DRL methods demonstrate the superior performance of PDRL for quantum state manipulation.
KW - Particle-assisted exploration
KW - deep reinforcement learning
KW - quantum control
KW - reward design
UR - https://www.scopus.com/pages/publications/85216679225
U2 - 10.1109/TEVC.2025.3534530
DO - 10.1109/TEVC.2025.3534530
M3 - 文章
AN - SCOPUS:85216679225
SN - 1089-778X
JO - IEEE Transactions on Evolutionary Computation
JF - IEEE Transactions on Evolutionary Computation
ER -