Model-based Data Efficient Reinforcement Learning for Active Pantograph Control in High-Speed Railways

Hui Wang, Zhigang Liu, Xufan Wang, Xiangyu Meng, Yanbo Wu, Zhiwei Han

Research output: Contribution to journalArticlepeer-review

Abstract

The active pantograph is a promising technology to suppress contact force fluctuation in pantograph catenary systems (PCS). Recently, the rapid development of reinforcement learning techniques has dramatically facilitated complex system controllers’ design. However, the low data efficiency problem is fatal because data collection is costly. In this paper, We propose the Ensemble Q-functions Model-based Reinforcement Learning algorithm (EQ-MBRL) to achieve data-efficient reinforcement learning. First, we introduce an ensemble probabilistic neural network to estimate the distribution and uncertainty of the dynamics model and adopt multi-step loss to constraint accumulation error in the long-length model rollout. Second, we employ a short-term rollout of the model to trade off the ease of data generation and the error of the model-generated data. Finally, we propose ensemble Q functions and in-target minimization techniques to help stabilize the training process of value functions and improve the accuracy of value estimation. In addition, we discussed the appropriate model-based rollout length and explored the performance of network update rates with different strategies. The experimental results demonstrate that the proposed approach outperforms compared algorithms and delivers a state-of-the-art performance on the PCS benchmark. The controller learned robust motion patterns using only 50K collected transitions, which was more than ten times faster than compared baseline.

Original languageEnglish
JournalIEEE Transactions on Transportation Electrification
DOIs
Publication statusPublished - 18 Aug 2023

Keywords

  • Adaptation models
  • Data models
  • Finite element analysis
  • Force
  • High-speed railways
  • Predictive models
  • Training
  • Uncertainty
  • active pantograph control
  • contact force
  • model-based reinforcement learning

Fingerprint

Dive into the research topics of 'Model-based Data Efficient Reinforcement Learning for Active Pantograph Control in High-Speed Railways'. Together they form a unique fingerprint.

Cite this