Federated Reinforcement Learning for Collective Navigation of Robotic Swarms

Seongin Na, Tomáš Rouček, Jiří Ulrich, Jan Pikman, Tomáš Krajník, Barry Lennox, Farshad Arvin (Corresponding)

Research output: Contribution to journalArticlepeer-review

1 Downloads (Pure)

Abstract

The recent advancement of deep reinforcement learning (DRL) contributed to robotics by allowing automatic controller design. The automatic controller design is a crucial approach for designing swarm robotic systems, which require more complex controllers than a single robot system to lead a desired collective behavior. Although the DRL-based controller design method showed its effectiveness for swarm robotic systems, the reliance on the central training server is a critical problem in real-world environments where robot-server communication is unstable or limited. We propose a novel federated learning (FL)-based DRL training strategy federated learning DDPG (FLDDPG) for use in swarm robotic applications. Through the comparison with baseline strategies under a limited communication bandwidth scenario, it is shown that the FLDDPG method resulted in higher robustness and generalization ability into a different environment and real robots, while the baseline strategies suffer from the limitation of communication bandwidth. This result suggests that the proposed method can benefit swarm robotic systems operating in environments with limited communication bandwidth, e.g., in high radiation, underwater, or subterranean environments.

Original languageEnglish
Pages (from-to)2122-2131
Number of pages10
JournalIEEE Transactions on Cognitive and Developmental Systems
Volume15
Issue number4
Early online date25 Jan 2023
DOIs
Publication statusPublished - 1 Dec 2023

Keywords

  • Collective Navigation
  • Collision avoidance
  • Deep Reinforcement Learning
  • Design methodology
  • Federated Learning
  • Robot kinematics
  • Robots
  • Servers
  • Swarm Robotics
  • Swarm robotics
  • Training

Fingerprint

Dive into the research topics of 'Federated Reinforcement Learning for Collective Navigation of Robotic Swarms'. Together they form a unique fingerprint.

Cite this