TY - JOUR
T1 - Exploration in deep reinforcement learning: A survey
AU - Ladosz, Pawel
AU - Weng, Lilian
AU - Kim, Minwoo
AU - Oh, Hyondong
PY - 2022/1
N2 - This paper reviews exploration techniques in deep reinforcement learning. Exploration techniques are of primary importance when solving sparse reward problems, in which rewards are rare and an agent acting randomly will therefore seldom encounter them. In such a scenario, it is challenging for a reinforcement learning agent to learn the association between actions and rewards, so more sophisticated exploration methods need to be devised. This review provides a comprehensive overview of existing exploration approaches, categorised by their key contributions as: rewarding novel states, rewarding diverse behaviours, goal-based methods, probabilistic methods, imitation-based methods, safe exploration and random-based methods. Unsolved challenges are then discussed to suggest valuable future research directions. Finally, the approaches of the different categories are compared in terms of complexity, computational effort and overall performance.
KW - Deep reinforcement learning
KW - Exploration
KW - Intrinsic motivation
KW - Sparse reward problems
UR - https://www.mendeley.com/catalogue/7ce27d08-64d4-3b03-ae03-b0918881bfe5/
DO - 10.1016/j.inffus.2022.03.003
M3 - Article
SN - 1566-2535
VL - 85
SP - 1
EP - 22
JO - Information Fusion
JF - Information Fusion
ER -