TY - JOUR
T1 - Demonstration Learning and Generalization of Robotic Motor Skills Based on Wearable Motion Tracking Sensors
AU - Liu, Xiaofeng
AU - Wang, Ziyang
AU - Li, Jie
AU - Cangelosi, Angelo
AU - Yang, Chenguang
PY - 2023/6/22
Y1 - 2023/6/22
N2 - Learning from Demonstration (LfD) is an important way for robots to learn new skills. Its principle is to obtain an optimized trajectory through an instructor’s action teaching or action coding regression. By learning the teaching trajectory, the robot can master the movement skills. However, adapting the learned motor skills to meet additional constraints that arise during the task can be challenging. This paper proposed a noise exploration method PIBB-CMA, which improves the Policy Improvement through Black-Box Optimization (PIBB) algorithm by using an adaptive covariance matrix. In our method, the teaching trajectory is collected with our designed inertial motion capture system. After then, a more reasonable trajectory is synthesized as the input to Dynamic Movement Primitives (DMP), using the Gaussian Mixture Model (GMM) and Gaussian Mixture Regression (GMR). Finally, the improved algorithm PIBB-CMA is used to perform noise exploration to ensure that the robot arm can pass the designated points in the path planning. Furthermore, three baseline methods are chosen for comparison to validate the robustness of our proposed method. The experimental results demonstrated that our proposed method outperforms the baseline algorithms and is feasible for improving robot intelligence. We believe that our method has a certain potential in the aspect of skill learning for robots.
AB - Learning from Demonstration (LfD) is an important way for robots to learn new skills. Its principle is to obtain an optimized trajectory through an instructor’s action teaching or action coding regression. By learning the teaching trajectory, the robot can master the movement skills. However, adapting the learned motor skills to meet additional constraints that arise during the task can be challenging. This paper proposed a noise exploration method PIBB-CMA, which improves the Policy Improvement through Black-Box Optimization (PIBB) algorithm by using an adaptive covariance matrix. In our method, the teaching trajectory is collected with our designed inertial motion capture system. After then, a more reasonable trajectory is synthesized as the input to Dynamic Movement Primitives (DMP), using the Gaussian Mixture Model (GMM) and Gaussian Mixture Regression (GMR). Finally, the improved algorithm PIBB-CMA is used to perform noise exploration to ensure that the robot arm can pass the designated points in the path planning. Furthermore, three baseline methods are chosen for comparison to validate the robustness of our proposed method. The experimental results demonstrated that our proposed method outperforms the baseline algorithms and is feasible for improving robot intelligence. We believe that our method has a certain potential in the aspect of skill learning for robots.
KW - Path planning
KW - Learning from Demonstration
KW - Dynamic Movement Primitives
KW - Policy Improvement through Black-box Optimization
KW - adaptive covariance matrix
UR - https://www.scopus.com/pages/publications/85163542890
U2 - 10.1109/TIM.2023.3288240
DO - 10.1109/TIM.2023.3288240
M3 - Article
SN - 0018-9456
VL - 72
JO - IEEE Transactions on Instrumentation and Measurement
JF - IEEE Transactions on Instrumentation and Measurement
ER -