TY - JOUR
T1 - From rough to precise
T2 - Human-inspired phased target learning framework for redundant musculoskeletal systems
AU - Zhou, Junjie
AU - Chen, Jiahao
AU - Deng, Hu
AU - Qiao, Hong
N1 - Publisher Copyright:
Copyright © 2019 Zhou, Chen, Deng and Qiao.
PY - 2019
Y1 - 2019
N2 - Redundant muscles in human-like musculoskeletal robots provide additional dimensions to the solution space. Consequently, the computation of muscle excitations remains an open question. Conventional methods like dynamic optimization and reinforcement learning usually have high computational costs or unstable learning processes when applied to a complex musculoskeletal system. Inspired by human learning, we propose a phased target learning framework that provides different targets to learners at varying levels, to guide their training process and to avoid local optima. By introducing an extra layer of neurons reflecting a preference, we improve the Q-network method to generate continuous excitations. In addition, based on information transmission in the human nervous system, two kinds of biological noise sources are introduced into our framework to enhance exploration over the solution space. Tracking experiments based on a simplified musculoskeletal arm model indicate that under guidance of phased targets, the proposed framework prevents divergence of excitations, thus stabilizing training. Moreover, the enhanced exploration of solutions results in smaller motion errors. The phased target learning framework can be expanded for general-purpose reinforcement learning, and it provides a preliminary interpretation for modeling the mechanisms of human motion learning.
AB - Redundant muscles in human-like musculoskeletal robots provide additional dimensions to the solution space. Consequently, the computation of muscle excitations remains an open question. Conventional methods like dynamic optimization and reinforcement learning usually have high computational costs or unstable learning processes when applied to a complex musculoskeletal system. Inspired by human learning, we propose a phased target learning framework that provides different targets to learners at varying levels, to guide their training process and to avoid local optima. By introducing an extra layer of neurons reflecting a preference, we improve the Q-network method to generate continuous excitations. In addition, based on information transmission in the human nervous system, two kinds of biological noise sources are introduced into our framework to enhance exploration over the solution space. Tracking experiments based on a simplified musculoskeletal arm model indicate that under guidance of phased targets, the proposed framework prevents divergence of excitations, thus stabilizing training. Moreover, the enhanced exploration of solutions results in smaller motion errors. The phased target learning framework can be expanded for general-purpose reinforcement learning, and it provides a preliminary interpretation for modeling the mechanisms of human motion learning.
KW - Human-inspired motion learning
KW - Musculoskeletal system
KW - Noise in nervous system
KW - Phased target learning
KW - Reinforcement learning
UR - https://www.scopus.com/pages/publications/85072053487
U2 - 10.3389/fnbot.2019.00061
DO - 10.3389/fnbot.2019.00061
M3 - 文章
AN - SCOPUS:85072053487
SN - 1662-5218
VL - 13
JO - Frontiers in Neurorobotics
JF - Frontiers in Neurorobotics
M1 - 61
ER -