TY - GEN
T1 - Manipulating Constrained Soft Tissue while Avoiding Obstacles using Reinforcement Learning with Self-Attention
AU - He, Xian
AU - Zhang, Shuai
AU - Yang, Shanlin
AU - Ouyang, Bo
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - The complexity of dynamics raises challenges in deformable object manipulation, particularly the soft tissue shape control in in vivo environment. Previous studies assume the soft tissue is in an obstacle-free space, and the constraints are constant. To go one step further, a Soft Actor-Critic (SAC) algorithm is presented to manipulate the soft tissue under external disturbances while avoiding obstacles. K-Means is applied to reduce the dimension of the state space. Furthermore, self-attention is introduced to focus on the external forces and the constraints. Experiment results show that the agent can shape the soft tissue into the target deformation with the proposed approach under unknown external force, where the agent can automatically increase attention to the external force. The agent can find an obstacle-free path while manipulating the soft tissue. Compared with the SAC algorithm, the proposed approach with self-attention requires less training time, and the critic network is more stable.
AB - The complexity of dynamics raises challenges in deformable object manipulation, particularly the soft tissue shape control in in vivo environment. Previous studies assume the soft tissue is in an obstacle-free space, and the constraints are constant. To go one step further, a Soft Actor-Critic (SAC) algorithm is presented to manipulate the soft tissue under external disturbances while avoiding obstacles. K-Means is applied to reduce the dimension of the state space. Furthermore, self-attention is introduced to focus on the external forces and the constraints. Experiment results show that the agent can shape the soft tissue into the target deformation with the proposed approach under unknown external force, where the agent can automatically increase attention to the external force. The agent can find an obstacle-free path while manipulating the soft tissue. Compared with the SAC algorithm, the proposed approach with self-attention requires less training time, and the critic network is more stable.
UR - https://www.scopus.com/pages/publications/85147325258
U2 - 10.1109/ROBIO55434.2022.10011986
DO - 10.1109/ROBIO55434.2022.10011986
M3 - 会议稿件
AN - SCOPUS:85147325258
T3 - 2022 IEEE International Conference on Robotics and Biomimetics, ROBIO 2022
SP - 845
EP - 850
BT - 2022 IEEE International Conference on Robotics and Biomimetics, ROBIO 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE International Conference on Robotics and Biomimetics, ROBIO 2022
Y2 - 5 December 2022 through 9 December 2022
ER -