Fig. 10From: Optimization algorithm for feedback and feedforward policies towards robot control robust to sensing failuresSnapshots before and after learning: the yellow horizontal dashed lines represents the target where \(y=0\); before learning, the initial policy failed to make the snaking locomotion forward; in contrast, the proposed method yielded the forward locomotion using the optimized composed policyBack to article page