We apply the Proximal Policy Optimization algorithm in order to learn a Push Recovery strategy in... 0 0 0 Learning Humanoid Robot Push Recovery using Deep Reinforcement Learning Dicksiano Carvalho Melo Created: 08/27/2020