Deep RL Through Policy Optimization – P. Abbeel, J. Schulman (NIPS 2016 Slides) [pdf] (people.eecs.berkeley.edu)
2 points by seycombi 10 years ago