Near Optimal Policy Optimization via REPS

Publication
35th Conference on Neural Processing Systems