RL: thoughts, content for self-teaching and more
Direct policy search and reinforcement learning
Details about policy gradient methods
Slides
PG details
Video
PG details (9’)