RL: thoughts, content for self-teaching and more

Direct policy search and reinforcement learning

Details about policy gradient methods

Slides

PG details

Video

PG details (9 min)