Difference between revisions of "Trust Region Policy Optimization (TRPO)"
(Created page with "[http://www.youtube.com/results?search_query=Trust+Region+Policy+Optimization+%28TRPO%29 Youtube search...] * Deep Q Learning (DQN) <youtube>xvRrgxcpaHY</youtube> <youtu...") |
|||
Line 1: | Line 1: | ||
[http://www.youtube.com/results?search_query=Trust+Region+Policy+Optimization+%28TRPO%29 Youtube search...] | [http://www.youtube.com/results?search_query=Trust+Region+Policy+Optimization+%28TRPO%29 Youtube search...] | ||
− | * [[Deep | + | * [[Deep Reinforcement Learning]] |
<youtube>xvRrgxcpaHY</youtube> | <youtube>xvRrgxcpaHY</youtube> |