Difference between revisions of "Trust Region Policy Optimization (TRPO)"
Line 1: | Line 1: | ||
[http://www.youtube.com/results?search_query=Trust+Region+Policy+Optimization+%28TRPO%29 Youtube search...] | [http://www.youtube.com/results?search_query=Trust+Region+Policy+Optimization+%28TRPO%29 Youtube search...] | ||
− | * [[Deep Reinforcement Learning]] | + | * [[Deep Reinforcement Learning (DRL)]] |
<youtube>xvRrgxcpaHY</youtube> | <youtube>xvRrgxcpaHY</youtube> |