Difference between revisions of "Proximal Policy Optimization (PPO)"

(Created page with "[http://www.youtube.com/results?search_query=Trust+Region+Policy+Optimization+%28TRPO%29 Youtube search...] * Deep Q Learning (DQN) <youtube>xvRrgxcpaHY</youtube> <youtu...")
 
Line 1:
[http://www.youtube.com/results?search_query=Trust+Region+Policy+Optimization+%28TRPO%29 Youtube search...]
+
[http://www.youtube.com/results?search_query=Proximal+Policy+Optimization+%28PPO%29 Youtube search...]
  
 
* [[Deep Q Learning (DQN)]]
 
  
<youtube>xvRrgxcpaHY</youtube>
+
<youtube>QHAu8EWRJJ0</youtube>
<youtube>CKaN5PgkSBc</youtube>
+
<youtube>bqdjsmSoSgI</youtube>
 +
<youtube>GlwgeUmhWIM</youtube>
 +
<youtube>0cBAjqQ8nw4</youtube>

Revision as of 22:23, 26 May 2018