Difference between revisions of "Proximal Policy Optimization (PPO)"
(Created page with "[http://www.youtube.com/results?search_query=Trust+Region+Policy+Optimization+%28TRPO%29 Youtube search...] * Deep Q Learning (DQN) <youtube>xvRrgxcpaHY</youtube> <youtu...") |
|||
| Line 1: | Line 1: | ||
| − | [http://www.youtube.com/results?search_query= | + | [http://www.youtube.com/results?search_query=Proximal+Policy+Optimization+%28PPO%29 Youtube search...] |
* [[Deep Q Learning (DQN)]] | * [[Deep Q Learning (DQN)]] | ||
| − | <youtube> | + | <youtube>QHAu8EWRJJ0</youtube> |
| − | <youtube> | + | <youtube>bqdjsmSoSgI</youtube> |
| + | <youtube>GlwgeUmhWIM</youtube> | ||
| + | <youtube>0cBAjqQ8nw4</youtube> | ||