Difference between revisions of "Policy Gradient (PG)"
Line 8: | Line 8:
[http://www.google.com/search?q=Deep+Deterministic+Policy+Gradient+DDPG+machine+learning+ML+artificial+intelligence ...Google search] | [http://www.google.com/search?q=Deep+Deterministic+Policy+Gradient+DDPG+machine+learning+ML+artificial+intelligence ...Google search]
+ | * [[Policy vs Plan]]
* [[Trust Region Policy Optimization (TRPO)]] | * [[Trust Region Policy Optimization (TRPO)]]
* [[Proximal Policy Optimization (PPO)]] | * [[Proximal Policy Optimization (PPO)]]