Difference between revisions of "Proximal Policy Optimization (PPO)"

From
Jump to: navigation, search
Line 9: Line 9:
  
 
* [[Deep Reinforcement Learning (DRL)]]
 
* [[Deep Reinforcement Learning (DRL)]]
 +
* [[Policy Gradient (PG)]]
  
 
<youtube>5P7I-xPq8u8</youtube>
 
<youtube>5P7I-xPq8u8</youtube>

Revision as of 16:06, 3 July 2020