Difference between revisions of "Distributed Deep Reinforcement Learning (DDRL)"

From
Jump to: navigation, search
m (Text replacement - "http:" to "https:")
m
 
Line 25: Line 25:
 
** [[Hierarchical Reinforcement Learning (HRL)]]
 
** [[Hierarchical Reinforcement Learning (HRL)]]
 
* [[Agents]]  ... [[Agents#Communication | communications]]
 
* [[Agents]]  ... [[Agents#Communication | communications]]
 +
* [[Policy]]  ... [[Policy vs Plan]] ... [[Constitutional AI]] ... [[Trust Region Policy Optimization (TRPO)]] ... [[Policy Gradient (PG)]] ... [[Proximal Policy Optimization (PPO)]]
  
  

Latest revision as of 15:36, 16 April 2023