Difference between revisions of "Distributed Deep Reinforcement Learning (DDRL)"

From
Jump to: navigation, search
m
m
Line 24: Line 24:
 
*** [[Lifelong Latent Actor-Critic (LILAC)]]
 
*** [[Lifelong Latent Actor-Critic (LILAC)]]
 
** [[Hierarchical Reinforcement Learning (HRL)]]
 
** [[Hierarchical Reinforcement Learning (HRL)]]
 +
* [[Agents]]
  
  
  
a new, highly scalable agent architecture for distributed training called Importance Weighted Actor-Learner Architecture that uses a new off-policy correction algorithm called V-trace.
+
a new, highly scalable [[Agents|agent]] architecture for distributed training called Importance Weighted Actor-Learner Architecture that uses a new off-policy correction algorithm called V-trace.
  
 
<youtube>-YMfJLFynmA</youtube>
 
<youtube>-YMfJLFynmA</youtube>

Revision as of 07:56, 4 February 2023