Distributed Deep Reinforcement Learning (DDRL)



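The Importance Weighted Actor-Learner Architecture (IMPALA) is a highly scalable agent architecture for distributed training. Actors generate experience under a behaviour policy that may lag behind the learner's current policy, and an off-policy correction algorithm called V-trace compensates for that lag.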
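
To make the off-policy correction concrete, here is a minimal sketch of V-trace value targets for a single trajectory. It assumes the truncated importance-weight formulation described in the IMPALA paper; the function name `vtrace_targets`, its parameters, and the default values for `gamma`, `rho_bar`, and `c_bar` are illustrative choices, not taken from this page or from any particular library.

```python
import math


def vtrace_targets(log_rhos, rewards, values, bootstrap_value,
                   gamma=0.99, rho_bar=1.0, c_bar=1.0):
    """Sketch of V-trace targets v_s for one trajectory of length T.

    log_rhos[t]     : log(pi(a_t | x_t) / mu(a_t | x_t)), learner vs. actor policy
    rewards[t]      : reward received after step t
    values[t]       : learner's value estimate V(x_t)
    bootstrap_value : value estimate V(x_T) for the state after the last step
    (All names and defaults here are assumptions made for illustration.)
    """
    values = list(values)
    T = len(rewards)

    # Truncated importance weights: rho_t scales the TD error,
    # c_t controls how far corrections propagate back in time.
    ratios = [math.exp(lr) for lr in log_rhos]
    rhos = [min(rho_bar, r) for r in ratios]
    cs = [min(c_bar, r) for r in ratios]

    # Importance-weighted temporal-difference errors.
    next_values = values[1:] + [bootstrap_value]
    deltas = [rhos[t] * (rewards[t] + gamma * next_values[t] - values[t])
              for t in range(T)]

    # Backward recursion: v_s - V(x_s) = delta_s + gamma * c_s * (v_{s+1} - V(x_{s+1})).
    targets = [0.0] * T
    acc = 0.0
    for t in reversed(range(T)):
        acc = deltas[t] + gamma * cs[t] * acc
        targets[t] = values[t] + acc
    return targets
```

When the actor and learner policies agree (all `log_rhos` equal to zero) and both truncation levels are 1, the targets reduce to ordinary n-step bootstrapped returns. When the learner assigns zero probability to the actor's actions, the targets collapse to the current value estimates, so stale experience contributes no update.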