Distributed Deep Reinforcement Learning (DDRL)

From
Revision as of 20:00, 1 September 2019 by BPeat (talk | contribs)
Jump to: navigation, search

Youtube search... ...Google search

a new, highly scalable agent architecture for distributed training called Importance Weighted Actor-Learner Architecture that uses a new off-policy correction algorithm called V-trace.