Difference between revisions of "Advanced Actor Critic (A2C)"

Revision as of 07:15, 6 July 2020

A2C produces comparable performance to Asynchronous Advantage Actor Critic (A3C) while being more efficient. A2C is like A3C but without the asynchronous part; this means a single-worker variant of the A3C. Understanding Actor Critic Methods and A2C | Chris Yoon - Towards Data Science

@@ Line 18: / Line 18: @@
 ** [[Evolutionary Computation / Genetic Algorithms]]
 ** [[Actor Critic]]
+*** [[Asynchronous Advantage Actor Critic (A3C)]]
 *** Advanced Actor Critic (A2C)
-*** [[Asynchronous Advantage Actor Critic (A3C)]]
 *** [[Lifelong Latent Actor-Critic (LILAC)]]
 ** [[Hierarchical Reinforcement Learning (HRL)]]
 * [http://towardsdatascience.com/advanced-reinforcement-learning-6d769f529eb3 Beyond DQN/A3C: A Survey in Advanced Reinforcement Learning | Joyce Xu - Towards Data Science]
 * [[Policy Gradient (PG)]]