Difference between revisions of "State-Action-Reward-State-Action (SARSA)"

From
Jump to: navigation, search
Line 13: Line 13:
 
** [[Markov Decision Process (MDP)]]
 
** [[Markov Decision Process (MDP)]]
 
** [[Q Learning]]
 
** [[Q Learning]]
 +
** State-Action-Reward-State-Action (SARSA)
 
** [[Deep Reinforcement Learning (DRL)]] DeepRL
 
** [[Deep Reinforcement Learning (DRL)]] DeepRL
 
** [[Distributed Deep Reinforcement Learning (DDRL)]]
 
** [[Distributed Deep Reinforcement Learning (DDRL)]]
Line 18: Line 19:
 
** [[Evolutionary Computation / Genetic Algorithms]]
 
** [[Evolutionary Computation / Genetic Algorithms]]
 
** [[Actor Critic]]
 
** [[Actor Critic]]
 +
*** [[Advanced Actor Critic (A2C)]]
 +
*** [[Asynchronous Advantage Actor Critic (A3C)]]
 +
*** [[Lifelong Latent Actor-Critic (LILAC)]]
 
** [[Hierarchical Reinforcement Learning (HRL)]]
 
** [[Hierarchical Reinforcement Learning (HRL)]]
  

Revision as of 12:48, 3 July 2020