Difference between revisions of "Reinforcement Learning (RL)"

Revision as of 14:33, 11 August 2019

Markov Decision Process (MDP)
Monte Carlo (MC) Method - Model Free Reinforcement Learning
Deep Reinforcement Learning (DRL) - DeepRL
Neural Architecture Search (NAS) with Reinforcement Learning | Barret Zoph & Quoc V. Le ...Wikipedia
Distributed Deep Reinforcement Learning (DeepRL)
Deep Q Learning (DQN)
Neural Coreference
State-Action-Reward-State-Action (SARSA)
Deep Deterministic Policy Gradient (DDPG)
Trust Region Policy Optimization (TRPO)
Proximal Policy Optimization (PPO)
AdaNet

___________________________________________________________

Apprenticeship Learning - Inverse Reinforcement Learning (IRL)
Lifelong Learning
Dopamine Google DeepMind
- Math for Intelligence
Inside Out - Curious Optimistic Reasoning
World Models
Google DeepMind AlphaGo Zero
Google’s AI picks which machine learning models will produce the best results | Kyle Wiggers - VentureBeat off-policy classification,” or OPC, which evaluates the performance of AI-driven agents by treating evaluation as a classification problem
Deep Reinforcement Learning Hands-On: Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more | Maxim Lapan
Reinforcement-Learning-Notebooks - A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python

This is a bit similar to the traditional type of data analysis; the algorithm discovers through trial and error and decides which action results in greater rewards. Three major components can be identified in reinforcement learning functionality: the agent, the environment, and the actions. The agent is the learner or decision-maker, the environment includes everything that the agent interacts with, and the actions are what the agent can do. Reinforcement learning occurs when the agent chooses actions that maximize the expected reward over a given time. This is best achieved when the agent has a good policy to follow. Machine Learning: What it is and Why it Matters | Priyadharshini @ simplilearn

@@ Line 9: / Line 9: @@
 * [[Markov Decision Process (MDP)]]
-* [[Monte Carlo (MC) Method]] - Model Free Reinforcement Learning
+* [[Monte Carlo]] (MC) Method - Model Free Reinforcement Learning
 * [[Deep Reinforcement Learning (DRL)]] - DeepRL
 * [http://arxiv.org/abs/1611.01578 Neural Architecture Search (NAS) with Reinforcement Learning | Barret Zoph & Quoc V. Le]  ...[http://en.wikipedia.org/wiki/Neural_architecture_search#NAS_with_Reinforcement_Learning  Wikipedia]

Difference between revisions of "Reinforcement Learning (RL)"

Revision as of 14:33, 11 August 2019

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools