Difference between revisions of "Reinforcement Learning (RL)"

From
Jump to: navigation, search
Line 12: Line 12:
 
* [[Deep Reinforcement Learning (DRL)]] - DeepRL
 
* [[Deep Reinforcement Learning (DRL)]] - DeepRL
 
* Neural Architecture Search (NAS)
 
* Neural Architecture Search (NAS)
** [http://en.wikipedia.org/wiki/Neural_architecture_search#NAS_with_Reinforcement_Learning Neural Architecture Search (NAS) with Reinforcement Learning | Wikipedia]
+
** [http://en.wikipedia.org/wiki/Neural_architecture_search#NAS_with_Reinforcement_Learning NAS with Reinforcement Learning | Wikipedia]
** [http://arxiv.org/abs/1611.01578 Neural Architecture Search with Reinforcement Learning | Barret Zoph & Quoc V. Le]
+
** [http://arxiv.org/abs/1611.01578 NAS with Reinforcement Learning | Barret Zoph & Quoc V. Le]
 
* [[Distributed Deep Reinforcement Learning (DeepRL)]]
 
* [[Distributed Deep Reinforcement Learning (DeepRL)]]
 
* [[Deep Q Learning (DQN)]]
 
* [[Deep Q Learning (DQN)]]

Revision as of 09:20, 23 March 2019

YouTube search... ...Google search

___________________________________________________________

This is a bit similar to the traditional type of data analysis; the algorithm discovers through trial and error and decides which action results in greater rewards. Three major components can be identified in reinforcement learning functionality: the agent, the environment, and the actions. The agent is the learner or decision-maker, the environment includes everything that the agent interacts with, and the actions are what the agent can do. Reinforcement learning occurs when the agent chooses actions that maximize the expected reward over a given time. This is best achieved when the agent has a good policy to follow. Machine Learning: What it is and Why it Matters | Priyadharshini @ simplilearn

Machine_Learning_5.jpg