Difference between revisions of "Reinforcement Learning (RL)"

From
Jump to: navigation, search
(Reinforcement Learning | Phil Tabor)
(Reinforcement Learning | Phil Tabor)
Line 71: Line 71:
  
  
⌨️ ([http://www.youtube.com/watch?v=ELE2_Mftqoc&t=0s 00:00:00]) Intro
+
⌨️ ([http://www.youtube.com/watch?v=ELE2_Mftqoc&t=0s 00:00:00]) Introduction
⌨️ (00:01:30) Intro to Deep Q Learning  
+
 
⌨️ (00:08:56) How to Code Deep Q Learning in Tensorflow  
+
⌨️ ([http://www.youtube.com/watch?v=ELE2_Mftqoc&t=90s 00:01:30]) Intro to Deep Q Learning  
⌨️ (00:52:03) Deep Q Learning with Pytorch Part 1: The Q Network  
+
 
⌨️ (01:06:21) Deep Q Learning with Pytorch part 2: Coding the Agent  
+
⌨️ ([http://www.youtube.com/watch?v=ELE2_Mftqoc&t=536s 00:08:56]) How to Code Deep Q Learning in Tensorflow  
⌨️ (01:28:54) Deep Q Learning with Pytorch part
+
 
⌨️ (01:46:39) Intro to Policy Gradients  3: Coding the main loop  
+
⌨️ ([http://www.youtube.com/watch?v=ELE2_Mftqoc&t=3123s 00:52:03]) Deep Q Learning with Pytorch Part 1: The Q Network  
⌨️ (01:55:01) How to Beat Lunar Lander with Policy Gradients  
+
 
⌨️ (02:21:32) How to Beat Space Invaders with Policy Gradients  
+
⌨️ ([http://www.youtube.com/watch?v=ELE2_Mftqoc&t=3981s 01:06:21]) Deep Q Learning with Pytorch part 2: Coding the Agent  
 +
 
 +
⌨️ ([http://www.youtube.com/watch?v=ELE2_Mftqoc&t=5334s 01:28:54]) Deep Q Learning with Pytorch part 3
 +
 
 +
⌨️ ([http://www.youtube.com/watch?v=ELE2_Mftqoc&t=6399s 01:46:39]) Intro to Policy Gradients  3: Coding the main loop  
 +
 
 +
⌨️ ([http://www.youtube.com/watch?v=ELE2_Mftqoc&t=6901s 01:55:01]) How to Beat Lunar Lander with Policy Gradients  
 +
 
 +
⌨️ ([http://www.youtube.com/watch?v=ELE2_Mftqoc&t=8492s 02:21:32]) How to Beat Space Invaders with Policy Gradients  
 +
 
 
⌨️ (02:34:41) How to Create Your Own Reinforcement Learning Environment Part 1  
 
⌨️ (02:34:41) How to Create Your Own Reinforcement Learning Environment Part 1  
 +
 
⌨️ (02:55:39) How to Create Your Own Reinforcement Learning Environment Part 2  
 
⌨️ (02:55:39) How to Create Your Own Reinforcement Learning Environment Part 2  
 +
 
⌨️ (03:08:20) Fundamentals of Reinforcement Learning  
 
⌨️ (03:08:20) Fundamentals of Reinforcement Learning  
 +
 
⌨️ (03:17:09) Markov Decision Processes  
 
⌨️ (03:17:09) Markov Decision Processes  
 +
 
⌨️ (03:23:02) The Explore Exploit Dilemma  
 
⌨️ (03:23:02) The Explore Exploit Dilemma  
 +
 
⌨️ (03:29:19) Reinforcement Learning in the Open AI Gym: SARSA  
 
⌨️ (03:29:19) Reinforcement Learning in the Open AI Gym: SARSA  
 +
 
⌨️ (03:39:56) Reinforcement Learning in the Open AI Gym: Double Q Learning  
 
⌨️ (03:39:56) Reinforcement Learning in the Open AI Gym: Double Q Learning  
 +
 
⌨️ (03:54:07) Conclusion  
 
⌨️ (03:54:07) Conclusion  
 +
  
 
<youtube>ELE2_Mftqoc</youtube>
 
<youtube>ELE2_Mftqoc</youtube>

Revision as of 11:38, 1 September 2019

YouTube search... ...Google search

___________________________________________________________

This is a bit similar to the traditional type of data analysis; the algorithm discovers through trial and error and decides which action results in greater rewards. Three major components can be identified in reinforcement learning functionality: the agent, the environment, and the actions. The agent is the learner or decision-maker, the environment includes everything that the agent interacts with, and the actions are what the agent can do. Reinforcement learning occurs when the agent chooses actions that maximize the expected reward over a given time. This is best achieved when the agent has a good policy to follow. Machine Learning: What it is and Why it Matters | Priyadharshini @ simplilearn

Machine_Learning_5.jpg

Q Learning Algorithm and Agent - Reinforcement Learning w/ Python Tutorial | Sentdex - Harrison

P.1

P.2

P.3

P.4

P.5

P.6

Reinforcement Learning | Phil Tabor


⌨️ (00:00:00) Introduction

⌨️ (00:01:30) Intro to Deep Q Learning

⌨️ (00:08:56) How to Code Deep Q Learning in Tensorflow

⌨️ (00:52:03) Deep Q Learning with Pytorch Part 1: The Q Network

⌨️ (01:06:21) Deep Q Learning with Pytorch part 2: Coding the Agent

⌨️ (01:28:54) Deep Q Learning with Pytorch part 3

⌨️ (01:46:39) Intro to Policy Gradients 3: Coding the main loop

⌨️ (01:55:01) How to Beat Lunar Lander with Policy Gradients

⌨️ (02:21:32) How to Beat Space Invaders with Policy Gradients

⌨️ (02:34:41) How to Create Your Own Reinforcement Learning Environment Part 1

⌨️ (02:55:39) How to Create Your Own Reinforcement Learning Environment Part 2

⌨️ (03:08:20) Fundamentals of Reinforcement Learning

⌨️ (03:17:09) Markov Decision Processes

⌨️ (03:23:02) The Explore Exploit Dilemma

⌨️ (03:29:19) Reinforcement Learning in the Open AI Gym: SARSA

⌨️ (03:39:56) Reinforcement Learning in the Open AI Gym: Double Q Learning

⌨️ (03:54:07) Conclusion


Jump Start

Lunar Lander: Deep Q learning is Easy in PyTorch

Lunar Lander: How to Beat Lunar Lander with Policy Gradients | Tensorflow Tutorial

Breakout: How to Code Deep Q Learning in Tensorflow (Tutorial)

Gridworld: How To Create Your Own Reinforcement Learning Environments