Difference between revisions of "Deep Q Network (DQN)"

Latest revision as of 09:09, 28 March 2023

Reinforcement Learning (RL)
- Monte Carlo (MC) Method - Model Free Reinforcement Learning
- Markov Decision Process (MDP)
- State-Action-Reward-State-Action (SARSA)
- Q Learning
  - Deep Q Network (DQN)
- Deep Reinforcement Learning (DRL) DeepRL
- Distributed Deep Reinforcement Learning (DDRL)
- Evolutionary Computation / Genetic Algorithms
- Actor Critic
- Hierarchical Reinforcement Learning (HRL)

Gaming

Deep Q learning (DQN), as published in Playing Atari with Deep Reinforcement Learning | Mnih et al, 2013, leverages advances in deep learning to learn policies from high dimensional sensory input. A convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. Vanilla Deep Q Networks: Deep Q Learning Explained | Chris Yoon - Towards Data Science

Training deep neural networks to show that a novel end-to-end reinforcement learning agent, termed a deep Q-network (DQN) Human-level control through Deep Reinforcement Learning | Deepmind

@@ Line 1: / Line 1: @@
-== Q Learning (DQN) ==
+{{#seo:
-[http://www.youtube.com/results?search_query=deep+reinforcement+q+learning+artificial+intelligence+ Youtube search...]
+|title=PRIMO.ai
+|titlemode=append
+|keywords=artificial, intelligence, machine, learning, models, algorithms, data, singularity, moonshot, Tensorflow, Google, Nvidia, Microsoft, Azure, Amazon, AWS
+|description=Helpful resources for your journey with artificial intelligence; videos, articles, techniques, courses, profiles, and tools
+}}
+[https://www.youtube.com/results?search_query=deep+reinforcement+q+learning+artificial+intelligence+ Youtube search...]
+[https://www.google.com/search?q=deep+reinforcement+q+learning+machine+learning+ML+artificial+intelligence ...Google search]
+* [[Reinforcement Learning (RL)]]
+** [[Monte Carlo]] (MC) Method - Model Free Reinforcement Learning
+** [[Markov Decision Process (MDP)]]
+** [[State-Action-Reward-State-Action (SARSA)]]
+** [[Q Learning]]
+*** Deep Q Network (DQN)
+** [[Deep Reinforcement Learning (DRL)]] DeepRL
+** [[Distributed Deep Reinforcement Learning (DDRL)]]
+** [[Evolutionary Computation / Genetic Algorithms]]
+** [[Actor Critic]]
+*** [[Asynchronous Advantage Actor Critic (A3C)]]
+*** [[Advanced Actor Critic (A2C)]]
+*** [[Lifelong Latent Actor-Critic (LILAC)]]
+** [[Hierarchical Reinforcement Learning (HRL)]]
-* [[Deep Reinforcement Learning (DRL)]]
 * [[Gaming]]
-* [http://en.wikipedia.org/wiki/Q-learning Wikipedia]
-When feedback is provided, it might be long time after the fateful decision has been made. In reality, the feedback is likely to be the result of a large number of prior decisions, taken amid a shifting, uncertain environment. Unlike supervised learning, there are no correct input/output pairs, so suboptimal actions are not explicitly corrected, wrong actions just decrease the corresponding value in the Q-table, meaning there’s less chance choosing the same action should the same state be encountered again. [http://www.quora.com/How-does-Q-learning-work-1 Quora | Jaron Collis]
+Deep Q learning (DQN), as published in [https://arxiv.org/abs/1312.5602 Playing Atari with Deep Reinforcement Learning | Mnih et al, 2013], leverages advances in deep learning to learn policies from high dimensional sensory input. A convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. [https://towardsdatascience.com/dqn-part-1-vanilla-deep-q-networks-6eb4a00febfb Vanilla Deep Q Networks: Deep Q Learning Explained | Chris Yoon - Towards Data Science]
-Training deep neural networks to show that a novel end-to-end reinforcement learning agent, termed a deep Q-network (DQN) [http://deepmind.com/research/dqn/ Human-level control through Deep Reinforcement Learning | Deepmind]
+Training deep neural networks to show that a novel end-to-end reinforcement learning [[Agents|agent]], termed a deep Q-network (DQN) [https://deepmind.com/research/dqn/ Human-level control through Deep Reinforcement Learning | Deepmind]
 <youtube>79pmNdyxEGo</youtube>
-<youtube>A5eihauRQvo</youtube>
-<youtube>aCEvtRtNO-M</youtube>
-<youtube>nSxaG_Kjw_w</youtube>
 <youtube>V1eYniJ0Rnk</youtube>
-<youtube>1XRahNzA5bE</youtube>
+<youtube>fevMOp5TDQs</youtube>
+<youtube>5fHngyN8Qhw</youtube>

Difference between revisions of "Deep Q Network (DQN)"

Latest revision as of 09:09, 28 March 2023

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools