Distributed Deep Reinforcement Learning (DDRL) - Revision history

BPeat at 20:36, 16 April 2023

2023-04-16T20:36:27Z

2023-03-28T14:15:59Z

Text replacement - "http:" to "https:"

2023-02-12T17:24:01Z

2023-02-04T12:56:45Z

2020-09-26T15:52:59Z

2020-07-06T11:19:15Z

2020-07-06T11:10:12Z

2020-07-06T11:09:32Z

2019-09-02T14:37:44Z

2019-09-02T01:01:41Z

← Older revision		Revision as of 20:36, 16 April 2023
Line 25:		Line 25:
	** [[Hierarchical Reinforcement Learning (HRL)]]		** [[Hierarchical Reinforcement Learning (HRL)]]
	* [[Agents]] ... [[Agents#Communication \| communications]]		* [[Agents]] ... [[Agents#Communication \| communications]]
		+	* [[Policy]] ... [[Policy vs Plan]] ... [[Constitutional AI]] ... [[Trust Region Policy Optimization (TRPO)]] ... [[Policy Gradient (PG)]] ... [[Proximal Policy Optimization (PPO)]]

@@ Line 5: / Line 5: @@
 |description=Helpful resources for your journey with artificial intelligence; videos, articles, techniques, courses, profiles, and tools
 }}
-[http://www.youtube.com/results?search_query=Distributed+Deep+Reinforcement+Learning+DeepRL Youtube search...]
+[https://www.youtube.com/results?search_query=Distributed+Deep+Reinforcement+Learning+DeepRL Youtube search...]
-[http://www.google.com/search?q=Distributed+Deep+Reinforcement+Learning+DeepRL+machine+learning+ML+artificial+intelligence ...Google search]
+[https://www.google.com/search?q=Distributed+Deep+Reinforcement+Learning+DeepRL+machine+learning+ML+artificial+intelligence ...Google search]
-* [http://deepmind.com/blog/impala-scalable-distributed-deeprl-dmlab-30/ Importance Weighted Actor-Learner Architectures: Scalable Distributed DeepRL in DMLab-30]
+* [https://deepmind.com/blog/impala-scalable-distributed-deeprl-dmlab-30/ Importance Weighted Actor-Learner Architectures: Scalable Distributed DeepRL in DMLab-30]
 * [[Decentralized: Federated & Distributed]] Learning
 * [[Reinforcement Learning (RL)]]

@@ Line 24: / Line 24: @@
 *** [[Lifelong Latent Actor-Critic (LILAC)]]
 ** [[Hierarchical Reinforcement Learning (HRL)]]
-* [[Agents]]
+* [[Agents]]   ... [[Agents#Communication | communications]]

@@ Line 24: / Line 24: @@
 *** [[Lifelong Latent Actor-Critic (LILAC)]]
 ** [[Hierarchical Reinforcement Learning (HRL)]]
-a new, highly scalable agent architecture for distributed training called Importance Weighted Actor-Learner Architecture that uses a new off-policy correction algorithm called V-trace.
+a new, highly scalable [[Agents|agent]] architecture for distributed training called Importance Weighted Actor-Learner Architecture that uses a new off-policy correction algorithm called V-trace.
 <youtube>-YMfJLFynmA</youtube>

@@ Line 9: / Line 9: @@
 * [http://deepmind.com/blog/impala-scalable-distributed-deeprl-dmlab-30/ Importance Weighted Actor-Learner Architectures: Scalable Distributed DeepRL in DMLab-30]
-* [[Federated]] Learning
+* [[Decentralized: Federated & Distributed]] Learning
 * [[Reinforcement Learning (RL)]]
 ** [[Monte Carlo]] (MC) Method - Model Free Reinforcement Learning

@@ Line 16: / Line 16: @@
 ** [[Q Learning]]
 *** [[Deep Q Network (DQN)]]
-** Deep Reinforcement Learning (DRL) DeepRL
+** [[Deep Reinforcement Learning (DRL)]] DeepRL
-** [[Distributed Deep Reinforcement Learning (DDRL)]]
+** Distributed Deep Reinforcement Learning (DDRL)
 ** [[Evolutionary Computation / Genetic Algorithms]]
 ** [[Actor Critic]]