Difference between revisions of "Policy"

From
Jump to: navigation, search
m
m
Line 17: Line 17:
 
* [[Trust Region Policy Optimization (TRPO)]]
 
* [[Trust Region Policy Optimization (TRPO)]]
 
* [[Proximal Policy Optimization (PPO)]]
 
* [[Proximal Policy Optimization (PPO)]]
* [[Privacy]] ... [[Privacy policy]]
+
* [[Privacy]]
 
* [[Loop]]
 
* [[Loop]]
 
* [[Apprenticeship Learning - Inverse Reinforcement Learning (IRL)]]
 
* [[Apprenticeship Learning - Inverse Reinforcement Learning (IRL)]]

Revision as of 10:59, 26 March 2023