Difference between revisions of "World Models"

From
Jump to: navigation, search
m
m
Line 17: Line 17:
 
* [[Inside Out - Curious Optimistic Reasoning]]
 
* [[Inside Out - Curious Optimistic Reasoning]]
 
* [http://worldmodels.github.io/ Recurrent World Models Facilitate Policy Evolution]
 
* [http://worldmodels.github.io/ Recurrent World Models Facilitate Policy Evolution]
 +
* [[Policy]]  ... [[Policy vs Plan]] ... [[Constitutional AI]] ... [[Trust Region Policy Optimization (TRPO)]] ... [[Policy Gradient (PG)]] ... [[Proximal Policy Optimization (PPO)]]
  
 
<youtube>IZPKohYNri4</youtube>
 
<youtube>IZPKohYNri4</youtube>

Revision as of 15:36, 16 April 2023