Difference between revisions of "World Models"
m |
m |
||
Line 17: | Line 17: | ||
* [[Inside Out - Curious Optimistic Reasoning]] | * [[Inside Out - Curious Optimistic Reasoning]] | ||
* [http://worldmodels.github.io/ Recurrent World Models Facilitate Policy Evolution] | * [http://worldmodels.github.io/ Recurrent World Models Facilitate Policy Evolution] | ||
+ | * [[Policy]] ... [[Policy vs Plan]] ... [[Constitutional AI]] ... [[Trust Region Policy Optimization (TRPO)]] ... [[Policy Gradient (PG)]] ... [[Proximal Policy Optimization (PPO)]] | ||
<youtube>IZPKohYNri4</youtube> | <youtube>IZPKohYNri4</youtube> |
Revision as of 15:36, 16 April 2023
YouTube search... ...Google search
- World Models | NIPS 2018 - GitHub
- World Models | David Ha, Jürgen Schmidhuber
- World Models in TensorFlow — Episode 1.0 — OpenAi Gym Race Car | Dario Cazzani
- Automated Learning
- Building Your Environment
- Simulated Environment Learning
- 3D Simulation Environments
- Inside Out - Curious Optimistic Reasoning
- Recurrent World Models Facilitate Policy Evolution
- Policy ... Policy vs Plan ... Constitutional AI ... Trust Region Policy Optimization (TRPO) ... Policy Gradient (PG) ... Proximal Policy Optimization (PPO)
Unity AI
YouTube search... ...Google search