Difference between revisions of "Policy"
m |
m |
||
| Line 17: | Line 17: | ||
* [[Trust Region Policy Optimization (TRPO)]] | * [[Trust Region Policy Optimization (TRPO)]] | ||
* [[Proximal Policy Optimization (PPO)]] | * [[Proximal Policy Optimization (PPO)]] | ||
| − | * [[Privacy | + | * [[Privacy]] |
* [[Loop]] | * [[Loop]] | ||
* [[Apprenticeship Learning - Inverse Reinforcement Learning (IRL)]] | * [[Apprenticeship Learning - Inverse Reinforcement Learning (IRL)]] | ||
Revision as of 10:59, 26 March 2023
YouTube ... Quora ...Google search ...Google News ...Bing News
- Ethics
- Policy vs Plan
- Policy Gradient (PG)
- Trust Region Policy Optimization (TRPO)
- Proximal Policy Optimization (PPO)
- Privacy
- Loop
- Apprenticeship Learning - Inverse Reinforcement Learning (IRL)
- Bias and Variances
- Government Services
- Gaming
- Assistants ... Hybrid Assistants ... Agents ... Negotiation ... LangChain
- Generative AI ... OpenAI's ChatGPT ... Perplexity ... Microsoft's BingAI ... You ...Google's Bard ... Baidu's Ernie