Difference between revisions of "Policy"
m |
m |
||
| Line 12: | Line 12: | ||
| + | * [[Ethics]] | ||
* [[Policy vs Plan]] | * [[Policy vs Plan]] | ||
* [[Policy Gradient (PG)]] | * [[Policy Gradient (PG)]] | ||
| Line 19: | Line 20: | ||
* [[Loop]] | * [[Loop]] | ||
* [[Apprenticeship Learning - Inverse Reinforcement Learning (IRL)]] | * [[Apprenticeship Learning - Inverse Reinforcement Learning (IRL)]] | ||
| + | * [[Bias and Variances]] | ||
* [[Government Services]] | * [[Government Services]] | ||
* [[Gaming]] | * [[Gaming]] | ||
Revision as of 10:55, 26 March 2023
YouTube ... Quora ...Google search ...Google News ...Bing News
- Ethics
- Policy vs Plan
- Policy Gradient (PG)
- Trust Region Policy Optimization (TRPO)
- Proximal Policy Optimization (PPO)
- Privacy ... Privacy policy
- Loop
- Apprenticeship Learning - Inverse Reinforcement Learning (IRL)
- Bias and Variances
- Government Services
- Gaming
- Assistants ... Hybrid Assistants ... Agents ... Negotiation ... LangChain
- Generative AI ... OpenAI's ChatGPT ... Perplexity ... Microsoft's BingAI ... You ...Google's Bard ... Baidu's Ernie