Difference between revisions of "Policy"
m |
m |
||
| Line 17: | Line 17: | ||
* [[Proximal Policy Optimization (PPO)]] | * [[Proximal Policy Optimization (PPO)]] | ||
* [[Privacy]] ... [[Privacy policy]] | * [[Privacy]] ... [[Privacy policy]] | ||
| + | * [[Loop]] | ||
* [[Apprenticeship Learning - Inverse Reinforcement Learning (IRL)]] | * [[Apprenticeship Learning - Inverse Reinforcement Learning (IRL)]] | ||
* [[Government Services]] | * [[Government Services]] | ||
| Line 22: | Line 23: | ||
* [[Assistants]] ... [[Hybrid Assistants]] ... [[Agents]] ... [[Negotiation]] ... [[LangChain]] | * [[Assistants]] ... [[Hybrid Assistants]] ... [[Agents]] ... [[Negotiation]] ... [[LangChain]] | ||
* [[Generative AI]] ... [[OpenAI]]'s [[ChatGPT]] ... [[Perplexity]] ... [[Microsoft]]'s [[BingAI]] ... [[You]] ...[[Google]]'s [[Bard]] ... [[Baidu]]'s [[Ernie]] | * [[Generative AI]] ... [[OpenAI]]'s [[ChatGPT]] ... [[Perplexity]] ... [[Microsoft]]'s [[BingAI]] ... [[You]] ...[[Google]]'s [[Bard]] ... [[Baidu]]'s [[Ernie]] | ||
| + | |||
<youtube>PO8-fegV4X0</youtube> | <youtube>PO8-fegV4X0</youtube> | ||
Revision as of 10:45, 26 March 2023
YouTube ... Quora ...Google search ...Google News ...Bing News
- Policy vs Plan
- Policy Gradient (PG)
- Trust Region Policy Optimization (TRPO)
- Proximal Policy Optimization (PPO)
- Privacy ... Privacy policy
- Loop
- Apprenticeship Learning - Inverse Reinforcement Learning (IRL)
- Government Services
- Gaming
- Assistants ... Hybrid Assistants ... Agents ... Negotiation ... LangChain
- Generative AI ... OpenAI's ChatGPT ... Perplexity ... Microsoft's BingAI ... You ...Google's Bard ... Baidu's Ernie