Difference between revisions of "Policy Gradient (PG)"

Line 13:
  * [[Reinforcement Learning (RL)]]
  * [[Gradient Descent Optimization & Challenges]]
+ * [[Policy]]
+ * [[Assistants]] ... [[Hybrid Assistants]] ... [[Agents]] ... [[Negotiation]] ... [[LangChain]]
+ * [[Generative AI]] ... [[OpenAI]]'s [[ChatGPT]] ... [[Perplexity]] ... [[Microsoft]]'s [[BingAI]] ... [[You]] ... [[Google]]'s [[Bard]] ... [[Baidu]]'s [[Ernie]]
  
 
<youtube>IS0V8z8HXrM</youtube>

Revision as of 10:50, 26 March 2023