Difference between revisions of "Constitutional AI"

From
Jump to: navigation, search
m
m
Line 5: Line 5:
 
|description=Helpful resources for your journey with artificial intelligence; videos, articles, techniques, courses, profiles, and tools  
 
|description=Helpful resources for your journey with artificial intelligence; videos, articles, techniques, courses, profiles, and tools  
 
}}
 
}}
[https://www.youtube.com/results?search_query=ai+Reinforcement+Human+Feedback+RLHF YouTube]
+
[https://www.youtube.com/results?search_query=Constitutional+AI YouTube]
[https://www.quora.com/search?q=ai%20Reinforcement%20Human%20Feedback%20XRLHF ... Quora]
+
[https://www.quora.com/search?q=Constitutional%20AI ... Quora]
[https://www.google.com/search?q=ai+Reinforcement+Human+Feedback+RLHF ...Google search]
+
[https://www.google.com/search?q=Constitutional+AI ...Google search]
[https://news.google.com/search?q=ai+Reinforcement+Human+Feedback+RLHF ...Google News]
+
[https://news.google.com/search?q=Constitutional+AI ...Google News]
[https://www.bing.com/news/search?q=ai+Reinforcement+Human+Feedback+RLHF&qft=interval%3d%228%22 ...Bing News]
+
[https://www.bing.com/news/search?q=Constitutional+AI&qft=interval%3d%228%22 ...Bing News]
  
 
* [[Reinforcement Learning (RL)]]
 
* [[Reinforcement Learning (RL)]]
Line 37: Line 37:
 
<youtube>i5rzqACykYk</youtube>
 
<youtube>i5rzqACykYk</youtube>
 
<youtube>quyqRIHRa60</youtube>
 
<youtube>quyqRIHRa60</youtube>
 +
<youtube>fqC3D-zNJUM</youtube>
  
 
= Claude | Anthropic =
 
= Claude | Anthropic =
Line 46: Line 47:
 
<youtube>_TAWaueEmoY</youtube>
 
<youtube>_TAWaueEmoY</youtube>
 
<youtube>Us-OAs9hDI4</youtube>
 
<youtube>Us-OAs9hDI4</youtube>
 +
<youtube>B7Mg8Hbcc0w</youtube>

Revision as of 14:55, 16 April 2023

YouTube ... Quora ...Google search ...Google News ...Bing News


Constitutional AI is a method for training AI systems using a set of rules or principles that act as a “constitution” for the AI system. This approach allows the AI system to operate within a societally accepted framework and aligns it with human intentions1.

Some benefits of using Constitutional AI include allowing a model to explain why it is refusing to provide an answer, improving transparency of AI decision making, and controlling AI behavior more precisely with fewer human labels.

The Constitutional AI methodology has two phases, similar to Reinforcement Learning (RL) from Human Feedback (RLHF).

1. The Supervised Learning Phase.


2. The Reinforcement Learning Phase.


Claude | Anthropic