Difference between revisions of "AI Verification and Validation"
m |
m (Text replacement - "* Conversational AI ... ChatGPT | OpenAI ... Bing | Microsoft ... Bard | Google ... Claude | Anthropic ... Perplexity ... You ... Ernie | Baidu" to "* Conversational AI ... [[C...) |
||
| Line 41: | Line 41: | ||
* [https://www.guardrailsai.com/ Guardrails AI] | * [https://www.guardrailsai.com/ Guardrails AI] | ||
** [https://github.com/shreyaR/guardrails Guardrails } GitHub] | ** [https://github.com/shreyaR/guardrails Guardrails } GitHub] | ||
| − | * [[Conversational AI]] ... [[ChatGPT]] | [[OpenAI]] ... [[Bing]] | [[Microsoft]] ... [[ | + | * [[Conversational AI]] ... [[ChatGPT]] | [[OpenAI]] ... [[Bing/Copilot]] | [[Microsoft]] ... [[Gemini]] | [[Google]] ... [[Claude]] | [[Anthropic]] ... [[Perplexity]] ... [[You]] ... [[phind]] ... [[Ernie]] | [[Baidu]] |
* [[Large Language Model (LLM)]] ... [[Large Language Model (LLM)#Multimodal|Multimodal]] ... [[Foundation Models (FM)]] ... [[Generative Pre-trained Transformer (GPT)|Generative Pre-trained]] ... [[Transformer]] ... [[GPT-4]] ... [[GPT-5]] ... [[Attention]] ... [[Generative Adversarial Network (GAN)|GAN]] ... [[Bidirectional Encoder Representations from Transformers (BERT)|BERT]] | * [[Large Language Model (LLM)]] ... [[Large Language Model (LLM)#Multimodal|Multimodal]] ... [[Foundation Models (FM)]] ... [[Generative Pre-trained Transformer (GPT)|Generative Pre-trained]] ... [[Transformer]] ... [[GPT-4]] ... [[GPT-5]] ... [[Attention]] ... [[Generative Adversarial Network (GAN)|GAN]] ... [[Bidirectional Encoder Representations from Transformers (BERT)|BERT]] | ||
Revision as of 10:43, 16 March 2024
YouTube ... Quora ...Google search ...Google News ...Bing News
- Risk, Compliance and Regulation ... Ethics ... Privacy ... Law ... AI Governance ... AI Verification and Validation
- Data Quality ... validity, accuracy, cleaning, completeness, consistency, encoding, padding, augmentation, labeling, auto-tagging, normalization, standardization, imbalanced data
- Artificial General Intelligence (AGI) to Singularity ... Curious Reasoning ... Emergence ... Moonshots ... Explainable AI ... Automated Learning
- Policy ... Policy vs Plan ... Constitutional AI ... Trust Region Policy Optimization (TRPO) ... Policy Gradient (PG) ... Proximal Policy Optimization (PPO)
- Strategy & Tactics ... Project Management ... Best Practices ... Checklists ... Project Check-in ... Evaluation ... Measures
- AI Governance / Algorithm Administration
- Data Science ... Governance ... Preprocessing ... Exploration ... Interoperability ... Master Data Management (MDM) ... Bias and Variances ... Benchmarks ... Datasets
- Development ... Notebooks ... AI Pair Programming ... Codeless, Generators, Drag n' Drop ... AIOps/MLOps ... AIaaS/MLaaS
- Libraries & Frameworks Overview ... Libraries & Frameworks ... Git - GitHub and GitLab ... Other Coding options
- AI Solver ... Algorithms ... Administration ... Model Search ... Discriminative vs. Generative ... Train, Validate, and Test
- AI Agent Optimization ... Optimization Methods ... Optimizer ... Objective vs. Cost vs. Loss vs. Error Function ... Exploration
- Policy ... Policy vs Plan ... Constitutional AI ... Trust Region Policy Optimization (TRPO) ... Policy Gradient (PG) ... Proximal Policy Optimization (PPO)
- Testing of Artificial Intelligence | Sogeti
- Other Challenges in Artificial Intelligence
- Data Science Concepts Explained to a Five-year-old | Megan Dibble - Toward Data Science
Guardrails AI
- Guardrails AI
- Conversational AI ... ChatGPT | OpenAI ... Bing/Copilot | Microsoft ... Gemini | Google ... Claude | Anthropic ... Perplexity ... You ... phind ... Ernie | Baidu
- Large Language Model (LLM) ... Multimodal ... Foundation Models (FM) ... Generative Pre-trained ... Transformer ... GPT-4 ... GPT-5 ... Attention ... GAN ... BERT
Guardrails AI is a Python package for specifying structure and type, validating and correcting the outputs of large language models (LLMs). Guardrails AI works by wrapping around LLM API calls to structure, validate, and correct the outputs. It can be used to enforce a wide range of requirements, such as:
- Ensuring that the output is of a certain type (e.g., JSON, Python code, etc.)
- Checking for bias in the output
- Identifying and correcting factual errors
- Preventing the output from containing certain keywords or phrases
Guardrails AI can be used to improve the safety and reliability of LLMs in a wide range of applications, such as:
- Generating text for websites and blogs
- Writing code and scripts
- Translating languages
- Answering questions in a comprehensive and informative way
Here are some examples of how Guardrails AI can be used:
- A news organization could use Guardrails AI to ensure that the articles it generates are free of bias and factual errors.
- A software company could use Guardrails AI to generate code that is well-formatted and bug-free.
- A customer service chatbot could use Guardrails AI to ensure that its responses are helpful and informative.
Testing
Covering both..
- Testing ‘of’ AI
- Testing ‘with’ AI
|
|
|
|
|
|
|
|
A/B Testing
YouTube search... ...Google search
- Data science you need to know! A/B testing | Michael Barber - Towards Data Science
- A/B Testing for Data Science | Anjali Tiwari - Analytics Vidhya - Medium
- A Data Analyst guide to A/B testing | Jacob Joseph - CleverTap - KDnuggets
A/B testing (also known as bucket testing or split-run testing) is a user experience research methodology. A/B tests consist of a randomized experiment with two variants, A and B. It includes application of statistical hypothesis testing or "two-sample hypothesis testing" as used in the field of statistics. A/B testing is a way to compare two versions of a single variable, typically by testing a subject's response to variant A against variant B, and determining which of the two variants is more effective. Wikipedia
|
|
|
|
Multivariate Testing (MVT)
YouTube search... ...Google search
- Key Differences Between Multivariate Testing (MVT) & A/B Testing | Paras Chopra - Visual Website Optimizer (VWO)
- What is Multivariate Testing? | Khalid Saleh - invesp
- Multivariate Testing 101: A Scientific Method Of Optimizing Design | Paras Chopra - Visual Website Optimizer (VWO)
Multivariate testing is a technique for testing a hypothesis in which multiple variables are modified. The goal of multivariate testing is to determine which combination of variations performs the best out of all of the possible combinations. Websites and mobile apps are made of combinations of changeable elements. A multivariate test will change multiple elements, like changing a picture and headline at the same time. Three variations of the image and two variations of the headline are combined to create six versions of the content, which are tested concurrently to find the winning variation. Optimizely
|
|