Difference between revisions of "Large Language Model (LLM)"

From
Jump to: navigation, search
m
m
Line 9: Line 9:
  
 
* Models:
 
* Models:
** [[Toolformer]] | [[Meta]] ... models can teach themselves to use tools and APIs
+
** [https://opt.alpa.ai/ Alpa]  ... serving large models like GPT-3 simple, affordable, accessible
 +
** [https://github.com/microsoft/BioGPT BioGPT]  ... [[Microsoft]] language model trained for biomedical tasks
 +
** [https://bigscience.notion.site/BLOOM-BigScience-176B-Model-ad073ca07cdf479398d5f95d88e218c4 BLOOM]  ... Big Science Language Open-science Open-access Multilingual  ... 176B
 +
** [https://cedille.ai/ Cedille] ... open-source French language model
 
** [[ChatGPT]] | [[OpenAI]]
 
** [[ChatGPT]] | [[OpenAI]]
 
*** [https://www.technologyreview.com/2023/02/08/1068068/chatgpt-is-everywhere-heres-where-it-came-from/ ChatGPT is everywhere. Here’s where it came from | Will Douglas Heaven - MIT Technology Review]
 
*** [https://www.technologyreview.com/2023/02/08/1068068/chatgpt-is-everywhere-heres-where-it-came-from/ ChatGPT is everywhere. Here’s where it came from | Will Douglas Heaven - MIT Technology Review]
Line 17: Line 20:
 
**** [[Supervised]] Learning
 
**** [[Supervised]] Learning
 
**** [[Proximal Policy Optimization (PPO)]]
 
**** [[Proximal Policy Optimization (PPO)]]
** [https://opt.alpa.ai/ Alpa]  ... serving large models like GPT-3 simple, affordable, accessible
 
** [https://github.com/microsoft/BioGPT BioGPT]  ... [[Microsoft]] language model trained for biomedical tasks
 
** [https://bigscience.notion.site/BLOOM-BigScience-176B-Model-ad073ca07cdf479398d5f95d88e218c4 BLOOM]  ... Big Science Language Open-science Open-access Multilingual  ... 176B
 
** [https://cedille.ai/ Cedille]  ... open-source French language model
 
 
** [https://www.deepmind.com/publications/an-empirical-analysis-of-compute-optimal-large-language-model-training Chinchilla |] [[Google | DeepMind]]
 
** [https://www.deepmind.com/publications/an-empirical-analysis-of-compute-optimal-large-language-model-training Chinchilla |] [[Google | DeepMind]]
 
** [https://arxiv.org/abs/2203.15556 ctrl] ... a Conditional Transformer Language Model for Controllable Generation | Salesforce
 
** [https://arxiv.org/abs/2203.15556 ctrl] ... a Conditional Transformer Language Model for Controllable Generation | Salesforce
Line 52: Line 51:
 
** [https://huggingface.co/bigscience/T0pp  T0pp |] [[Hugging Face]]
 
** [https://huggingface.co/bigscience/T0pp  T0pp |] [[Hugging Face]]
 
** [https://ai.facebook.com/blog/textless-nlp-generating-expressive-speech-from-raw-audio/  Textless NLP  ... Generating expressive speech from raw audio]
 
** [https://ai.facebook.com/blog/textless-nlp-generating-expressive-speech-from-raw-audio/  Textless NLP  ... Generating expressive speech from raw audio]
 +
** [[Toolformer]] | [[Meta]] ... models can teach themselves to use tools and APIs
 
** [https://github.com/allenai/unifiedqa  UnifiedQA]  ... single QA system
 
** [https://github.com/allenai/unifiedqa  UnifiedQA]  ... single QA system
 
** [https://openai.com/blog/webgpt/ WebGPT] ... GPT-3 version that can search the web
 
** [https://openai.com/blog/webgpt/ WebGPT] ... GPT-3 version that can search the web

Revision as of 23:57, 24 February 2023

YouTube search... ...Google search