Difference between revisions of "Large Language Model (LLM)"

Line 35:
 
** [https://openai.com/blog/instruction-following/ InstructGPT] ... [[OpenAI]]; labelers preferred outputs from the 1.3B InstructGPT model over outputs from a 175B GPT-3 model
 
** [https://uploads-ssl.webflow.com/60fd4503684b466578c0d307/61138924626a6981ee09caf6_jurassic_tech_paper.pdf Jurassic-1 Language Model] ... huge 178B language model to rival [[OpenAI]]'s GPT-3
 
+ ** [https://www.blog.google/technology/ai/lamda/ LaMDA |] [[Google]] ... experimental language model
+ ** [https://github.com/allenai/macaw Macaw | AI2]
+ ** [https://arxiv.org/pdf/2212.13138.pdf Med-PaLM] ... aligned to the medical domain
+ ** [https://turing.microsoft.com/ Turing-NLG |] [[Microsoft]]
+ ** [https://developer.nvidia.com/blog/using-deepspeed-and-megatron-to-train-megatron-turing-nlg-530b-the-worlds-largest-and-most-powerful-generative-language-model/ Megatron NLG] ... Monolithic Transformer Language NLP Model Triple the Size of [[OpenAI]]’s GPT-3
+ ** [https://muse.lighton.ai/home Muse] ... VLM-4, a set of natively trained large Language Models in French, Italian, Spanish, German, and English
+ ** [https://github.com/karpathy/nanoGPT nanoGPT] ... for training/finetuning medium-sized GPTs
 
* [https://openai.com/blog/gpt-2-6-month-follow-up/ OpenAI Blog] | [[OpenAI]]
 
* [[Attention]] Mechanism/[[Transformer]] Model
 
* [[Generative Pre-trained Transformer (GPT)]]
 
* [https://sambanova.ai/solutions/gpt/ SambaNova Systems] ... Dataflow-as-a-Service GPT
 

Revision as of 23:20, 24 February 2023
