Difference between revisions of "Large Language Model (LLM)"

Line 31:
 
** [https://medium.com/syncedreview/deepminds-flamingo-visual-language-model-demonstrates-sota-few-shot-multimodal-learning-f795c3034b94 Flamingo |] [[Google|DeepMind]] ... [https://github.com/lucidrains/flamingo-pytorch Flamingo Pytorch] 80B  
 
 
** [https://github.com/THUDM/GLM-130B GLM-130B]  ... Open Bilingual Pre-Trained Model
 
** [https://www.deepmind.com/blog/language-modelling-at-scale-gopher-ethical-considerations-and-retrieval Gopher |] [[Google | DeepMind]]
+
** [https://www.deepmind.com/blog/language-modelling-at-scale-gopher-ethical-considerations-and-retrieval Gopher |] [[Google | DeepMind]] 280B
 
** [https://ai.googleblog.com/2021/12/more-efficient-in-context-learning-with.html GLaM |] [[Google]]
 
 +
** [https://github.com/THUDM/GLM-130B GLM-130B]  130B
 
** [https://arxiv.org/abs/2006.16668 GShard |] [[Google]]  ... Scaling Giant Models with Conditional Computation and Automatic Sharding
 
** [https://openai.com/blog/better-language-models/ GPT-2 |] [[OpenAI]] ... Generative Pre-trained Transformer 2 by [[OpenAI]]
+
** [https://openai.com/blog/better-language-models/ GPT-2 |] [[OpenAI]] 1.5B
 +
** [https://openai.com/blog/better-language-models/ GPT-3 |] [[OpenAI]] 175B
 
** [https://github.com/EleutherAI/gpt-neo/ GPT-Neo] ... Open-source GPT-3 by EleutherAI
 
 
** [https://openai.com/blog/instruction-following/ InstructGPT] ... [[OpenAI]] 1.3B InstructGPT model over outputs from a 175B GPT-3 model  
 

Revision as of 11:54, 25 February 2023



Inside language models (from GPT-3 to PaLM) | Alan-D-Thompson