Difference between revisions of "Large Language Model (LLM)"

From
Jump to: navigation, search
m
m
Line 31: Line 31:
 
** [https://medium.com/syncedreview/deepminds-flamingo-visual-language-model-demonstrates-sota-few-shot-multimodal-learning-f795c3034b94 Flamingo |] [[Google|DeepMind]] ... [https://github.com/lucidrains/flamingo-pytorch Flamingo Pytorch] 80B  
 
** [https://medium.com/syncedreview/deepminds-flamingo-visual-language-model-demonstrates-sota-few-shot-multimodal-learning-f795c3034b94 Flamingo |] [[Google|DeepMind]] ... [https://github.com/lucidrains/flamingo-pytorch Flamingo Pytorch] 80B  
 
** [https://github.com/THUDM/GLM-130B GLM-130B]  ... Open Bilingual Pre-Trained Model  130B
 
** [https://github.com/THUDM/GLM-130B GLM-130B]  ... Open Bilingual Pre-Trained Model  130B
 +
** [https://ai.googleblog.com/2021/12/more-efficient-in-context-learning-with.html GLaM |] [[Google]]
 
** [https://www.deepmind.com/blog/language-modelling-at-scale-gopher-ethical-considerations-and-retrieval Gopher |] [[Google | DeepMind]]  280B
 
** [https://www.deepmind.com/blog/language-modelling-at-scale-gopher-ethical-considerations-and-retrieval Gopher |] [[Google | DeepMind]]  280B
** [https://ai.googleblog.com/2021/12/more-efficient-in-context-learning-with.html GLaM |] [[Google]]
 
 
** [https://arxiv.org/abs/2006.16668 GShard |] [[Google]]  ... Scaling Giant Models with Conditional Computation and Automatic Sharding
 
** [https://arxiv.org/abs/2006.16668 GShard |] [[Google]]  ... Scaling Giant Models with Conditional Computation and Automatic Sharding
 
** [https://openai.com/blog/better-language-models/ GPT-2 |] [[OpenAI]]  1.5B
 
** [https://openai.com/blog/better-language-models/ GPT-2 |] [[OpenAI]]  1.5B

Revision as of 12:00, 25 February 2023

YouTube search... ...Google search


Inside language models (from GPT-3 to PaLM) | Alan-D-Thompson