Difference between revisions of "Large Language Model (LLM)"

Line 30:
 
** [[Bidirectional Encoder Representations from Transformers (BERT)]]
 
** [https://ai.googleblog.com/2021/12/more-efficient-in-context-learning-with.html GLaM |] [[Google]]
 
+ ** [https://arxiv.org/abs/2006.16668 GShard |] [[Google]] ... Scaling Giant Models with Conditional Computation and Automatic Sharding
+ ** [https://openai.com/blog/better-language-models/ GPT-2 |] [[OpenAI]] ... Generative Pre-trained Transformer 2 by [[OpenAI]]
 
* [https://openai.com/blog/gpt-2-6-month-follow-up/ OpenAI Blog] | [[OpenAI]]
 
* [[Attention]] Mechanism/[[Transformer]] Model
 
* [[Generative Pre-trained Transformer (GPT)]]
 
* [https://sambanova.ai/solutions/gpt/ SambaNova Systems] ... Dataflow-as-a-Service GPT
 

Revision as of 22:55, 24 February 2023
