Difference between revisions of "Large Language Model (LLM)"

Line 43:
 
** [https://github.com/allenai/macaw Macaw | AI2]
 
** [https://arxiv.org/pdf/2212.13138.pdf Med-PaLM] ... aligned to the medical domain
 +
** [https://developer.nvidia.com/blog/using-deepspeed-and-megatron-to-train-megatron-turing-nlg-530b-the-worlds-largest-and-most-powerful-generative-language-model/ Megatron NLG] ... Monolithic Transformer Language NLP Model Triple the Size of [[OpenAI]]’s GPT-3
 
** [https://github.com/karpathy/minGPT minGPT | Andrej Karpathy - GitHub]
** [https://developer.nvidia.com/blog/using-deepspeed-and-megatron-to-train-megatron-turing-nlg-530b-the-worlds-largest-and-most-powerful-generative-language-model/ Megatron NLG] ... Monolithic Transformer Language NLP Model Triple the Size of [[OpenAI]]’s GPT-3
 
 
** [https://muse.lighton.ai/home Muse] ... VLM-4, a set of natively trained large language models in French, Italian, Spanish, German, and English
 +
** [https://app.inferkit.com/demo MT-NLG]
 
** [https://github.com/karpathy/nanoGPT nanoGPT] ... for training/finetuning medium-sized GPTs
 
** [https://ai.facebook.com/blog/nllb-200-high-quality-machine-translation/ NLLB |] [[Meta]] ... NLLB-200; 54.5B parameters, 200 languages

Revision as of 11:37, 25 February 2023



Inside language models (from GPT-3 to PaLM) | Alan-D-Thompson