Difference between revisions of "Large Language Model (LLM)"
Line 17:
  **** [[Proximal Policy Optimization (PPO)]]
  ** [https://opt.alpa.ai/ Alpa] ... serving large models like GPT-3 simple, affordable, accessible
− ** [https://
+ ** [https://github.com/microsoft/BioGPT BioGPT] ... [[Microsoft]] language model trained for biomedical tasks
− ** [https://
+ ** [https://bigscience.notion.site/BLOOM-BigScience-176B-Model-ad073ca07cdf479398d5f95d88e218c4 BLOOM] ... Big Science Language Open-science Open-access Multilingual ... 176B
  ** [https://gpt3demo.com/apps/cedille-ai Cedille] ... open-source French language model
  ** [https://gpt3demo.com/apps/chinchilla-deepmind Chinchilla |] [[Google | DeepMind]]
Revision as of 22:41, 24 February 2023
- Models:
- ChatGPT | OpenAI
- Alpa ... serving large models like GPT-3 simple, affordable, accessible
- BioGPT ... Microsoft language model trained for biomedical tasks
- BLOOM ... Big Science Language Open-science Open-access Multilingual ... 176B
- Cedille ... open-source French language model
- Chinchilla | DeepMind
- CTRL ... a Conditional Transformer Language Model for Controllable Generation | Salesforce
- Gopher | DeepMind
- RETRO | DeepMind
- DialoGPT ... Microsoft Releases DialogGPT AI Conversation Model | Anthony Alford - InfoQ ... trained on over 147M dialogs
- minGPT | Andrej Karpathy - GitHub
- GLM-130B ... Open Bilingual Pre-Trained Model
- OPT-175B ... Facebook-owner Meta opens access to AI large language model | Elizabeth Culliford - Reuters ... Facebook's 175-billion-parameter language model, Open Pretrained Transformer
- Bidirectional Encoder Representations from Transformers (BERT)
- GLaM | Google
- OpenAI Blog | OpenAI
- Attention Mechanism/Transformer Model
- Generative Pre-trained Transformer (GPT)
- SambaNova Systems ... Dataflow-as-a-Service GPT