Difference between revisions of "Large Language Model (LLM)"

From
Jump to: navigation, search
m
m
Line 19: Line 19:
 
** [https://github.com/microsoft/BioGPT BioGPT]  ... [[Microsoft]] language model trained for biomedical tasks
 
** [https://github.com/microsoft/BioGPT BioGPT]  ... [[Microsoft]] language model trained for biomedical tasks
 
** [https://bigscience.notion.site/BLOOM-BigScience-176B-Model-ad073ca07cdf479398d5f95d88e218c4 BLOOM]  ... Big Science Language Open-science Open-access Multilingual  ... 176B
 
** [https://bigscience.notion.site/BLOOM-BigScience-176B-Model-ad073ca07cdf479398d5f95d88e218c4 BLOOM]  ... Big Science Language Open-science Open-access Multilingual  ... 176B
** [https://gpt3demo.com/apps/cedille-ai Cedille]  ... open-source French language model
+
** [https://cedille.ai/ Cedille]  ... open-source French language model
** [https://gpt3demo.com/apps/chinchilla-deepmind Chinchilla |] [[Google | DeepMind]]
+
** [https://www.deepmind.com/publications/an-empirical-analysis-of-compute-optimal-large-language-model-training Chinchilla |] [[Google | DeepMind]]
** [https://gpt3demo.com/apps/ctrl-salesforce ctrl] ... a Conditional Transformer Language Model for Controllable Generation | Salesforce
+
** [https://arxiv.org/abs/2203.15556 ctrl] ... a Conditional Transformer Language Model for Controllable Generation | Salesforce
** [https://gpt3demo.com/apps/deepmind-gopher Gopher |] [[Google | DeepMind]]
+
** [https://www.deepmind.com/blog/language-modelling-at-scale-gopher-ethical-considerations-and-retrieval Gopher |] [[Google | DeepMind]]
** [https://gpt3demo.com/apps/deepmind-retro RETRO |] [[Google | DeepMind]]  
+
** [https://www.deepmind.com/publications/improving-language-models-by-retrieving-from-trillions-of-tokens RETRO |] [[Google | DeepMind]]  
 
** [https://www.infoq.com/news/2019/11/microsoft-ai-conversation/ DialogGPT]  ...Microsoft Releases DialogGPT AI Conversation Model | Anthony Alford - InfoQ - trained on over 147M dialogs  
 
** [https://www.infoq.com/news/2019/11/microsoft-ai-conversation/ DialogGPT]  ...Microsoft Releases DialogGPT AI Conversation Model | Anthony Alford - InfoQ - trained on over 147M dialogs  
 
** [https://github.com/karpathy/minGPT minGPT | Andrej Karpathy - GitHub]
 
** [https://github.com/karpathy/minGPT minGPT | Andrej Karpathy - GitHub]

Revision as of 22:46, 24 February 2023

YouTube search... ...Google search