Difference between revisions of "Large Language Model (LLM)"

From
Jump to: navigation, search
m
m
Line 8: Line 8:
 
[https://www.google.com/search?q=Large+Language+Model+LLM ...Google search]
 
[https://www.google.com/search?q=Large+Language+Model+LLM ...Google search]
  
** [[ChatGPT]] | [[OpenAI]]
+
* [[ChatGPT]] | [[OpenAI]]
*** [https://www.technologyreview.com/2023/02/08/1068068/chatgpt-is-everywhere-heres-where-it-came-from/ ChatGPT is everywhere. Here’s where it came from | Will Douglas Heaven - MIT Technology Review]
+
** [https://www.technologyreview.com/2023/02/08/1068068/chatgpt-is-everywhere-heres-where-it-came-from/ ChatGPT is everywhere. Here’s where it came from | Will Douglas Heaven - MIT Technology Review]
**** [[Transformer]] / [[Attention]] Mechanism
+
*** [[Transformer]] / [[Attention]] Mechanism
**** [[Generative Pre-trained Transformer (GPT)]]
+
*** [[Generative Pre-trained Transformer (GPT)]]
**** [[Reinforcement Learning (RL) from Human Feedback (RLHF)]]
+
*** [[Reinforcement Learning (RL) from Human Feedback (RLHF)]]
**** [[Supervised]] Learning
+
*** [[Supervised]] Learning
**** [[Proximal Policy Optimization (PPO)]]
+
*** [[Proximal Policy Optimization (PPO)]]
** [https://opt.alpa.ai/ Alpa]  ... serving large models like GPT-3 simple, affordable, accessible  
+
* [https://opt.alpa.ai/ Alpa]  ... serving large models like GPT-3 simple, affordable, accessible  
** [https://gpt3demo.com/apps/biogpt BioGPT]  ... [[Microsoft]] language model trained for biomedical tasks
+
* [https://gpt3demo.com/apps/biogpt BioGPT]  ... [[Microsoft]] language model trained for biomedical tasks
** [https://gpt3demo.com/apps/bloom BLOOM]  ... Big Science Language Open-science Open-access Multilingual  
+
* [https://gpt3demo.com/apps/bloom BLOOM]  ... Big Science Language Open-science Open-access Multilingual  
** [https://gpt3demo.com/apps/cedille-ai Cedille]  ... open-source French language model
+
* [https://gpt3demo.com/apps/cedille-ai Cedille]  ... open-source French language model
** [https://gpt3demo.com/apps/chinchilla-deepmind Chinchilla |] [[Google | DeepMind]]
+
* [https://gpt3demo.com/apps/chinchilla-deepmind Chinchilla |] [[Google | DeepMind]]
** [https://gpt3demo.com/apps/ctrl-salesforce ctrl] ... a Conditional Transformer Language Model for Controllable Generation | Salesforce
+
* [https://gpt3demo.com/apps/ctrl-salesforce ctrl] ... a Conditional Transformer Language Model for Controllable Generation | Salesforce
** [https://gpt3demo.com/apps/deepmind-gopher Gopher |] [[Google | DeepMind]]
+
* [https://gpt3demo.com/apps/deepmind-gopher Gopher |] [[Google | DeepMind]]
** [https://gpt3demo.com/apps/deepmind-retro RETRO |] [[Google | DeepMind]]  
+
* [https://gpt3demo.com/apps/deepmind-retro RETRO |] [[Google | DeepMind]]  
** [https://www.infoq.com/news/2019/11/microsoft-ai-conversation/ DialogGPT]  ...Microsoft Releases DialogGPT AI Conversation Model | Anthony Alford - InfoQ - trained on over 147M dialogs  
+
* [https://www.infoq.com/news/2019/11/microsoft-ai-conversation/ DialogGPT]  ...Microsoft Releases DialogGPT AI Conversation Model | Anthony Alford - InfoQ - trained on over 147M dialogs  
** [https://github.com/karpathy/minGPT minGPT | Andrej Karpathy - GitHub]
+
* [https://github.com/karpathy/minGPT minGPT | Andrej Karpathy - GitHub]
** [https://gpt3demo.com/apps/glm-130b GLM-130B]  ... Open Bilingual Pre-Trained Model  
+
* [https://gpt3demo.com/apps/glm-130b GLM-130B]  ... Open Bilingual Pre-Trained Model  
 
* [https://openai.com/blog/gpt-2-6-month-follow-up/ OpenAI Blog] | [[OpenAI]]
 
* [https://openai.com/blog/gpt-2-6-month-follow-up/ OpenAI Blog] | [[OpenAI]]
 
* [[Attention]] Mechanism/[[Transformer]] Model
 
* [[Attention]] Mechanism/[[Transformer]] Model

Revision as of 22:26, 24 February 2023

YouTube search... ...Google search