Difference between revisions of "Large Language Model (LLM)"

From
Jump to: navigation, search
m (BPeat moved page Large Language Models (LLMs) to Large Language Model (LLM) without leaving a redirect)
m
Line 5: Line 5:
 
|description=Helpful resources for your journey with artificial intelligence; Attention, GPT, chat, videos, articles, techniques, courses, profiles, and tools  
 
|description=Helpful resources for your journey with artificial intelligence; Attention, GPT, chat, videos, articles, techniques, courses, profiles, and tools  
 
}}
 
}}
[https://www.youtube.com/results?search_query=Generative+Pre+trained+Transformer+GPT+generation+nlg+natural+language+semantics YouTube search...]
+
[https://www.youtube.com/results?search_query=Large+Language+Model+LLM YouTube search...]
[https://www.google.com/search?q=Generative+Pre+trained+Transformer+GPT+generation+nlg+natural+language+semantics ...Google search]
+
[https://www.google.com/search?q=Large+Language+Model+LLM ...Google search]
  
* [[Case Studies]]
 
** [[Writing]]
 
** [[Publishing]]
 
** [[Sequence to Sequence (Seq2Seq)]]
 
** [[Recurrent Neural Network (RNN)]] 
 
** [[Long Short-Term Memory (LSTM)]]
 
** [[ELMo]]
 
** [[Bidirectional Encoder Representations from Transformers (BERT)]]  ... a better model, but less investment than the larger [[OpenAI]] organization
 
* [[Large Language Models (LLMs)]]
 
 
** [[ChatGPT]] | [[OpenAI]]
 
** [[ChatGPT]] | [[OpenAI]]
 
*** [https://www.technologyreview.com/2023/02/08/1068068/chatgpt-is-everywhere-heres-where-it-came-from/ ChatGPT is everywhere. Here’s where it came from | Will Douglas Heaven - MIT Technology Review]
 
*** [https://www.technologyreview.com/2023/02/08/1068068/chatgpt-is-everywhere-heres-where-it-came-from/ ChatGPT is everywhere. Here’s where it came from | Will Douglas Heaven - MIT Technology Review]
Line 32: Line 23:
 
** [https://gpt3demo.com/apps/deepmind-gopher Gopher |] [[Google | DeepMind]]
 
** [https://gpt3demo.com/apps/deepmind-gopher Gopher |] [[Google | DeepMind]]
 
** [https://gpt3demo.com/apps/deepmind-retro RETRO |] [[Google | DeepMind]]  
 
** [https://gpt3demo.com/apps/deepmind-retro RETRO |] [[Google | DeepMind]]  
 +
** [https://www.infoq.com/news/2019/11/microsoft-ai-conversation/ DialogGPT]  ...Microsoft Releases DialogGPT AI Conversation Model | Anthony Alford - InfoQ - trained on over 147M dialogs
 
* [https://openai.com/blog/gpt-2-6-month-follow-up/ OpenAI Blog] | [[OpenAI]]
 
* [https://openai.com/blog/gpt-2-6-month-follow-up/ OpenAI Blog] | [[OpenAI]]
* [[Text Transfer Learning]]
 
* [[Natural Language Generation (NLG)]]
 
* [[Natural Language Tools & Services]]
 
* [[Generated Image]]
 
* [[SynthPub]]
 
* [https://d4mucfpksywv.cloudfront.net/better-language-models/language_models_are_unsupervised_multitask_learners.pdf Language Models are Unsupervised Multitask Learners | Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, and Ilya Sutskever]
 
* [https://neural-monkey.readthedocs.io/en/latest/machine_translation.html Neural Monkey | Jindřich Libovický, Jindřich Helcl, Tomáš Musil] Byte Pair Encoding (BPE) enables NMT model translation on open-vocabulary by encoding rare and unknown words as sequences of subword units.
 
 
* [[Attention]] Mechanism/[[Transformer]] Model
 
* [[Attention]] Mechanism/[[Transformer]] Model
* [https://github.com/openai/gpt-2 Language Models are Unsupervised Multitask Learners - GitHub]
 
* [https://www.infoq.com/news/2019/11/microsoft-ai-conversation/ Microsoft Releases DialogGPT AI Conversation Model | Anthony Alford - InfoQ] - trained on over 147M dialogs
 
 
* [https://github.com/karpathy/minGPT minGPT | Andrej Karpathy - GitHub]
 
* [https://github.com/karpathy/minGPT minGPT | Andrej Karpathy - GitHub]
 
* [https://sambanova.ai/solutions/gpt/ SambaNova Systems] ... Dataflow-as-a-Service GPT
 
* [https://sambanova.ai/solutions/gpt/ SambaNova Systems] ... Dataflow-as-a-Service GPT
 
* [https://www.reuters.com/technology/facebook-owner-meta-opens-access-ai-large-language-model-2022-05-03/  [[Meta|Facebook]]-owner Meta opens access to AI large language model | Elizabeth Culliford - Reuters] ... [[Meta|Facebook]] 175-billion-parameter language model - Open Pretrained Transformer (OPT-175B)
 
* [https://www.reuters.com/technology/facebook-owner-meta-opens-access-ai-large-language-model-2022-05-03/  [[Meta|Facebook]]-owner Meta opens access to AI large language model | Elizabeth Culliford - Reuters] ... [[Meta|Facebook]] 175-billion-parameter language model - Open Pretrained Transformer (OPT-175B)

Revision as of 22:08, 24 February 2023

YouTube search... ...Google search