Difference between revisions of "Large Language Model (LLM)"

Revision as of 22:08, 24 February 2023

YouTube search... ...Google search

- ChatGPT | OpenAI
  - ChatGPT is everywhere. Here’s where it came from | Will Douglas Heaven - MIT Technology Review
    - Transformer / Attention Mechanism
    - Generative Pre-trained Transformer (GPT)
    - Reinforcement Learning (RL) from Human Feedback (RLHF)
    - Supervised Learning
    - Proximal Policy Optimization (PPO)
- Alpa ... serving large models like GPT-3 simple, affordable, accessible
- BioGPT ... Microsoft language model trained for biomedical tasks
- BLOOM ... Big Science Language Open-science Open-access Multilingual
- Cedille ... open-source French language model
- Chinchilla | DeepMind
- ctrl ... a Conditional Transformer Language Model for Controllable Generation | Salesforce
- Gopher | DeepMind
- RETRO | DeepMind
- DialogGPT ...Microsoft Releases DialogGPT AI Conversation Model | Anthony Alford - InfoQ - trained on over 147M dialogs
OpenAI Blog | OpenAI
Attention Mechanism/Transformer Model
minGPT | Andrej Karpathy - GitHub
SambaNova Systems ... Dataflow-as-a-Service GPT
Facebook-owner Meta opens access to AI large language model | Elizabeth Culliford - Reuters ... Facebook 175-billion-parameter language model - Open Pretrained Transformer (OPT-175B)

@@ Line 5: / Line 5: @@
 |description=Helpful resources for your journey with artificial intelligence; Attention, GPT, chat, videos, articles, techniques, courses, profiles, and tools
 }}
-[https://www.youtube.com/results?search_query=Generative+Pre+trained+Transformer+GPT+generation+nlg+natural+language+semantics YouTube search...]
+[https://www.youtube.com/results?search_query=Large+Language+Model+LLM YouTube search...]
-[https://www.google.com/search?q=Generative+Pre+trained+Transformer+GPT+generation+nlg+natural+language+semantics ...Google search]
+[https://www.google.com/search?q=Large+Language+Model+LLM ...Google search]
-* [[Case Studies]]
-** [[Writing]]
-** [[Publishing]]
-** [[Sequence to Sequence (Seq2Seq)]]
-** [[Recurrent Neural Network (RNN)]]
-** [[Long Short-Term Memory (LSTM)]]
-** [[ELMo]]
-** [[Bidirectional Encoder Representations from Transformers (BERT)]]  ... a better model, but less investment than the larger [[OpenAI]] organization
-* [[Large Language Models (LLMs)]]
 ** [[ChatGPT]] | [[OpenAI]]
 *** [https://www.technologyreview.com/2023/02/08/1068068/chatgpt-is-everywhere-heres-where-it-came-from/ ChatGPT is everywhere. Here’s where it came from | Will Douglas Heaven - MIT Technology Review]
@@ Line 32: / Line 23: @@
 ** [https://gpt3demo.com/apps/deepmind-gopher Gopher |] [[Google | DeepMind]]
 ** [https://gpt3demo.com/apps/deepmind-retro RETRO |] [[Google | DeepMind]]
+** [https://www.infoq.com/news/2019/11/microsoft-ai-conversation/ DialogGPT]  ...Microsoft Releases DialogGPT AI Conversation Model | Anthony Alford - InfoQ - trained on over 147M dialogs
 * [https://openai.com/blog/gpt-2-6-month-follow-up/ OpenAI Blog] | [[OpenAI]]
-* [[Text Transfer Learning]]
-* [[Natural Language Generation (NLG)]]
-* [[Natural Language Tools & Services]]
-* [[Generated Image]]
-* [[SynthPub]]
-* [https://d4mucfpksywv.cloudfront.net/better-language-models/language_models_are_unsupervised_multitask_learners.pdf Language Models are Unsupervised Multitask Learners | Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, and Ilya Sutskever]
-* [https://neural-monkey.readthedocs.io/en/latest/machine_translation.html Neural Monkey | Jindřich Libovický, Jindřich Helcl, Tomáš Musil] Byte Pair Encoding (BPE) enables NMT model translation on open-vocabulary by encoding rare and unknown words as sequences of subword units.
 * [[Attention]] Mechanism/[[Transformer]] Model
-* [https://github.com/openai/gpt-2 Language Models are Unsupervised Multitask Learners - GitHub]
-* [https://www.infoq.com/news/2019/11/microsoft-ai-conversation/ Microsoft Releases DialogGPT AI Conversation Model | Anthony Alford - InfoQ] - trained on over 147M dialogs
 * [https://github.com/karpathy/minGPT minGPT | Andrej Karpathy - GitHub]
 * [https://sambanova.ai/solutions/gpt/ SambaNova Systems] ... Dataflow-as-a-Service GPT
 * [https://www.reuters.com/technology/facebook-owner-meta-opens-access-ai-large-language-model-2022-05-03/  [[Meta|Facebook]]-owner Meta opens access to AI large language model | Elizabeth Culliford - Reuters] ... [[Meta|Facebook]] 175-billion-parameter language model - Open Pretrained Transformer (OPT-175B)

Difference between revisions of "Large Language Model (LLM)"

Revision as of 22:08, 24 February 2023

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools