Difference between revisions of "Generative Pre-trained Transformer (GPT)"

Revision as of 14:55, 29 June 2019

Natural Language Generation (NLG)
Language Models are Unsupervised Multitask Learners | Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, and Ilya Sutskever
(117M parameter) version of GPT-2 | GitHub
GPT-2: It learned on the Internet | Janelle Shane
Attention Mechanism/Transformer Model
Neural Monkey | Jindřich Libovický, Jindřich Helcl, Tomáš Musil Byte Pair Encoding (BPE) enables NMT model translation on open-vocabulary by encoding rare and unknown words as sequences of subword units.
Too powerful NLP model (GPT-2): What is Generative Pre-Training | Edward Ma
Bidirectional Encoder Representations from Transformers (BERT)
ELMo
Language Models are Unsupervised Multitask Learners - GitHub

a text-generating bot based on a model with 1.5 billion parameters. ...Ultimately, OpenAI's researchers kept the full thing to themselves, only releasing a pared-down 117 million parameter version of the model (which we have dubbed "GPT-2 Junior") as a safer demonstration of what the full GPT-2 model could do.Twenty minutes into the future with OpenAI’s Deep Fake Text AI | Sean Gallagher

1*jbcwhhB8PEpJRk781rML_g.png

r/SubSimulator

Subreddit populated entirely by AI personifications of other subreddits -- all posts and comments are generated automatically using:

results in coherent and realistic simulated content.

GetBadNews

Get Bad News game - Can you beat my score? Play the fake news game! Drop all pretense of ethics and choose the path that builds your persona as an unscrupulous media magnate. Your task is to get as many followers as you can while

@@ Line 12: / Line 12: @@
 * [http://github.com/openai/gpt-2/blob/master/README.md  (117M parameter) version of GPT-2 | GitHub]
 * [http://aiweirdness.com/post/182824715257/gpt-2-it-learned-on-the-internet GPT-2: It learned on the Internet | Janelle Shane]
-* [[Attention Mechanism/Model - Transformer Model]]
+* [[Attention]] Mechanism/[[Transformer]] Model
 * [http://neural-monkey.readthedocs.io/en/latest/machine_translation.html Neural Monkey | Jindřich Libovický, Jindřich Helcl, Tomáš Musil] Byte Pair Encoding (BPE) enables NMT model translation on open-vocabulary by encoding rare and unknown words as sequences of subword units.
 * [http://towardsdatascience.com/too-powerful-nlp-model-generative-pre-training-2-4cc6afb6655 Too powerful NLP model (GPT-2): What is Generative Pre-Training | Edward Ma]

Difference between revisions of "Generative Pre-trained Transformer (GPT)"

Revision as of 14:55, 29 June 2019

r/SubSimulator

GetBadNews

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools