Difference between revisions of "Reinforcement Learning (RL) from Human Feedback (RLHF)"

Revision as of 09:24, 29 January 2023

Reinforcement Learning from Human Feedback: From Zero to ChatGPT

In this talk, we will cover the basics of Reinforcement Learning from Human Feedback (RLHF) and how this technology is being used to enable state-of-the-art ML tools like ChatGPT. Most of the talk will be an overview of the interconnected ML models, covering the basics of Natural Language Processing and Reinforcement Learning (RL) that one needs to understand how RLHF is used on large language models. It will conclude with open questions in RLHF.

Links: RLHF Blogpost, The Deep RL Course, Slides from this talk, Nathan Twitter, Thomas Twitter

Nathan Lambert is a Research Scientist at HuggingFace. He received his PhD from the University of California, Berkeley, working at the intersection of machine learning and robotics. He was advised by Professor Kristofer Pister in the Berkeley Autonomous Microsystems Lab and Roberto Calandra at Meta AI Research. He was lucky to intern at Facebook AI and DeepMind during his PhD. Nathan was awarded the UC Berkeley EECS Demetri Angelakos Memorial Achievement Award for Altruism for his efforts to better community norms.

How ChatGPT works - From Transformers to Reinforcement Learning with Human Feedback (RLHF)

ChatGPT has recently been released by OpenAI, and it is fundamentally a next-token (word) prediction model: given a prompt, it predicts the next token(s). Trained on a massive internet corpus, it becomes very powerful and can perform many tasks zero-shot, such as summarization, code completion, and question answering. Amidst the hype around ChatGPT, it is easy to assume that the model can reason and think for itself. Here, we try to demystify how the model works, starting with a basic introduction to Transformers and then showing how the model's output can be improved using Reinforcement Learning with Human Feedback (RLHF).

Slides and code here: https://github.com/tanchongmin/Tensor...
Transformer Introduction here: https://www.youtube.com/watch?v=iBamM...

References:
Original Transformer Paper (Attention Is All You Need): https://arxiv.org/pdf/1706.03762.pdf
GPT-3 Paper: https://arxiv.org/pdf/2005.14165.pdf
DialoGPT Paper (conversational AI by Microsoft): https://arxiv.org/pdf/1911.00536.pdf
InstructGPT Paper (with RLHF): https://arxiv.org/pdf/2203.02155.pdf
Illustrated Transformer: https://jalammar.github.io/illustrate...
Illustrated GPT-2: https://jalammar.github.io/illustrate...

0:00 Introduction
3:09 Embedding Space
15:35 Overall Transformer Architecture
36:06 Transformer (Details)
49:28 GPT Architecture
56:38 GPT Training and Loss Function
1:05:25 Live Demo of GPT Next Token Generation and Attention Visualisation
1:16:55 Conversational AI
1:19:00 Reinforcement Learning from Human Feedback (RLHF)
1:45:15 Discussion

BPeat (talk) 08:24, 29 January 2023 (EST)

AI and ML enthusiast. Likes to think about the essence behind AI breakthroughs and explain them in a simple and relatable way. Also an avid game creator.
Online AI blog: https://delvingintotech.wordpress.com/
LinkedIn: https://www.linkedin.com/in/chong-min...
Twitch: https://www.twitch.tv/johncm99
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin
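The "next token prediction" idea in the video description above can be illustrated with a minimal sketch. This is not the code from the talk: the lookup table `TOY_LOGITS` and the functions `next_token_distribution` and `generate` are hypothetical names invented for illustration, and the hand-written scores stand in for what a real GPT model would compute with a Transformer over the whole prompt. The sketch only shows the loop itself: score every candidate token, turn the scores into probabilities with a softmax, append the most likely token, and repeat.

```python
import math


def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    highest = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - highest) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical toy "model": hand-written scores for the token that follows
# the last token of the context. A real GPT computes such scores with a
# Transformer that attends to the entire prompt.
TOY_LOGITS = {
    "the": {"cat": 2.0, "dog": 1.0, "sat": -1.0},
    "cat": {"sat": 2.5, "the": 0.0, "dog": -0.5},
    "sat": {"the": 1.5, "cat": 0.2, "dog": 0.1},
    "dog": {"sat": 2.0, "the": 0.3, "cat": 0.0},
}


def next_token_distribution(context):
    """Return a {token: probability} mapping for the next token."""
    scores = TOY_LOGITS[context[-1]]
    tokens = list(scores)
    probs = softmax([scores[t] for t in tokens])
    return dict(zip(tokens, probs))


def generate(prompt, steps):
    """Greedy decoding: repeatedly append the most probable next token."""
    tokens = list(prompt)
    for _ in range(steps):
        dist = next_token_distribution(tokens)
        tokens.append(max(dist, key=dist.get))
    return tokens


if __name__ == "__main__":
    print(generate(["the"], 3))
```

Greedy decoding is used here only because it keeps the example deterministic; sampling from the distribution instead of taking the maximum is a common alternative.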

@@ Line 35: / Line 35: @@
 {| class="wikitable" style="width: 550px;"
 ||
-<youtube>Fw5ybNwwSbg</youtube>
+<youtube>wA8rjKueB3Q</youtube>
-<b>I challenged ChatGPT to code and hack (Are we doomed?)
+<b>How ChatGPT works - From Transformers to Reinforcement Learning with Human Feedback (RLHF)
-</b><br>Are we doomed? Will AI like ChatGPT replace us? I put it to the test and challenged it to write C code, Python hacking scripts,
+</b><br>ChatGPT has recently been released by OpenAI, and it is fundamentally a next-token (word) prediction model: given a prompt, it predicts the next token(s). Trained on a massive internet corpus, it becomes very powerful and can perform many tasks zero-shot, such as summarization, code completion, and question answering.
+Amidst the hype around ChatGPT, it is easy to assume that the model can reason and think for itself. Here, we try to demystify how the model works, starting with a basic introduction to Transformers and then showing how the model's output can be improved using Reinforcement Learning with Human Feedback (RLHF).
+Slides and code here: https://github.com/tanchongmin/Tensor...
+Transformer Introduction here: https://www.youtube.com/watch?v=iBamM...
+References:
+Original Transformer Paper (Attention Is All You Need): https://arxiv.org/pdf/1706.03762.pdf
+GPT-3 Paper: https://arxiv.org/pdf/2005.14165.pdf
+DialoGPT Paper (conversational AI by Microsoft): https://arxiv.org/pdf/1911.00536.pdf
+InstructGPT Paper (with RLHF): https://arxiv.org/pdf/2203.02155.pdf
+Illustrated Transformer: https://jalammar.github.io/illustrate...
+Illustrated GPT-2: https://jalammar.github.io/illustrate...
+* 0:00 Introduction
+* 3:09 Embedding Space
+* 15:35 Overall Transformer Architecture
+* 36:06 Transformer (Details)
+* 49:28 GPT Architecture
+* 56:38 GPT Training and Loss Function
+* 1:05:25 Live Demo of GPT Next Token Generation and Attention Visualisation
+* 1:16:55 Conversational AI
+* 1:19:00 Reinforcement Learning from Human Feedback (RLHF)
+* 1:45:15 Discussion
+[[User:BPeat|BPeat]] ([[User talk:BPeat|talk]]) 08:24, 29 January 2023 (EST)
+AI and ML enthusiast. Likes to think about the essence behind AI breakthroughs and explain them in a simple and relatable way. Also an avid game creator.
+Online AI blog: https://delvingintotech.wordpress.com/.
+LinkedIn: https://www.linkedin.com/in/chong-min...
+Twitch: https://www.twitch.tv/johncm99
+Twitter: https://twitter.com/johntanchongmin
+Try out my games here: https://simmer.io/@chongmin
 |}
 |}<!-- B -->
