Difference between revisions of "Apprenticeship Learning - Inverse Reinforcement Learning (IRL)"
m |
m (Text replacement - "http:" to "https:") |
||
| Line 5: | Line 5: | ||
|description=Helpful resources for your journey with artificial intelligence; videos, articles, techniques, courses, profiles, and tools | |description=Helpful resources for your journey with artificial intelligence; videos, articles, techniques, courses, profiles, and tools | ||
}} | }} | ||
| − | [ | + | [https://www.youtube.com/results?search_query=Inverse+Reinforcement+Machine+Learning+Apprenticeship YouTube search...] |
| − | [ | + | [https://www.google.com/search?q=Inverse+Reinforcement+Machine+Learning+Apprenticeship+machine+learning+ML+artificial+intelligence ...Google search] |
* [[Learning Techniques]] | * [[Learning Techniques]] | ||
| Line 16: | Line 16: | ||
* [[Policy]] | * [[Policy]] | ||
* [[Generative AI]] ... [[OpenAI]]'s [[ChatGPT]] ... [[Perplexity]] ... [[Microsoft]]'s [[BingAI]] ... [[You]] ...[[Google]]'s [[Bard]] ... [[Baidu]]'s [[Ernie]] | * [[Generative AI]] ... [[OpenAI]]'s [[ChatGPT]] ... [[Perplexity]] ... [[Microsoft]]'s [[BingAI]] ... [[You]] ...[[Google]]'s [[Bard]] ... [[Baidu]]'s [[Ernie]] | ||
| − | * [ | + | * [https://arxiv.org/pdf/1806.06877.pdf A Survey of Inverse Reinforcement Learning: Challenges, Methods and Progress | Saurabh Arora, Prashant Doshi] 18 Jun 2018 |
| − | * [ | + | * [https://arxiv.org/pdf/1805.07687.pdf Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications | Daniel S. Brown, Scott Niekum] 23 Jun 2018 |
| − | * [ | + | * [https://analyticsindiamag.com/guide-to-mbirl-model-based-inverse-reinforcement-learning/ Guide to MBIRL – Model Based Inverse Reinforcement Learning | Aishwarya Verma] |
<img src="https://149695847.v2.pressablecdn.com/wp-content/uploads/2021/02/IRL.png" width="500"> | <img src="https://149695847.v2.pressablecdn.com/wp-content/uploads/2021/02/IRL.png" width="500"> | ||
Revision as of 23:20, 27 March 2023
YouTube search... ...Google search
- Learning Techniques
- Inside Out - Curious Optimistic Reasoning
- Generative Adversarial Network (GAN)
- Connecting Brains
- Policy
- Generative AI ... OpenAI's ChatGPT ... Perplexity ... Microsoft's BingAI ... You ...Google's Bard ... Baidu's Ernie
- A Survey of Inverse Reinforcement Learning: Challenges, Methods and Progress | Saurabh Arora, Prashant Doshi 18 Jun 2018
- Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications | Daniel S. Brown, Scott Niekum 23 Jun 2018
- Guide to MBIRL – Model Based Inverse Reinforcement Learning | Aishwarya Verma
Inverse reinforcement learning (IRL) infers/derives a reward function from observed behavior/demonstrations, allowing for policy improvement and generalization. While ordinary "reinforcement learning" involves using rewards and punishments to learn behavior, in IRL the direction is reversed, and a robot observes a person's behavior to figure out what goal that behavior seems to be trying to achieve.