Difference between revisions of "Apprenticeship Learning - Inverse Reinforcement Learning (IRL)"

From
Jump to: navigation, search
m
m (Text replacement - "http:" to "https:")
Line 5: Line 5:
 
|description=Helpful resources for your journey with artificial intelligence; videos, articles, techniques, courses, profiles, and tools  
 
|description=Helpful resources for your journey with artificial intelligence; videos, articles, techniques, courses, profiles, and tools  
 
}}
 
}}
[http://www.youtube.com/results?search_query=Inverse+Reinforcement+Machine+Learning+Apprenticeship YouTube search...]
+
[https://www.youtube.com/results?search_query=Inverse+Reinforcement+Machine+Learning+Apprenticeship YouTube search...]
[http://www.google.com/search?q=Inverse+Reinforcement+Machine+Learning+Apprenticeship+machine+learning+ML+artificial+intelligence ...Google search]
+
[https://www.google.com/search?q=Inverse+Reinforcement+Machine+Learning+Apprenticeship+machine+learning+ML+artificial+intelligence ...Google search]
  
 
* [[Learning Techniques]]
 
* [[Learning Techniques]]
Line 16: Line 16:
 
* [[Policy]]
 
* [[Policy]]
 
* [[Generative AI]]  ... [[OpenAI]]'s [[ChatGPT]] ... [[Perplexity]]  ... [[Microsoft]]'s [[BingAI]] ... [[You]] ...[[Google]]'s [[Bard]] ... [[Baidu]]'s [[Ernie]]
 
* [[Generative AI]]  ... [[OpenAI]]'s [[ChatGPT]] ... [[Perplexity]]  ... [[Microsoft]]'s [[BingAI]] ... [[You]] ...[[Google]]'s [[Bard]] ... [[Baidu]]'s [[Ernie]]
* [http://arxiv.org/pdf/1806.06877.pdf A Survey of Inverse Reinforcement Learning: Challenges, Methods and Progress | Saurabh Arora, Prashant Doshi] 18 Jun 2018
+
* [https://arxiv.org/pdf/1806.06877.pdf A Survey of Inverse Reinforcement Learning: Challenges, Methods and Progress | Saurabh Arora, Prashant Doshi] 18 Jun 2018
* [http://arxiv.org/pdf/1805.07687.pdf Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications | Daniel S. Brown, Scott Niekum] 23 Jun 2018
+
* [https://arxiv.org/pdf/1805.07687.pdf Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications | Daniel S. Brown, Scott Niekum] 23 Jun 2018
* [http://analyticsindiamag.com/guide-to-mbirl-model-based-inverse-reinforcement-learning/ Guide to MBIRL – Model Based Inverse Reinforcement Learning | Aishwarya Verma]
+
* [https://analyticsindiamag.com/guide-to-mbirl-model-based-inverse-reinforcement-learning/ Guide to MBIRL – Model Based Inverse Reinforcement Learning | Aishwarya Verma]
  
 
<img src="https://149695847.v2.pressablecdn.com/wp-content/uploads/2021/02/IRL.png" width="500">
 
<img src="https://149695847.v2.pressablecdn.com/wp-content/uploads/2021/02/IRL.png" width="500">

Revision as of 23:20, 27 March 2023

YouTube search... ...Google search

Inverse reinforcement learning (IRL) infers/derives a reward function from observed behavior/demonstrations, allowing for policy improvement and generalization. While ordinary "reinforcement learning" involves using rewards and punishments to learn behavior, in IRL the direction is reversed, and a robot observes a person's behavior to figure out what goal that behavior seems to be trying to achieve.