Apprenticeship Learning - Inverse Reinforcement Learning (IRL)
Revision as of 14:48, 3 February 2019
- Imitation Learning
- Reinforcement Learning
- Inside Out - Curious Optimistic Reasoning
- Generative Adversarial Network (GAN)
- A Survey of Inverse Reinforcement Learning: Challenges, Methods and Progress | Saurabh Arora, Prashant Doshi 18 Jun 2018
- Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications | Daniel S. Brown, Scott Niekum 23 Jun 2018
Inverse reinforcement learning (IRL) infers a reward function from observed behavior (demonstrations). A policy optimized against the recovered reward can then improve on the demonstrations and generalize beyond them. Whereas ordinary reinforcement learning uses rewards and punishments to learn behavior, IRL reverses the direction: an agent, such as a robot, observes a person's behavior and works out what goal that behavior appears to be pursuing.
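To make the reversed direction concrete, here is a minimal sketch of one way IRL can be set up. It is not taken from the sources listed above. It assumes a hypothetical five-state chain world, one weight per state as the reward, and a maximum-entropy-style update that nudges the weights until the learner's discounted state visits match the expert's. All names (`soft_policy`, `feature_expectations`, `irl`) and constants are illustrative.

```python
import math

N_STATES = 5        # states 0..4 on a line; the expert's (hidden) goal is state 4
ACTIONS = (-1, +1)  # index 0 = move left, index 1 = move right
GAMMA = 0.9         # discount factor (assumed)
HORIZON = 10        # steps per demonstration (assumed)

def step(s, a):
    """Deterministic transition; walls clamp the agent to the chain."""
    return min(max(s + a, 0), N_STATES - 1)

def logsumexp(xs):
    m = max(xs)
    return m + math.log(sum(math.exp(x - m) for x in xs))

def soft_policy(w, iters=100):
    """Soft value iteration for reward r(s') = w[s']; returns pi[s][action index]."""
    v = [0.0] * N_STATES
    for _ in range(iters):
        q = [[w[step(s, a)] + GAMMA * v[step(s, a)] for a in ACTIONS]
             for s in range(N_STATES)]
        v = [logsumexp(qs) for qs in q]
    return [[math.exp(x - v[s]) for x in q[s]] for s in range(N_STATES)]

def feature_expectations(pi, start=0):
    """Discounted expected visits to each state over HORIZON steps under pi."""
    d = [0.0] * N_STATES
    d[start] = 1.0
    mu = [0.0] * N_STATES
    for t in range(HORIZON):
        nxt = [0.0] * N_STATES
        for s in range(N_STATES):
            for i, a in enumerate(ACTIONS):
                nxt[step(s, a)] += d[s] * pi[s][i]
        d = nxt
        for s in range(N_STATES):
            mu[s] += (GAMMA ** t) * d[s]
    return mu

def irl(mu_expert, lr=0.1, epochs=200):
    """Raise the reward of states the expert visits more often than the learner."""
    w = [0.0] * N_STATES
    for _ in range(epochs):
        mu = feature_expectations(soft_policy(w))
        w = [wi + lr * (me - m) for wi, me, m in zip(w, mu_expert, mu)]
    return w

# Observed behavior: the expert always moves right. No reward is given to the learner.
expert_pi = [[0.0, 1.0] for _ in range(N_STATES)]
mu_expert = feature_expectations(expert_pi)

w = irl(mu_expert)    # inferred reward weights, one per state
pi = soft_policy(w)   # policy obtained by optimizing the inferred reward
print("recovered reward weights:", [round(x, 2) for x in w])
print("P(move right) per state:", [round(p[1], 2) for p in pi])
```

The learner sees only where the expert goes, yet the weights it recovers favor the right end of the chain, and a policy optimized against them reproduces the rightward behavior. The recovered weights are one of many reward functions consistent with the demonstrations, so they should be read as a plausible explanation of the behavior, not as the expert's true reward.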