Difference between revisions of "Apprenticeship Learning - Inverse Reinforcement Learning (IRL)"

Revision as of 07:27, 4 August 2018

Inverse reinforcement learning (IRL) infers a reward function from observed behavior or demonstrations, which allows for policy improvement and generalization. While ordinary reinforcement learning uses rewards and punishments to learn behavior, IRL reverses the direction: a robot observes a person's behavior to work out what goal that behavior appears to be trying to achieve.
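
To make the "reversed direction" concrete, here is a minimal sketch, not a full IRL algorithm. It assumes a hypothetical five-state corridor, one-hot state features, a discount factor of 0.9, and hand-written demonstrations; none of these come from the page. It compares the discounted state-visitation counts (feature expectations) of the demonstrations against a baseline behavior and treats the difference as a crude reward estimate, loosely in the spirit of the feature-expectation matching used in apprenticeship learning.

```python
# Illustrative assumptions: a 1-D corridor with states 0..4 and one-hot state features.
N_STATES = 5
GAMMA = 0.9  # assumed discount factor


def feature_expectations(trajectories, gamma=GAMMA):
    """Average discounted visitation count of each state over the trajectories."""
    mu = [0.0] * N_STATES
    for states in trajectories:
        for t, s in enumerate(states):
            mu[s] += gamma ** t
    return [m / len(trajectories) for m in mu]


# Hypothetical demonstrations: the demonstrator walks right and stays in state 4.
expert = [[0, 1, 2, 3, 4, 4, 4], [1, 2, 3, 4, 4, 4, 4]]
# Hypothetical baseline behavior: wanders back and forth near the start.
baseline = [[0, 1, 0, 1, 0, 1, 0], [1, 0, 1, 0, 1, 0, 1]]

mu_expert = feature_expectations(expert)
mu_baseline = feature_expectations(baseline)

# Crude reward estimate: states the demonstrator visits more than the baseline
# get positive reward, states it visits less get negative reward.
reward = [e - b for e, b in zip(mu_expert, mu_baseline)]

# The state with the highest estimated reward is the inferred goal.
inferred_goal = max(range(N_STATES), key=lambda s: reward[s])
print("estimated reward per state:", [round(r, 3) for r in reward])
print("inferred goal state:", inferred_goal)
```

A practical IRL method would alternate between estimating a reward like this and re-solving for a policy under it, repeating until the learned policy's feature expectations approach those of the demonstrations; this sketch shows only a single estimation step.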

@@ Line 2: / Line 2: @@
 * [[Reinforcement Learning]]
+* [[Generative Adversarial Network (GAN)]]
 * [http://arxiv.org/pdf/1806.06877.pdf A Survey of Inverse Reinforcement Learning: Challenges, Methods and Progress | Saurabh Arora, Prashant Doshi] 18 Jun 2018
 * [http://arxiv.org/pdf/1805.07687.pdf Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications | Daniel S. Brown, Scott Niekum] 23 Jun 2018
-Inverse reinforcement learning (IRL) infers/deriving a reward function from observed behavior/demonstrations, allowing for policy improvement and generalization. While ordinary "reinforcement learning" involves using rewards and punishments to learn behavior, in IRL the direction is reversed, and a robot observes a person's behavior to figure out what goal that behavior seems to be trying to achieve.
+Inverse reinforcement learning (IRL) infers/derives a reward function from observed behavior/demonstrations, allowing for policy improvement and generalization. While ordinary "reinforcement learning" involves using rewards and punishments to learn behavior, in IRL the direction is reversed, and a robot observes a person's behavior to figure out what goal that behavior seems to be trying to achieve.
+<youtube>0q30_gDlrwk</youtube>
 <youtube>h7uGyBcIeII</youtube>
 <youtube>d9DlQSJQAoI</youtube>
 <youtube>JbNeLiNnvII</youtube>
 <youtube>f9UpSJdWwkQ</youtube>
+<youtube>giH0wWOXX_E</youtube>
-<youtube>d9DlQSJQAoI</youtube>
+<youtube>xNvNeg7JGSM</youtube>
-<youtube>f9UpSJdWwkQ</youtube>
+<youtube>fu7uBNWTzU8</youtube>
