Difference between revisions of "Apprenticeship Learning - Inverse Reinforcement Learning (IRL)"

Revision as of 10:10, 29 October 2018

Inverse reinforcement learning (IRL) infers/derives a reward function from observed behavior/demonstrations, allowing for policy improvement and generalization. While ordinary "reinforcement learning" involves using rewards and punishments to learn behavior, in IRL the direction is reversed, and a robot observes a person's behavior to figure out what goal that behavior seems to be trying to achieve.

@@ Line 1: / Line 1: @@
 [http://www.youtube.com/results?search_query=Inverse+Reinforcement+Machine+Learning+Apprenticeship YouTube search...]
+* [[Imitation Learning]]
 * [[Reinforcement Learning]]
 * [[Inside Out - Curious Optimistic Reasoning]]
@@ Line 18: / Line 19: @@
 <youtube>xNvNeg7JGSM</youtube>
 <youtube>fu7uBNWTzU8</youtube>
-== Imitation Learning ==
-[http://www.youtube.com/results?search_query=Imitation+Learning+Machine YouTube search...]
-The ongoing explosion of spatiotemporal tracking data has now made it possible to analyze and model fine-grained behaviors in a wide range of domains. For instance, tracking data is now being collected for every NBA basketball game with players, referees, and the ball tracked at 25 Hz, along with annotated game events such as passes, shots, and fouls. Other settings include laboratory animals, people in public spaces, professionals in settings such as operating rooms, actors speaking and performing, digital avatars in virtual environments, and even the behavior of other computational systems.
-<youtube>WjFdD7PDGw0</youtube>
-<youtube>teyGpr2Dgm4</youtube>
-<youtube>ZMhO1FO_j0o</youtube>
-<youtube>KBms4_LKbbg</youtube>

Difference between revisions of "Apprenticeship Learning - Inverse Reinforcement Learning (IRL)"

Revision as of 10:10, 29 October 2018

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools