Difference between revisions of "Deep Distributed Q Network Partial Observability"
Line 12:
  * [http://www.cs.utexas.edu/~larg/hausknecht_thesis/slides/peter_ijcai.pdf Deep Multiagent Reinforcement Learning for Partially Observable Parameterized Environments | Peter Stone]
  * [http://www.ifaamas.org/Proceedings/aamas2016/pdfs/p530.pdf Reinforcement Learning in Partially Observable Multiagent Settings: Monte Carlo Exploring Policies with PAC Bounds]
+ * [[Monte Carlo]]
  <youtube>8JeweuKOA1M</youtube>
  <youtube>dMOUp7YzUpQ</youtube>
Revision as of 15:49, 11 August 2019
- Architectures
- Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability | ArXiv
- Deep Multiagent Reinforcement Learning for Partially Observable Parameterized Environments | Peter Stone
- Reinforcement Learning in Partially Observable Multiagent Settings: Monte Carlo Exploring Policies with PAC Bounds
- Monte Carlo