Difference between revisions of "Deep Distributed Q Network Partial Observability"
(Created page with "[http://www.youtube.com/results?search_query=deep+distributed+Q+network+partial+observability Youtube search...] * [https://arxiv.org/pdf/1703.06182.pdf Deep Decentralized Mu...") |
|||
| Line 1: | Line 1: | ||
[http://www.youtube.com/results?search_query=deep+distributed+Q+network+partial+observability Youtube search...] | [http://www.youtube.com/results?search_query=deep+distributed+Q+network+partial+observability Youtube search...] | ||
| − | * [ | + | * [[Architectures]] |
| + | * [http://arxiv.org/pdf/1703.06182.pdf Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability | ArXiv] | ||
* [http://www.cs.utexas.edu/~larg/hausknecht_thesis/slides/peter_ijcai.pdf Deep Multiagent Reinforcement Learning for Partially Observable Parameterized Environments | Peter Stone] | * [http://www.cs.utexas.edu/~larg/hausknecht_thesis/slides/peter_ijcai.pdf Deep Multiagent Reinforcement Learning for Partially Observable Parameterized Environments | Peter Stone] | ||
* [http://www.ifaamas.org/Proceedings/aamas2016/pdfs/p530.pdf Reinforcement Learning in Partially Observable Multiagent Settings: Monte Carlo Exploring Policies with PAC Bounds] | * [http://www.ifaamas.org/Proceedings/aamas2016/pdfs/p530.pdf Reinforcement Learning in Partially Observable Multiagent Settings: Monte Carlo Exploring Policies with PAC Bounds] | ||
Revision as of 18:22, 25 May 2018
- Architectures
- Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability | ArXiv
- Deep Multiagent Reinforcement Learning for Partially Observable Parameterized Environments | Peter Stone
- Reinforcement Learning in Partially Observable Multiagent Settings: Monte Carlo Exploring Policies with PAC Bounds