Difference between revisions of "Deep Distributed Q Network Partial Observability"

From
Jump to: navigation, search
Line 12: Line 12:
 
* [http://www.cs.utexas.edu/~larg/hausknecht_thesis/slides/peter_ijcai.pdf Deep Multiagent Reinforcement Learning for Partially Observable Parameterized Environments | Peter Stone]
 
* [http://www.cs.utexas.edu/~larg/hausknecht_thesis/slides/peter_ijcai.pdf Deep Multiagent Reinforcement Learning for Partially Observable Parameterized Environments | Peter Stone]
 
* [http://www.ifaamas.org/Proceedings/aamas2016/pdfs/p530.pdf Reinforcement Learning in Partially Observable Multiagent Settings: Monte Carlo Exploring Policies with PAC Bounds]
 
* [http://www.ifaamas.org/Proceedings/aamas2016/pdfs/p530.pdf Reinforcement Learning in Partially Observable Multiagent Settings: Monte Carlo Exploring Policies with PAC Bounds]
 +
* [[Monte Carlo]]
  
 
<youtube>8JeweuKOA1M</youtube>
 
<youtube>8JeweuKOA1M</youtube>
 
<youtube>dMOUp7YzUpQ</youtube>
 
<youtube>dMOUp7YzUpQ</youtube>

Revision as of 15:49, 11 August 2019