Difference between revisions of "Google DeepMind AlphaGo Zero"

Revision as of 11:12, 11 August 2019

Service Capabilities
Evolutionary Computation / Genetic Algorithms
Architectures
Google DeepMind AlphaFold
Minigo
Mastering the game of Go with deep neural networks and tree search | D. Silver, A. Huang, C. Maddison, A. Guez, L. Sifre, G. van den Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot, S. Dieleman, D. Grewe, J. Nham, N. Kalchbrenner, I. Sutskever, T. Lillicrap, M. Leach, K. Kavukcuoglu, T. Graepel & D. Hassabis
Mastering the game of Go without human knowledge | Google DeepMind: D. Silver, J. Schrittwieser, K. Simonyan, I. Antonoglou, A. Huang, A. Guez, T. Hubert, L. Baker, M. Lai, A. Bolton, Y. Chen, T. Lillicrap, F. Hui, L. Sifre, G. van den Driessche, T. Graepel, & D. Hassabis
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play | Google DeepMind: David Silver, Julian Schrittwieser, Karen Simonyan, Ioannis Antonoglou, Aja Huang, Arthur Guez, Thomas Hubert, Lucas Baker, Matthew Lai, Adrian Bolton, Yutian Chen, Timothy Lillicrap, Fan Hui, Laurent Sifre, George van den Driessche, Thore Graepel, & Demis Hassabis - Science
AlphaGo Zero Explained In One Diagram | David Foster

Monte Carlo Tree Search

Markov Decision Process (MDP)

@@ Line 13: / Line 13: @@
 * [[Google DeepMind AlphaFold]]
 * [[Minigo]]
+* [http://www.nature.com/articles/nature16961 Mastering the game of Go with deep neural networks and tree search | D. Silver, A. Huang, C. Maddison, A. Guez, L. Sifre, G. van den Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot, S. Dieleman, D. Grewe, J. Nham, N. Kalchbrenner, I. Sutskever, T. Lillicrap, M. Leach, K. Kavukcuoglu, T. Graepel & D. Hassabis]
-* [http://www.nature.com/articles/nature24270.epdf?author_access_token=VJXbVjaSHxFoctQQ4p2k4tRgN0jAjWel9jnR3ZoTv0PVW4gB86EEpGqTRDtpIz-2rmo8-KG06gqVobU5NSCFeHILHcVFUeMsbvwS-lxjqQGg98faovwjxeTUgZAUMnRQ Mastering the game of Go without human knowledge | Google DeepMind: David Silver, Julian Schrittwieser, Karen Simonyan, Ioannis Antonoglou, Aja Huang, Arthur Guez, Thomas Hubert, Lucas Baker, Matthew Lai, Adrian Bolton, Yutian Chen, Timothy Lillicrap, Fan Hui, Laurent Sifre, George wan den Driessehe, Thore Graepel, & Demis Hassabis]
+* [http://www.nature.com/articles/nature24270.epdf?author_access_token=VJXbVjaSHxFoctQQ4p2k4tRgN0jAjWel9jnR3ZoTv0PVW4gB86EEpGqTRDtpIz-2rmo8-KG06gqVobU5NSCFeHILHcVFUeMsbvwS-lxjqQGg98faovwjxeTUgZAUMnRQ Mastering the game of Go without human knowledge | Google DeepMind: D. Silver, J. Schrittwieser, K. Simonyan, I. Antonoglou, A. Huang, A. Guez, T. Hubert, L. Baker, M. Lai, A. Bolton, Y. Chen, T. Lillicrap, F. Hui, L. Sifre, G. van den Driessche, T. Graepel, & D. Hassabis]
 * [http://science.sciencemag.org/content/sci/362/6419/1140.full.pdf A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play | Google DeepMind: David Silver, Julian Schrittwieser, Karen Simonyan, Ioannis Antonoglou, Aja Huang, Arthur Guez, Thomas Hubert, Lucas Baker, Matthew Lai, Adrian Bolton, Yutian Chen, Timothy Lillicrap, Fan Hui, Laurent Sifre, George van den Driessche, Thore Graepel, & Demis Hassabis - Science]
 * [http://medium.com/applied-data-science/alphago-zero-explained-in-one-diagram-365f5abf67e0 AlphaGo Zero Explained In One Diagram | David Foster]
