Google DeepMind AlphaGo Zero
Revision as of 11:12, 11 August 2019
- Service Capabilities
- Evolutionary Computation / Genetic Algorithms
- Architectures
- Google DeepMind AlphaFold
- Minigo
- Mastering the game of Go with deep neural networks and tree search | D. Silver, A. Huang, C. Maddison, A. Guez, L. Sifre, G. van den Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot, S. Dieleman, D. Grewe, J. Nham, N. Kalchbrenner, I. Sutskever, T. Lillicrap, M. Leach, K. Kavukcuoglu, T. Graepel & D. Hassabis
- Mastering the game of Go without human knowledge | Google DeepMind: D. Silver, J. Schrittwieser, K. Simonyan, I. Antonoglou, A. Huang, A. Guez, T. Hubert, L. Baker, M. Lai, A. Bolton, Y. Chen, T. Lillicrap, F. Hui, L. Sifre, G. van den Driessche, T. Graepel & D. Hassabis
- A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play | Google DeepMind: David Silver, Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Matthew Lai, Arthur Guez, Marc Lanctot, Laurent Sifre, Dharshan Kumaran, Thore Graepel, Timothy Lillicrap, Karen Simonyan & Demis Hassabis - Science
- AlphaGo Zero Explained In One Diagram | David Foster
Monte Carlo Tree Search