Difference between revisions of "Google DeepMind AlphaGo Zero"

From
Jump to: navigation, search
Line 13: Line 13:
 
* [[Google DeepMind AlphaFold]]
 
* [[Google DeepMind AlphaFold]]
 
* [[Minigo]]
 
* [[Minigo]]
 
+
* [http://www.nature.com/articles/nature16961 Mastering the game of Go with deep neural networks and tree search | D. Silver, A. Huang, C. Maddison, A. Guez, L. Sifre, G. van den Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot, S. Dieleman, D. Grewe, J. Nham, N. Kalchbrenner, I. Sutskever, T. Lillicrap, M. Leach, K. Kavukcuoglu, T. Graepel & D. Hassabis]
* [http://www.nature.com/articles/nature24270.epdf?author_access_token=VJXbVjaSHxFoctQQ4p2k4tRgN0jAjWel9jnR3ZoTv0PVW4gB86EEpGqTRDtpIz-2rmo8-KG06gqVobU5NSCFeHILHcVFUeMsbvwS-lxjqQGg98faovwjxeTUgZAUMnRQ Mastering the game of Go without human knowledge | Google DeepMind: David Silver, Julian Schrittwieser, Karen Simonyan, Ioannis Antonoglou, Aja Huang, Arthur Guez, Thomas Hubert, Lucas Baker, Matthew Lai, Adrian Bolton, Yutian Chen, Timothy Lillicrap, Fan Hui, Laurent Sifre, George wan den Driessehe, Thore Graepel, & Demis Hassabis]
+
* [http://www.nature.com/articles/nature24270.epdf?author_access_token=VJXbVjaSHxFoctQQ4p2k4tRgN0jAjWel9jnR3ZoTv0PVW4gB86EEpGqTRDtpIz-2rmo8-KG06gqVobU5NSCFeHILHcVFUeMsbvwS-lxjqQGg98faovwjxeTUgZAUMnRQ Mastering the game of Go without human knowledge | Google DeepMind: D. Silver, J. Schrittwieser, K. Simonyan, I. Antonoglou, A. Huang, A. Guez, T. Hubert, L. Baker, M. Lai, A. Bolton, Y. Chen, T. Lillicrap, F. Hui, L. Sifre, G. wan den Driessehe, T. Graepel, & D. Hassabis]
 
* [http://science.sciencemag.org/content/sci/362/6419/1140.full.pdf A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play | Google DeepMind: David Silver, Julian Schrittwieser, Karen Simonyan, Ioannis Antonoglou, Aja Huang, Arthur Guez, Thomas Hubert, Lucas Baker, Matthew Lai, Adrian Bolton, Yutian Chen, Timothy Lillicrap, Fan Hui, Laurent Sifre, George wan den Driessehe, Thore Graepel, & Demis Hassabis - Science]  
 
* [http://science.sciencemag.org/content/sci/362/6419/1140.full.pdf A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play | Google DeepMind: David Silver, Julian Schrittwieser, Karen Simonyan, Ioannis Antonoglou, Aja Huang, Arthur Guez, Thomas Hubert, Lucas Baker, Matthew Lai, Adrian Bolton, Yutian Chen, Timothy Lillicrap, Fan Hui, Laurent Sifre, George wan den Driessehe, Thore Graepel, & Demis Hassabis - Science]  
 
* [http://medium.com/applied-data-science/alphago-zero-explained-in-one-diagram-365f5abf67e0 AlphaGo Zero Explained In One Diagram | David Foster]
 
* [http://medium.com/applied-data-science/alphago-zero-explained-in-one-diagram-365f5abf67e0 AlphaGo Zero Explained In One Diagram | David Foster]

Revision as of 11:12, 11 August 2019