Difference between revisions of "Gato"
m |
m (Text replacement - "http:" to "https:") |
||
| Line 5: | Line 5: | ||
|description=Helpful resources for your journey with artificial intelligence; videos, articles, techniques, courses, profiles, and tools | |description=Helpful resources for your journey with artificial intelligence; videos, articles, techniques, courses, profiles, and tools | ||
}} | }} | ||
| − | [ | + | [https://www.youtube.com/results?search_query=Google+Gato YouTube search...] |
| − | [ | + | [https://www.google.com/search?q=Google+Gato ...Google search] |
| − | * [ | + | * [https://ai.google/tools/ Google's Tools and Resources] |
* [[Google]] | * [[Google]] | ||
* [[Attention]] Mechanism ...[[Transformer]] Model ...[[Generative Pre-trained Transformer (GPT)]] | * [[Attention]] Mechanism ...[[Transformer]] Model ...[[Generative Pre-trained Transformer (GPT)]] | ||
| − | * [ | + | * [https://storage.googleapis.com/deepmind-media/A%20Generalist%20Agent/Generalist%20Agent.pdf A Generalist] [[Agents|Agent]] | S. Reed, K. Żołna, E. Parisotto, S. Gómez Colmenarejo, A. Novikov, G. Barth-Maron, M. Giménez, Y. Sulsky, J. Kay, J. Springenberg, T. Eccles, J. Bruce, A. Razavi, A. Edwards, N. Heess, Y. Chen, R. Hadsell, O. Vinyals, M. Bordbar and N. de Freitas - DeepMind |
| − | * [ | + | * [https://www.louisbouchard.ai/deepmind-gato/ Deepmind's new model Gato is amazing! | Louis Bouchard] |
DeepMind's “generalist” AI model inspired by progress in large-scale language modeling, we apply a similar approach towards building a single generalist [[Agents|agent]] beyond the realm of text outputs. The [[Agents|agent]], which we refer to as Gato, works as a multi-modal, multi-task, multi-embodiment generalist policy. The same network with the same weights can play Atari, caption images, chat, stack blocks with a real robot arm and much more, deciding based on its context whether to output text, joint torques, button presses, or other tokens. | DeepMind's “generalist” AI model inspired by progress in large-scale language modeling, we apply a similar approach towards building a single generalist [[Agents|agent]] beyond the realm of text outputs. The [[Agents|agent]], which we refer to as Gato, works as a multi-modal, multi-task, multi-embodiment generalist policy. The same network with the same weights can play Atari, caption images, chat, stack blocks with a real robot arm and much more, deciding based on its context whether to output text, joint torques, button presses, or other tokens. | ||
| Line 18: | Line 18: | ||
Gato has 16 [[Attention]] Heads... | Gato has 16 [[Attention]] Heads... | ||
| − | <img src=" | + | <img src="https://i.gzn.jp/img/2022/05/18/deepmind-gato/gato_m.png" width="800"> |
{|<!-- T --> | {|<!-- T --> | ||
| Line 50: | Line 50: | ||
<youtube>6fWEHrXN9zo</youtube> | <youtube>6fWEHrXN9zo</youtube> | ||
<b>Integrated AI - Gato by DeepMind (May/2022) 1.2B + Asimo, GPT-3, Tesla Optimus, Boston Dynamics | <b>Integrated AI - Gato by DeepMind (May/2022) 1.2B + Asimo, GPT-3, Tesla Optimus, Boston Dynamics | ||
| − | </b><br>Dr Alan D. Thompson is a world expert in artificial intelligence (AI), specializing in the augmentation of human intelligence, and advancing the evolution of ‘integrated AI’. Alan’s applied AI research and visualizations are featured across major international media, including citations in the University of Oxford’s debate on AI Ethics in December 2021. | + | </b><br>Dr Alan D. Thompson is a world expert in artificial intelligence (AI), specializing in the augmentation of human intelligence, and advancing the evolution of ‘integrated AI’. Alan’s applied AI research and visualizations are featured across major international media, including citations in the University of Oxford’s debate on AI Ethics in December 2021. https://lifearchitect.ai/ |
|} | |} | ||
|<!-- M --> | |<!-- M --> | ||
Revision as of 14:31, 28 March 2023
YouTube search... ...Google search
- Google's Tools and Resources
- Attention Mechanism ...Transformer Model ...Generative Pre-trained Transformer (GPT)
- A Generalist Agent | S. Reed, K. Żołna, E. Parisotto, S. Gómez Colmenarejo, A. Novikov, G. Barth-Maron, M. Giménez, Y. Sulsky, J. Kay, J. Springenberg, T. Eccles, J. Bruce, A. Razavi, A. Edwards, N. Heess, Y. Chen, R. Hadsell, O. Vinyals, M. Bordbar and N. de Freitas - DeepMind
- Deepmind's new model Gato is amazing! | Louis Bouchard
DeepMind's “generalist” AI model inspired by progress in large-scale language modeling, we apply a similar approach towards building a single generalist agent beyond the realm of text outputs. The agent, which we refer to as Gato, works as a multi-modal, multi-task, multi-embodiment generalist policy. The same network with the same weights can play Atari, caption images, chat, stack blocks with a real robot arm and much more, deciding based on its context whether to output text, joint torques, button presses, or other tokens.
Gato has 16 Attention Heads...
|
|
|
|