Difference between revisions of "PaLM"

Revision as of 10:33, 29 April 2023


PaLM-E | Google
Multimodal Language Models
Large Language Model (LLM) ... Natural Language Processing (NLP) ... Generation ... Classification ... Understanding ... Translation ... Tools & Services
Assistants ... Agents ... Negotiation ... HuggingGPT ... LangChain
Attention Mechanism ... Transformer Model ... Generative Pre-trained Transformer (GPT)
Generative AI ... Conversational AI ... OpenAI's ChatGPT ... Perplexity ... Microsoft's Bing ... You ... Google's Bard ... Baidu's Ernie
Capabilities
- Video/Image ... Vision ... Colorize ... Image/Video Transfer Learning
- End-to-End Speech ... Synthesize Speech ... Speech Recognition
Development ... AI Pair Programming Tools ... Analytics ... Visualization ... Diagrams for Business Analysis
Prompt Engineering (PE)
Foundation Models (FM)
Singularity ... Moonshots ... Emergence ... Explainable / Interpretable AI ... AGI ... Inside Out - Curious Optimistic Reasoning ... Automated Learning

PaLM-E is an embodied multimodal language model that feeds real-world continuous sensor modalities directly into a language model, thereby linking words to percepts. Google developed it as a model for robotics; it can solve a variety of tasks on multiple types of robots and across multiple modalities (images, robot states, and neural scene representations). PaLM-E is also a generally capable vision-and-language model: it can perform visual tasks such as describing images, detecting objects, or classifying scenes, and it is proficient at language tasks such as quoting poetry, solving math equations, or generating code. The largest version has 562B parameters.

https://palm-e.github.io/videos/palm-e-teaser.mp4
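
To make the idea of "incorporating continuous sensor modalities into a language model" more concrete, here is a minimal sketch of one way it could work: each sensor reading is projected into the same vector space as the text-token embeddings, and the resulting vectors are interleaved with the text in a single input sequence. Everything in it is an assumption made for illustration: the names `project` and `build_multimodal_sequence`, the embedding width, the random weights, and the use of a plain linear projection. It is not PaLM-E's actual code, encoders, or dimensions.

```python
import random

EMBED_DIM = 4  # hypothetical embedding width, chosen only for illustration


def make_matrix(rows, cols, rng):
    """Return a rows x cols matrix of random weights (stand-in for learned ones)."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def project(features, weights):
    """Linearly map a sensor feature vector into the token-embedding space.

    `weights` has one row per input feature and one column per embedding dim.
    """
    return [sum(f * w for f, w in zip(features, column)) for column in zip(*weights)]


def build_multimodal_sequence(segments, token_table, projections):
    """Interleave text-token embeddings with projected sensor readings.

    `segments` is a list of ("text", token) or (modality_name, feature_vector).
    """
    sequence = []
    for kind, value in segments:
        if kind == "text":
            sequence.append(token_table[value])                  # ordinary token lookup
        else:
            sequence.append(project(value, projections[kind]))   # continuous input
    return sequence


rng = random.Random(0)
vocab = ["pick", "up", "the", "block"]
token_table = {tok: [rng.uniform(-1.0, 1.0) for _ in range(EMBED_DIM)] for tok in vocab}

# One hypothetical projection per modality; feature sizes are arbitrary.
projections = {
    "image": make_matrix(3, EMBED_DIM, rng),
    "robot_state": make_matrix(2, EMBED_DIM, rng),
}

segments = [
    ("image", [0.2, 0.5, 0.1]),
    ("text", "pick"), ("text", "up"), ("text", "the"), ("text", "block"),
    ("robot_state", [0.0, 1.0]),
]

sequence = build_multimodal_sequence(segments, token_table, projections)
print(len(sequence), len(sequence[0]))  # 6 4
```

In this sketch the language model would see one sequence of six equally sized vectors and need not distinguish which ones came from text and which from sensors; in a real system the projections would be learned rather than random.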

Revision as of 10:05, 29 April 2023 by BPeat (minor edit, older) compared with revision as of 10:33, 29 April 2023 by BPeat (minor edit, newer)
