Kosmos-1

Kosmos-1 | Microsoft
Multimodal Language Models ... Generative Pre-trained Transformer (GPT-4) ... GPT-5
Large Language Model (LLM) ... Natural Language Processing (NLP) ... Generation ... Classification ... Understanding ... Translation ... Tools & Services
Assistants ... Personal Companions ... Agents ... Negotiation ... LangChain
Attention Mechanism ... Transformer ... Generative Pre-trained Transformer (GPT) ... GAN ... BERT
Generative AI ... Conversational AI ... OpenAI's ChatGPT ... Perplexity ... Microsoft's Bing ... You ... Google's Bard ... Baidu's Ernie
Video/Image ... Vision ... Enhancement ... Fake ... Reconstruction ... Colorize ... Occlusions ... Predict image ... Image/Video Transfer Learning
End-to-End Speech ... Synthesize Speech ... Speech Recognition ... Music
Analytics ... Visualization ... Graphical Tools ... Loop ... Diagrams & Business Analysis ... Requirements ... Bayes ... Network Pattern
Development ... Notebooks ... AI Pair Programming ... Codeless, Generators, Drag n' Drop ... AIOps/MLOps ... AIaaS/MLaaS
Prompt Engineering (PE) ... PromptBase ... Prompt Injection Attack
Foundation Models (FM)
Singularity ... Sentience ... AGI ... Curious Reasoning ... Emergence ... Moonshots ... Explainable AI ... Automated Learning

Kosmos-1 is a multimodal large language model from Microsoft that can perceive general modalities, learn in context (i.e., few-shot), and follow instructions (i.e., zero-shot). It can analyze images for content, solve visual puzzles, perform visual text recognition, and pass visual IQ tests. The model has about 1.6 billion (1.6B) parameters.
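The difference between zero-shot and few-shot use can be illustrated with a minimal sketch of how an interleaved image-and-text prompt might be assembled. This is not Kosmos-1's actual interface: the `Image` class, the helper functions, and the `<image:...>` placeholder format are all hypothetical and exist only to show the shape of the two prompt styles.

```python
from dataclasses import dataclass
from typing import List, Tuple, Union


@dataclass(frozen=True)
class Image:
    """Hypothetical stand-in for an image input; only the path is stored."""
    path: str


# A prompt is an interleaved sequence of text strings and images.
Segment = Union[str, Image]


def zero_shot_prompt(query_image: Image, instruction: str) -> List[Segment]:
    """Zero-shot: the query image plus an instruction, with no worked examples."""
    return [query_image, instruction]


def few_shot_prompt(
    examples: List[Tuple[Image, str]],
    query_image: Image,
    question: str,
) -> List[Segment]:
    """Few-shot: solved (image, answer) pairs come first, then the open query."""
    prompt: List[Segment] = []
    for image, answer in examples:
        prompt += [image, f"{question} {answer}"]
    prompt += [query_image, question]
    return prompt


def render(prompt: List[Segment]) -> str:
    """Flatten a prompt to text, using an assumed <image:path> placeholder."""
    return " ".join(
        f"<image:{seg.path}>" if isinstance(seg, Image) else seg
        for seg in prompt
    )


if __name__ == "__main__":
    question = "Question: What animal is this? Answer:"
    print(render(zero_shot_prompt(Image("cat.jpg"), question)))
    print(render(few_shot_prompt([(Image("dog.jpg"), "dog")], Image("cat.jpg"), question)))
```

In the few-shot case the model sees completed examples in its context before the open query, while in the zero-shot case it must rely on the instruction alone.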
