Revision as of 13:26, 20 March 2023

YouTube ... Quora ... Google search ... Google News ... Bing News

Capabilities
- End-to-End Speech ... Synthesize Speech ... Speech Recognition
- Video ... Generated Image ... Colorize ... Image/Video Transfer Learning
Assistants ... Hybrid Assistants ... Agents ... Negotiation
Natural Language Processing (NLP) ... Generation ... LLM ... Tools & Services
Attention Mechanism ... Transformer Model ... Generative Pre-trained Transformer (GPT)
Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM)
Generative AI ... OpenAI's ChatGPT ... Perplexity ... Microsoft's BingAI ... You ... Google's Bard
Speechmatics Introduces Ursa: A Speech-To-Text System That Delivers Unprecedented Performance Across A Diverse Range of Voices | Tanushree Shenwai - MarkTechPost

Whisper Automatic Speech Recognition Service (ASR)

Whisper is an Automatic Speech Recognition (ASR) system from OpenAI, trained on 680,000 hours of multilingual and multitask supervised data collected from the web. 'We’ve trained and are open-sourcing a neural net called Whisper that approaches human level robustness and accuracy on English speech recognition.' - OpenAI
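
Because Whisper is open source, it can be run locally. The following is a minimal sketch, assuming the `openai-whisper` Python package and `ffmpeg` are installed; the function names `transcribe_file` and `format_segments`, the default `"base"` model size, and the audio path are illustrative choices, not something this page specifies.

```python
from typing import Any


def transcribe_file(audio_path: str, model_name: str = "base") -> dict[str, Any]:
    """Transcribe one audio file with the open-source Whisper package.

    Assumption: `pip install openai-whisper` has been run and ffmpeg is on PATH.
    """
    # Imported lazily so the formatting helper below works without the package.
    import whisper

    model = whisper.load_model(model_name)  # downloads weights on first use
    # The result is a dict that includes "text", "segments", and "language".
    return model.transcribe(audio_path)


def format_segments(segments: list[dict[str, Any]]) -> list[str]:
    """Render Whisper-style segments as '[start - end] text' lines."""
    lines = []
    for seg in segments:
        start, end = seg["start"], seg["end"]
        lines.append(f"[{start:.2f}s - {end:.2f}s] {seg['text'].strip()}")
    return lines
```

A typical call would be `result = transcribe_file("speech.mp3")` (a hypothetical file), followed by `print(result["text"])` for the full transcript or `format_segments(result["segments"])` for timestamped lines.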

@@ Line 12: / Line 12: @@
 * [[Capabilities]]
 ** [[End-to-End Speech]] ... [[Synthesize Speech]] ... [[Speech Recognition]]
+** [[Video]] ... [[Generated Image]] ... [[Colorize]] ... [[Image/Video Transfer Learning]]
 * [[Assistants]] ... [[Hybrid Assistants]]  ... [[Agents]]  ... [[Negotiation]]
 * [[Natural Language Processing (NLP)]]  ...[[Natural Language Generation (NLG)|Generation]]  ...[[Large Language Model (LLM)|LLM]]  ...[[Natural Language Tools & Services|Tools & Services]]

Difference between revisions of "Speech Recognition"
