Difference between revisions of "Speech Recognition"
Revision as of 13:26, 16 March 2023
- Capabilities
- Assistants ... Hybrid Assistants ... Agents ... Negotiation
- Natural Language Processing (NLP) ... Generation ... LLM ... Tools & Services
- Attention Mechanism ... Transformer Model ... Generative Pre-trained Transformer (GPT)
- Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM)
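- Generative AI ... OpenAI's ChatGPT ... Perplexity ... Microsoft's BingAI ... You ... Google's Bard
- Iused (http://www.theverge.com/2022/9/23/23367296/openai-whisper-transcription-speech-recognition-open-source)
- Speechmatics Introduces Ursa: A Speech-To-Text System That Delivers Unprecedented Performance Across A Diverse Range of Voices | Tanushree Shenwai - MarkTechPost (https://www.marktechpost.com/2023/03/15/speechmatics-introduces-ursa-a-speech-to-text-system-that-delivers-unprecedented-performance-across-a-diverse-range-of-voices/)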
Whisper Automatic Speech Recognition Service (ASR)
Whisper is an automatic speech recognition (ASR) system by OpenAI, trained on 680,000 hours of multilingual and multitask supervised data collected from the web. "We’ve trained and are open-sourcing a neural net called Whisper that approaches human level robustness and accuracy on English speech recognition." - OpenAI
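Because Whisper is open source, it can be run locally. The following is a minimal sketch, assuming the `openai-whisper` Python package and `ffmpeg` are installed; it uses the package's `load_model` and `transcribe` calls. The `format_segments` helper and the file name `audio.mp3` are hypothetical, introduced here only for illustration.

```python
from typing import Dict, List


def transcribe(path: str, model_name: str = "base") -> Dict:
    """Transcribe an audio file with the open-source `whisper` package."""
    # Imported lazily so the helper below works without the package installed.
    # Assumption: `pip install openai-whisper` and ffmpeg available on PATH.
    import whisper

    model = whisper.load_model(model_name)  # downloads weights on first use
    # The result is a dict that includes "text", "segments", and "language".
    return model.transcribe(path)


def format_segments(segments: List[Dict]) -> str:
    """Hypothetical helper: render each segment as '[start-end] text'."""
    lines = []
    for seg in segments:
        start, end = seg["start"], seg["end"]  # times in seconds
        lines.append(f"[{start:.1f}s-{end:.1f}s] {seg['text'].strip()}")
    return "\n".join(lines)
```

A typical call would be `result = transcribe("audio.mp3")`, followed by `print(result["text"])` for the full transcript or `print(format_segments(result["segments"]))` for a timestamped view. Larger model names generally trade speed for accuracy.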