Difference between revisions of "Speech Recognition"
m |
m |
||
| Line 10: | Line 10: | ||
* [[Capabilities]] | * [[Capabilities]] | ||
* [[End-to-End Speech]] | * [[End-to-End Speech]] | ||
| + | * [[Assistants]] ... [[Hybrid Assistants]] ... [[Agents]] ... [[Negotiation]] | ||
* [[Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM)]] | * [[Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM)]] | ||
* [http://www.theverge.com/2022/9/23/23367296/openai-whisper-transcription-speech-recognition-open-source Iused] | * [http://www.theverge.com/2022/9/23/23367296/openai-whisper-transcription-speech-recognition-open-source Iused] | ||
Revision as of 07:44, 21 February 2023
YouTube search... ...Google search
- Capabilities
- End-to-End Speech
- Assistants ... Hybrid Assistants ... Agents ... Negotiation
- Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM)
- Iused
Whisper Automatic Speech Recognition Service (ASR)
YouTube search... ...Google search
A Conversation with Philip Hodgetts from Lumberjack System Whisper is an automatic speech recognition (ASR) system by OpenAI trained on 680,000 hours of multilingual and multitask supervised data collected from the web.
In this episode, I speak with Philip Hodgetts, founder of Intelligent Assistance, all about the OpenAI Whisper Automatic Speech Recognition Service (ASR). We’ll learn what it is, what are its use cases, how he’s using it in his app, how you can use it in your app or service and much more!