OpenAI Whisper Transcription
This cluster focuses on recommendations for OpenAI's Whisper model and implementations such as whisper.cpp for accurate, local speech-to-text transcription, with comparisons to alternatives such as Vosk and DeepSpeech.
Sample Comments
If you aren't already using OpenAI Whisper for this, you should probably try it.
whisper is very good, I wouldn't use anything else personally. it's free and open, you can run it locally. it makes very few errors (for my own voice, anyway) https://github.com/openai/whisper
Whisper democratises high-quality transcription of the languages I personally care about, whether using a CPU or a GPU. It's FOSS, self-hostable, and very easy to use. That's a big deal for someone who wants to build something using speech recognition (voice assistant, transcription of audio or video) without resorting to APIs.
Have you tried faster-whisper and whisper.cpp?
Nice! Are you aware that whisper.cpp has a WASM version as well? See https://github.com/ggerganov/whisper.cpp/tree/master/example... - demo at https://whisper.ggerganov.com/
Been working on https://handy.computer. It's similar to SuperWhisper and others, but is open source and cross-platform. It uses whisper.cpp under the hood and should be accelerated on most devices using the Vulkan backend.
whisper from OpenAI works great for me.
There is a great library that supports not only OpenAI's Whisper but also many other engines that work offline. https://github.com/Uberi/speech_recognition
Is this just a front end to OpenAI's Whisper? https://github.com/openai/whisper
Vosk is my favourite. I have used DeepSpeech too. Vosk works better.
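Several of the comments above recommend running Whisper locally. For readers who want to try that, here is a minimal sketch, assuming the `openai-whisper` Python package from https://github.com/openai/whisper and ffmpeg are installed. The helper names (`format_timestamp`, `segments_to_lines`, `transcribe_file`) and the file name `audio.mp3` are hypothetical and do not come from any of the comments; only `whisper.load_model` and `model.transcribe` belong to the package.

```python
from typing import Any


def format_timestamp(seconds: float) -> str:
    """Render a time offset in seconds as HH:MM:SS.mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def segments_to_lines(segments: list[dict[str, Any]]) -> list[str]:
    """Turn Whisper-style segments (start, end, text) into readable lines."""
    return [
        f"[{format_timestamp(s['start'])} -> {format_timestamp(s['end'])}] "
        f"{s['text'].strip()}"
        for s in segments
    ]


def transcribe_file(path: str, model_name: str = "base") -> list[str]:
    """Transcribe one audio file locally and return timestamped lines."""
    # Imported lazily so the helpers above work without the package installed.
    import whisper  # assumption: installed via `pip install openai-whisper`

    model = whisper.load_model(model_name)  # downloads weights on first use
    result = model.transcribe(path)         # dict with "text" and "segments"
    return segments_to_lines(result["segments"])


# Example call (hypothetical file name):
# for line in transcribe_file("audio.mp3"):
#     print(line)
```

Smaller models such as `base` trade accuracy for speed; the whisper.cpp and faster-whisper projects mentioned above are alternative runtimes with their own interfaces, which this sketch does not cover.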