So I decided to see if I could vibe code it with 0 macOS app & Swift experience.
It uses a local binary of whisper.cpp (a fast implementation of OpenAI's Whisper voice-to-text model in C++).
Github: https://github.com/richardwu/openwhisper
I also decided to take this as an opportunity to compare 3 agentic coding harnesses:
Cursor w/ Opus 4.6: - Best one-shot UI by far - Didn't get permissioning correct - Had issues making the "Cancel recording" hotkey being turned on all the time
Claude Code w/ Opus 4.6: - Fewest turns to get main functionality right (recording, hotkeys, permissions) - Was able to get a decent UI with a few more turns
Codex App w/ Codex 5.3 Extra-High: - Worst one-shot UI - None of the functionality worked without multiple subsequent prompts
Which is a voice to text app as well.
https://github.com/Starmel/OpenSuperWhisper
Though it seems like the dev stopped work on it
which beats the pants off android or iphone transcription...
I clicked hoping it would leverage whisper’s translation capabilities as well, since MacWhisper does not do that (it leverages Apple’s subpar built in translation). It doesn’t do that either.
(Maybe you’re using this for dictation? That’s a very specific subset of transcription. I’d suggest using a name that leans into that.)