Knowledgeable privacy aficionados of Lemmy, perhaps you can help.
I'm searching for a U.S. English speech to text program I can use for note taking, dictation, and internet searching that runs locally on Windows and doesn't collect information or send it off to either the software company or third parties. I'm looking for an out-of-the-box easy option first- if needed I can explore writing scripts and using an LLM to craft a UI, but I'm not looking for something that would require a significant amount of extra building or coding. Ideally it'd be FLOSS and be light on compute, but I'm not averse to paying for a solid product that meets the privacy requirement and if it's not ludicrously heavy on compute, that's okay.
Vosk seems a good option, though in my brief exploration, I haven't found a UI or scripts to use it easily.
WhisperAI, while very accurate, doesn't natively support real-time speech to text, though there are some mods that try and address that.
Anything I'm completely missing?
Cory Wong! Band is as tight as can be. Vibe is playful, dynamic, and celebratory of all on stage.
Have seen him live only once, but he posts incredibly well-mixed and edited full-length concerts on his YouTube.
He fully deserves his blowup. Incredible musicianship abounds around him