
That's a fun idea. How do you get the transcript into Claude Code (or whatever you use)? What transcription service do you use?


I'm not the person you're replying to, but I use Whispering connected to the whisper-large-v3-turbo model on Groq.

It's incredibly cheap and works reliably for me.

I have got it to paste my voice transcriptions into Chrome (Gemini, Claude, ChatGPT) as well as Cursor.

https://github.com/EpicenterHQ/epicenter


I use Handy with Claude Code. It's nice to have a single key combo that transcribes into whatever has focus.

https://github.com/cjpais/Handy


Love Handy. I use it too when dealing with LLMs. The other day I asked ChatGPT to generate interview questions based on a job description, and then I answered them using Handy. So cool!


I use Spokenly with local Parakeet 0.6B v3 model + Cerebras gpt-oss-120b for post-processing (cleaning up transcription errors and fixing technical mondegreens, e.g., `no JS` → `Node.js`). Almost imperceptible transcription and processing delay. Trigger transcription with right ⌥ key.
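To illustrate the kind of cleanup described above, here is a minimal sketch that swaps in a plain lookup table for the LLM post-processing step. The `CORRECTIONS` table and the `fix_mondegreens` helper are hypothetical names made up for this example, and only the `no JS` → `Node.js` pair comes from the comment; the other entry is illustrative.

```python
import re

# Hypothetical correction table: misheard phrase -> intended technical term.
# Only the first pair comes from the comment above; the second is illustrative.
CORRECTIONS = {
    "no JS": "Node.js",
    "pie torch": "PyTorch",
}


def fix_mondegreens(text, corrections=CORRECTIONS):
    """Replace each misheard phrase with its intended term (case-insensitive)."""
    for wrong, right in corrections.items():
        # Word boundaries keep the match from firing inside longer words.
        pattern = re.compile(r"\b" + re.escape(wrong) + r"\b", re.IGNORECASE)
        # A function replacement avoids any backslash handling in `right`.
        text = pattern.sub(lambda _match, r=right: r, text)
    return text


if __name__ == "__main__":
    print(fix_mondegreens("First install no JS, then run the script."))
```

A fixed table like this cannot tell a misheard "Node.js" from a sentence that really says "no JS", which is presumably where a context-aware model earns its keep.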


According to Google this is the first time the phrase "technical mondegreens" was ever used. I really like it.


Your OS might have a built-in dictation feature. Google for that and try it before reaching for online services.


I use the Raycast + Whisper Dictation. I don't think there is anything novel about it, but it integrates nicely into my workflow.

My main gripe is when the recording window loses focus, I haven't found a way to bring it back and continue the recorded session. So occasionally I have to start from scratch, which is particularly annoying if it happens during a long-winded brain dump.


I built my own open-source tool to do exactly this: I run something like `claude $(hns)` in my terminal and start speaking, and once I'm done, Claude receives the transcript and starts working. See the workflow here: https://hns-cli.dev/docs/drive-coding-agents/
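For anyone unfamiliar with the `$(...)` part: the shell runs the inner command first and hands its standard output to the outer command as arguments. Here is a rough Python sketch of that mechanism, not of how `hns` itself works. The `build_agent_command` helper is a hypothetical name, the transcriber is faked with a one-line Python process, and the transcript is passed as a single argument (closer to the quoted form `claude "$(hns)"`).

```python
import subprocess
import sys


def build_agent_command(transcriber_cmd, agent_cmd):
    """Run the transcriber, capture its stdout, and append it as one argument."""
    result = subprocess.run(
        transcriber_cmd, capture_output=True, text=True, check=True
    )
    # Command substitution drops trailing newlines; strip() approximates that.
    transcript = result.stdout.strip()
    return agent_cmd + [transcript]


if __name__ == "__main__":
    # Stand-in for a real transcriber: a child process that prints fixed text.
    fake_transcriber = [
        sys.executable, "-c", "print('add a retry to the fetch helper')"
    ]
    # Only builds the argument list; it does not launch any agent.
    print(build_agent_command(fake_transcriber, ["claude"]))
```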


There are a few apps nowadays for voice transcription. I've used Wispr Flow and Superwhisper, and both seem good. You can map a hotkey (e.g., ctrl + windows) to start recording; when you press it again to stop, the text gets pasted into whatever text box you have open.

Superwhisper offers some AI post-processing of the text (e.g., formatting it as bullets or fixing grammar), but this doesn't seem necessary and just makes things a bit slower.


+1 for Superwhisper. It has an offline model for transcription. And it transcribes with very high accuracy for me and great speed.


I made this tool: double-press control to start recording and press ctrl again to stop, which copies the transcript to the clipboard.

https://github.com/elv1n/para-speak/


So cool, man! Had to add a couple of fixes to be able to use it on Mac. Love it!


I do the same. On Mac I use MacWhisper. The transcription does not have to be correct. It often writes the wrong word when I'm talking about technical stuff, but Claude understands which word I mean from context.


I use VoiceInk (needed some patches to get it to compile but Claude figured it out) and the Parakeet V3 model. It’s really good!



