Show HN: Push-to-talk dictation for Android apps and terminal workflows

I built this because MacWhisper is not available on Android and voice typing on Android is pretty bad. Moreover Gemini does not allow you to edit transcripts before they are auto-sent.

I like my SwiftKey keyboard though, so I did not want to replace that. So the only way was to make a floating push-to-talk button on top of any app.

You tap the overlay, speak, tap again, transcribe, and insert text into the currently focused field.

It supports local on-device transcription, cloud transcription with your own OpenAI key, and optional post-processing/cleanup for punctuation, formatting, prompts, commands, etc.

A nice use case for me has been Termux / terminal workflows on Android. You have a "dev mode" where you can just say "command mode" and anything after it will be converted into a proper CLI command.

The app is open source. No backend — in cloud mode requests go directly from the phone to OpenAI using the user's own API key.

Repo: https://github.com/kafkasl/phone-whisper APK: https://github.com/kafkasl/phone-whisper/releases

1 points | by pol_avec 2 hours ago

0 comments