The Parakeet engine with the Apple Neural Engine enables fast, accurate transcription: ~300 ms per sentence, ~700 ms per paragraph, and about 20 seconds per hour of audio.
Transcribe audio files in MP3, WAV, M4A, FLAC, and OGG formats via drag and drop.
Full transcription history with searchable text.
Recording overlay, start-minimized option, and hold-to-talk mode.
This issue has been reproduced and fixed in version 1.0.44, now available on the website. Note that updating the app causes the models to be recompiled, so the app may briefly hang during recording. Each new app version produces a new hash, and a new hash triggers recompilation of the AI (MLX) code.
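To illustrate why an update causes a one-time recompilation, here is a minimal sketch of a compiled-model cache keyed by a version-derived hash. It is not the app's actual code: `cache_key`, `CompiledModelCache`, the SHA-256 key scheme, and the string standing in for a compiled model are all hypothetical, and Python is used only for illustration.

```python
import hashlib


def cache_key(app_version: str, model_name: str) -> str:
    """Derive a key that changes whenever the app version changes (assumed scheme)."""
    payload = f"{app_version}:{model_name}".encode("utf-8")
    return hashlib.sha256(payload).hexdigest()[:16]


class CompiledModelCache:
    """Hypothetical cache: compiles once per (version, model) key, then reuses."""

    def __init__(self) -> None:
        self._store: dict[str, str] = {}
        self.compile_count = 0

    def get(self, app_version: str, model_name: str) -> str:
        key = cache_key(app_version, model_name)
        if key not in self._store:
            # Cache miss: this is the slow step that can look like a brief hang.
            self.compile_count += 1
            self._store[key] = f"compiled({model_name})@{app_version}"
        return self._store[key]


cache = CompiledModelCache()
cache.get("1.0.43", "parakeet")  # first use: compiles
cache.get("1.0.43", "parakeet")  # same version: served from cache
cache.get("1.0.44", "parakeet")  # new version -> new key -> compiles again
```

In this sketch the old entry is never reused after an update because its key no longer matches, which is why the delay shows up once per new version rather than on every launch.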
Sorry about that. This feature is no longer available because it was cloud-based, but I plan to bring it back. The video was recorded before the app was open-sourced, so I’ll remove it to avoid confusion.
Update: Replaced the site video with a demo showing the actual app behavior.