whisper-alternative

Here are 18 public repositories matching this topic...

modelscope / FunASR

Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

Updated Jun 19, 2026
Python

FunAudioLLM / SenseVoice

Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.

multilingual python pytorch audio-analysis speech-recognition speech-to-text asr emotion-detection cross-lingual speech-emotion-recognition voice-ai llm audio-event-detection whisper-alternative

Updated Jun 19, 2026
Python

FunAudioLLM / Fun-ASR

End-to-end speech recognition large model: 31 languages, dialects, accents, lyrics, hotwords, timestamps, speaker diarization. Trained on tens of millions of hours.

pytorch speech-recognition speech-to-text transcription asr speaker-diarization chinese-dialects real-time-asr audio-language-model multilingual-asr fun-asr whisper-alternative 31-languages llm-asr

Updated Jun 19, 2026
Python

appautomaton / tnt-asr

Terminal voice-to-text TUI — Qwen3-ASR-1.7B on the Apple GPU via MLX (mlx-speech). Fully local, no PyTorch, transcribes in ~1s. macOS Apple Silicon.

python macos terminal tui speech-recognition speech-to-text dictation asr mlx voice-to-text on-device-ai apple-silicon qwen whisper-alternative

Updated Jun 11, 2026
Python

hasso5703 / talkink

Talk. Ink. — Push-to-talk dictation for macOS, 100% on-device. Pick your model: Qwen3-ASR, NVIDIA Nemotron or Voxtral, all via Apple MLX.

macos swift open-source privacy speech-recognition speech-to-text dictation asr mlx swiftui on-device-ai apple-silicon qwen nemotron voxtral whisper-alternative

Updated Jun 16, 2026
Swift

felixfu824 / HushType

Local voice-to-text for macOS and iOS. Multilingual (EN/ZH/JP) with Traditional Chinese output. Runs Qwen3-ASR on Apple Silicon via MLX. No cloud, no subscription.

Updated Jun 10, 2026
Swift

Vincent-WenZX / CWX-Transcribe

Production pipeline around OpenAI gpt-4o-transcribe-diarize for long-form 2-speaker interviews. Cross-chunk speaker consistency · diarization hallucination fix · async GPT-5.5 domain-term correction. WER 6.05% / DER 4.28% on 2h26m benchmark. Beats raw OpenAI API by +11.5 Q.

Updated May 6, 2026
Python

199-biotechnologies / textstream-asr

Live speech-to-text streaming on Apple Silicon — Qwen3-ASR + Silero VAD + MLX

python macos speech-recognition server-sent-events speech-to-text transcription asr mlx voice-activity-detection live-captions on-device-ai apple-silicon silero-vad real-time-transcription local-ai offline-asr qwen3-asr streaming-transcription whisper-alternative

Updated Mar 30, 2026
Python

charles1018 / NemoScribe

🎬 AI subtitle generator: convert video to SRT subtitles locally with NVIDIA NeMo Parakeet-TDT speech-to-text. GPU-accelerated, word-level timestamps, VAD, LLM correction — a fast offline Whisper alternative.

Updated Jun 11, 2026
Python

tristan-mcinnis / local-dictation

Free, private, on-device dictation for macOS (Apple Silicon). Push-to-talk speech-to-text with on-device LLM cleanup — an offline, local alternative to cloud & Whisper dictation. Parakeet TDT v3 ASR + Qwen 2.5 1.5B cleanup + macOS Accessibility injection. Pure Rust, ~300–400 ms per utterance, nothing leaves your Mac.

Updated Jun 4, 2026
Rust

briancaffey / nemotron-asr-server

OpenAI-compatible speech-to-text server for nvidia/nemotron-3.5-asr-streaming-0.6b (NeMo). Runs on the DGX Spark / GB10.

nvidia speech-to-text transcription nemo asr fastapi openai-api nemotron dgx-spark whisper-alternative

Updated Jun 8, 2026
Python

bykcyc / Cadence

Private, local-first meeting recorder + transcription, diarization, AI notes, voice dictation & read-aloud for Windows — runs on your own GPU.

Updated Jun 16, 2026
TypeScript

josuebustosn / gemini-transcribe

Claude Code skill that transcribes audio and video to structured Markdown with timestamps and diarization, using Google Gemini. A free replacement for ElevenLabs Scribe / Whisper for anyone already paying for Gemini.

python gemini spanish speech-to-text transcription diarization google-gemini claude-code claude-skill elevenlabs-alternative whisper-alternative

Updated May 26, 2026
Python

Trust-1-eng / transcription-studio

Local FastAPI transcription studio: AssemblyAI Universal-2 (99 lang), FFmpeg, yt-dlp, Word/PDF/ZIP export

python ai speech-to-text transcription fastapi yt-dlp assemblyai whisper-alternative

Updated Jun 6, 2026
Python

Onigam / nemotron-local-stt

100% local speech recognition on Apple Silicon Macs (NVIDIA Nemotron 3.5 ASR + parakeet.cpp): live microphone or file transcription, .txt/.srt output.

macos offline speech-to-text asr parakeet apple-silicon local-ai nemotron whisper-alternative

Updated Jun 18, 2026
Python

ashish8485 / talkink

Transcribe your voice directly to text using local MLX models on macOS. Press a key to dictate and paste text instantly while keeping your data private.

macos slack chat swift open-source speech-recognition speech-to-text asr twist on-device-ai apple-silicon kontenbase talkink nemotron voxtral whisper-alternative

Updated Jun 17, 2026
Swift

agentseo / gigaam-transcribe

Offline transcription of Russian video/audio into text and subtitles (GigaAM v3 + sherpa-onnx), native on Windows x64/ARM64.

subtitles russian speech-to-text transcription asr sherpa-onnx gigaam whisper-alternative

Updated Jun 17, 2026
PowerShell

karandeepbhardwaj / Yapper

Voice-to-text desktop app that captures speech, refines transcripts with AI, and auto-pastes at your cursor

desktop-app windows macos rust productivity ai speech-to-text transcription dictation voice-to-text tauri whisper-alternative

Updated Jun 9, 2026
TypeScript
