Generate audiobooks from EPUBs, PDFs and text with synchronized captions.
-
Updated
May 25, 2026 - Python
Generate audiobooks from EPUBs, PDFs and text with synchronized captions.
🔊 Kokoro Web: Free AI text-to-speech, online or self-hosted, OpenAI compatible!
🎙️ VoxSherpa TTS Offline Neural Text-to-Speech Engine for Android ⚡ Sherpa-ONNX powered 🔊 Natural voice synthesis 📱 Fully offline processing 🚀 No cloud • No limits
Natural-sounding Text-to-Speech App that fits anywhere. Fast, Real-Time and flexible.
From-scratch voice agents in Python: end-to-end speech pipelines, runnable chapters, and a small shared library. Local models, explicit streaming behavior.
A Docker container for running Kokoro Text-to-Speech engine v.1, providing high-quality speech synthesis
A Python package that makes it easy to use the Kokoro voice synthesis library.
TTS toolkit built on Kokoro-82M with librosa audio enhancement, MCP server for Claude/Cursor, CLI & Python API. Free & open-source for YouTube creators.
This tool allows users to create Anki cards with words, meanings, examples, and IPA pronunciations, and convert text to speech for audio files.
Production-ready RunPod serverless endpoint for Kokoro TTS. Features high-quality text-to-speech, voice mixing, word-level timestamps, and phoneme generation. Optimized for fast cold starts and auto-scaling.
🚀 Aperture is a modern, feature-rich desktop EPUB reader built with Python 🐍 and PyQt6. It focuses on a clean reading experience ✨ and powerful, integrated Kokoro Text-to-Speech (TTS) capabilities 🗣️🔊
An advanced AI mental health assistant that combines voice interaction, fine-tuned psychology models, and intelligent knowledge retrieval to provide comprehensive psychological support.
create audio books from pdfs with one click , available on windows , linux, mac
📚 Index and enrich your PDFs and Markdown files locally for a powerful, unified knowledge base with semantic search capabilities.
A powerful, local-first AI orchestration layer that unifies advanced LLM reasoning, real-time voice synthesis, and local system automation. Featuring a Flask-based web interface, this system uses a smart Gemini API rotation strategy with a Groq fallback.
TTS Fast Web,一个简单优雅的本地文字转语音的前端与API接口。A localized, cross-platform, multi-language supported, OpenAI API format compatible, full-stack, ready-to-deploy TTS (Text to Speech) model
Offline Kokoro-82M text-to-speech for Python — library, CLI, and a unix-socket daemon for ~13ms speech from shell scripts. Apache 2.0, CPU real-time, macOS and Linux.
A lightweight, offline Rust inference library for Kokoro TTS - an 82M-parameter open-weights text-to-speech model.
Text-to-speech web application built with React, FastAPI, and Kokoro-82M. Runs locally via start.bat.
Local Kokoro-82M text-to-speech CLI
Add a description, image, and links to the kokoro-82m topic page so that developers can more easily learn about it.
To associate your repository with the kokoro-82m topic, visit your repo's landing page and select "manage topics."