You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
GenMedia Creative Studio is a Vertex AI generative media user experience highlighting the use of Gemini, Veo, Gemini Image 🍌, Gemini TTS, Chirp 3, Lyria and other generative media APIs on Google Cloud.
Take 3 on the bounce gives three expressive voice readings - experience Gemini 3.1 Flash TTS and experiment with speech expressivity, pacing and delivery.
Audiobook Generator - A TypeScript CLI tool for creating audiobooks from scripts using Google Gemini 2.5 Pro Text-to-Speech API with multi-speaker support (manual stitching)
Upload a cricket clip — AI infers the match situation using Gemini Vision, generates emotionally tagged radio commentary, and returns your video with a voiced audio track powered by Gemini 3.1 Flash TTS.
AI-powered translation app with Next.js frontend and Java JEE backend using Google Gemini for multilingual translation, meanings, alternatives, and TTS.
alat Command Line Interface (CLI) berbasis Python yang dirancang untuk menghasilkan audio text-to-speech (TTS) berkualitas tinggi menggunakan API Gemini.
Generate structured courses (overview, tutorial, glossary, modules, quizzes, podcast script + audio, interactive HTML page) about any GitHub repository using Claude Code subagents and Gemini TTS.
Provider-agnostic TTS gateway and CLI for AI coding harnesses: ElevenLabs, OpenAI, Gemini, xAI, Voicebox, system TTS. Works with Claude Code, OpenCode, Codex, Cursor and Pi out of the box