AI API gateway that ends manual channel switching with smart routing, auto failover, exponential cooldown, multi-URL scheduling, live request monitoring and soft-error detection.
-
Updated
Jun 19, 2026 - Go
AI API gateway that ends manual channel switching with smart routing, auto failover, exponential cooldown, multi-URL scheduling, live request monitoring and soft-error detection.
See what Claude Code and Codex actually send to the API — and what each part costs.
TokenMap is a desktop app for treemap-based codebase analysis by tokens, size, complexity, hotspots, and refactor priority
🚀 Intelligent Claude Code status line with multi-provider AI support, real-time token counting, and universal model compatibility. Supports Claude (Sonnet 4: 1M, 3.5: 200K), OpenAI (GPT-4.1: 1M, 4o: 128K), Gemini (1.5 Pro: 2M, 2.x: 1M), and xAI Grok (3: 1M, 4: 256K) with verified 2025 context limits.
A local proxy that converts websites and APIs to clean Markdown. Convert HTML pages, JSON APIs, and dynamic sites. Get token counts for LLM budgeting.
Lightweight token tracking, cost management, and budget enforcement for LLM API calls
ttok-style token counting for Amazon Bedrock
A high-performance, multi-agent observability engine designed for the Model Context Protocol (MCP). It provides a non-blocking, transparent proxy layer that implements deterministic token attribution, real-time context-window alerting, and heuristic-driven static analysis to optimize LLM metadata overhead at scale.
Token Optimization for Context Engineers. 4.8 KB WASM. Sub-millisecond. Zero dependencies.
.NET library for accurate token counting, cost calculation, and session-based usage tracking across 12 LLM providers including OpenAI, Anthropic, Google, Azure, and more.
Open-source LLM FinOps proxy — track OpenAI, Anthropic (Claude), and Google Gemini costs by feature, team, and customer. Zero code changes. pip install burnlens.
VS Code extension + MCP server that validates prompts, routes to the cheapest LLM, and projects token × turn cost before the call. Cuts Copilot premium-request burn on agent loops.
A blazing-fast BPE tokenizer for LLMs. Drop-in tiktoken replacement, 20-80x faster.
Compare LLM API costs instantly. npx llm-costs "your prompt" --compare across 17 models. Auto-updating pricing.
⚡ Production-grade real-time AI cost enforcement system. Sub-5ms balance checks, atomic operations, gRPC + REST APIs. Stop AI overages before they happen.
Production-grade PowerShell module bundler. Integrated with Pester for automated testing and GitHub Actions for stable CI/CD pipelines.
Real-time cost middleware for LLM APIs. Set budgets, get alerts, stop runaway Claude/OpenAI/Gemini spend.
Universal LLM token counting and cost management. Track spending, set budgets, compare providers.
Intelligent conversation compaction for LLM applications. Never hit context window limits again. Works with any LLM provider.
Add a description, image, and links to the token-counting topic page so that developers can more easily learn about it.
To associate your repository with the token-counting topic, visit your repo's landing page and select "manage topics."