cost-reduction

Here are 82 public repositories matching this topic...

rtk-ai / rtk

CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies

rust cli productivity open-source developer-tools command-line-tool llm cost-reduction anthropic ai-coding claude-code token-optimization agentic-coding

Updated Jun 18, 2026
Rust

zdk / lowfat

Star

lowfat - slim your command output. strips noise, saves tokens.

rust cli open-source developer-tools shell-script llm cost-reduction token-optimization agentic-coding-tool token-savings token-saving

Updated Jun 18, 2026
Rust

fajarhide / omni

Sponsor

Star

The Context OS for Autonomous AI Agents. Distill terminal noise into pure semantic signal, stop agent hallucinations, and cut token costs by up to 90%.

rust cli homebrew hooks mcp ai-agents cost-reduction token-reduction efficiency-tools antigravity context-distillation claude-code token-optimization token-efficiency token-savings

Updated Jun 15, 2026
Rust

flightlesstux / prompt-caching

Star

Automatic prompt caching for Claude Code. Cuts token costs by up to 90% on repeated file reads, bug fix sessions, and long coding conversations - zero config.

typescript mcp developer-tools claude llm cost-reduction prompt-caching anthropic claude-code token-optimization

Updated Jun 12, 2026
TypeScript

borhen68 / TokenTamer

Star

A drop-in proxy that compresses bloated code context in real-time, cutting LLM API costs by 50–80% without losing what the model actually needs to know.

python proxy openai developer-tools llm cost-reduction anthropic context-compression token-optimization ai-coding-agent

Updated Jun 15, 2026
Python

AssafWoo / homebrew-pandafilter

Star

The context intelligence layer for AI coding agents. Compressing noise, routing content to the right strategy, preserving session state across compactions, and surfacing the files that actually matter.

Updated Jun 12, 2026
Rust

fkiene / llmtrim

Star

Local proxy that compresses your LLM API requests so you pay less, with no change to the answers. Trims wasted tokens from prompts, history, tool output, and code before they're sent: -31% input / -74% output, measured live. Any provider, no extra model calls. Also an MCP server and embeddable library (Rust, Python, Ruby, Kotlin, Swift).

rust ai proxy mcp prompt openai developer-tools mitm-proxy llm prompt-engineering cost-reduction llmops anthropic prompt-compression claude-code token-optimization agentic-coding

Updated Jun 18, 2026
Rust

web-werkstatt / ai-context-optimizer

Star

💰 Save money on AI API costs! 76% token reduction, Auto-Fix token limits, Universal AI compatibility. Cline • Copilot • Claude • Cursor

Updated Jun 18, 2025

kalibr-ai / kalibr-sdk-python

Star

Stop overpaying to run your agents. Kalibr routes every request to lower-cost model and tool paths without degrading performance.

Updated Jun 3, 2026
Python

SuppieRK / ccp

Star

CLI proxy for coding agents that cuts noisy terminal output while preserving command behavior

go cli productivity open-source terminal opencode developer-tools command-line-tool codex llm cost-reduction token-reduction ai-coding claude-code agentic-coding

Updated Jun 18, 2026
Go

Sagargupta16 / claude-cost-optimizer

Sponsor

Star

Save 30-60% on Claude Code costs -- proven strategies, real benchmarks, copy-paste configs, and interactive tools

best-practices developer-tools claude cost-optimization ai-tools ai-development llm prompt-engineering cost-reduction anthropic ai-coding claude-code token-optimization

Updated Jun 17, 2026
TypeScript

i5heu / bonito-cache

Star

Just hook it in front of your public S3 bucket and enjoy reduction in bandwidth costs from your bucket

cdn cache s3 cost-reduction

Updated Feb 25, 2023
Go

KathanModh259 / latent-gate

Star

VL-JEPA inspired pipeline — compress images/text locally via Ollama, send compact payloads to any LLM API. Cut token costs by ~80%.

python ai computer-vision python3 gemini openai embedding claude multimodal vision-language cost-reduction local-llm ollama llm-pipeline prompt-compression token-optimization selective-decode vl-jepa api-cost

Updated Jun 17, 2026
Python

amahi2001 / python-token-killer

Star

Minimize LLM tokens from Python objects, code, logs, diffs, and more. Zero deps. Ultra-Lightweight.

python ai developer-tools agents llm cost-reduction llm-tools agentic-workflow token-optimization

Updated Jun 8, 2026
Python

joe-l-mathew / kube-resource-suggest

Star

A Kubernetes resource recommender that extends the API server to provide native suggestions.

kubernetes devops capacity-planning prometheus k8s autoscaling finops resource-optimization oom-killer kubernetes-tools cost-reduction rightsizing kubelet-metrics

Updated Dec 12, 2025
Go

umitkacar / llm-context-optimizer

Star

Biological code organization system with 1,029+ production-ready snippets - 95% token reduction for Claude/GPT with AI-powered discovery & offline packs

Updated Nov 10, 2025
Python

pghoshal / LeanNodes

Star

Solves Cold Start problem & saves upto 90% cost for EKS. On demand Dynamic service provisioning for business and Enterprise. CPU, GPU & AI Workloads

workflow ingress gateway openid-connect agents ai-agents cost-optimization eks approval-workflow cost-reduction agentic-workflow agentic-ai ai-workload

Updated May 4, 2026
Go

robhowley / pi-structured-return

Star

Pi extension that turns noisy CLI output into compact structured results - fewer tokens, full logs preserved.

pi cost-reduction token-optimization pi-coding-agent pi-extension

Updated May 1, 2026
TypeScript

Adrijan-Petek / gas-fee-optimizer

Star

Small utility that polls RPC endpoints for Base / Optimism / Arbitrum, writes timestamped JSON reports into `reports/`, and can post to a webhook.

Updated Jan 23, 2026
JavaScript

Paja73 / claude-auto-api

Star

Claude Code settings.json auto-config tool to quickly switch API_KEY, AUTH_TOKEN, and model configs across multi-model setups. Secure backup and desensitized previews. 🐙

python scraper ai chatbot selenium assistant free summarizer documented large-file-upload github-copilot auto-fix cost-reduction claude-ai long-text claude3 cursor-editor claude-code

Updated Jun 16, 2026
JavaScript

Improve this page

Add a description, image, and links to the cost-reduction topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the cost-reduction topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cost-reduction

Here are 82 public repositories matching this topic...

rtk-ai / rtk

zdk / lowfat

fajarhide / omni

flightlesstux / prompt-caching

borhen68 / TokenTamer

AssafWoo / homebrew-pandafilter

fkiene / llmtrim

web-werkstatt / ai-context-optimizer

kalibr-ai / kalibr-sdk-python

SuppieRK / ccp

Sagargupta16 / claude-cost-optimizer

i5heu / bonito-cache

KathanModh259 / latent-gate

amahi2001 / python-token-killer

joe-l-mathew / kube-resource-suggest

umitkacar / llm-context-optimizer

pghoshal / LeanNodes

robhowley / pi-structured-return

Adrijan-Petek / gas-fee-optimizer

Paja73 / claude-auto-api

Improve this page

Add this topic to your repo