context-compression

Here are 128 public repositories matching this topic...

open-compress / claw-compactor

14-stage Fusion Pipeline for LLM token compression — reversible compression, AST-aware code analysis, intelligent content routing. Zero LLM inference cost. MIT licensed.

Updated Apr 1, 2026
Python

manojmallick / sigmap

97% token reduction for AI coding sessions — zero deps, 31 languages, MCP server

Updated Jun 19, 2026
JavaScript

juyterman1000 / entroly

Cut your Claude / OpenAI / Gemini bill 70–95% on AI coding. Local proxy that compresses context, keeps provider caches hot, and verifies LLM output ($0 hallucination guard). Drop-in for Cursor, Claude Code, Codex, Aider + 34 more and custom providers — 30s, no code changes

rust productivity open-source ai mcp cursor ai-agents claude rag llm chatgpt anthropic hallucination-detection context-compression mcp-server claude-code token-optimization llm-grounding ai-hallucination

Updated Jun 17, 2026
Python

LearnPrompt / cc-harness-skills

Portable CC-inspired skills for memory, verification, multi-agent coordination, context compression, and proactive coding-agent workflows.

multi-agent developer-tools codex ai-agent prompt-engineering context-compression agent-memory coding-agent agent-harness openclaw

Updated Jun 12, 2026
Python

borhen68 / TokenTamer

A drop-in proxy that compresses bloated code context in real-time, cutting LLM API costs by 50–80% without losing what the model actually needs to know.

python proxy openai developer-tools llm cost-reduction anthropic context-compression token-optimization ai-coding-agent

Updated Jun 15, 2026
Python

jeffreysijuntan / lloco

The official repo for "LLoCo: Learning Long Contexts Offline"

pytorch finetune llm long-context context-compression

Updated Jun 15, 2024
Python

agiwhitelist / tokdiet

Local streaming reverse proxy between AI coding agents (Claude Code, Cursor, Codex) and model APIs (Anthropic, OpenAI, Gemini, MiniMax). Meters every token + USD cost, compacts bloated context to cut pay-per-token API spend, and runs shadow-eval to prove quality held. ccusage-style metering + live local dashboard.

cli typescript reverse-proxy gemini openai observability cost-optimization llm anthropic openai-proxy token-counter llm-proxy cost-tracking ai-gateway context-compression llm-gateway claude-code ccusage context-engineering

Updated Jun 18, 2026
TypeScript

snu-mllab / Context-Memory

PyTorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)

efficient-llm-inference context-compression kv-cache-compression

Updated Apr 18, 2024
Python

PCIRCLE-AI / toonify-mcp

Context compression plugin for Claude Code. Trims large JSON, logs, stack traces, and source files before they enter the context window.

developer-tooling context-window context-compression model-context-protocol mcp-server claude-code token-optimization claude-code-plugin source-code-compression toon-format

Updated May 23, 2026
TypeScript

castnettech / mnemosyne

State-aware knowledge compression, ingestion, and hybrid retrieval engine. Zero dependencies. Sub-100ms queries.

python open-source developer-tools tfidf bm25 zero-dependencies code-retrieval llm context-compression token-optimization

Updated May 30, 2026
Python

HaShiShark / context-editor-agent

Cursor uses AI to edit code — we use AI to edit AI's context. 🪆 Context map + compression + version control for LLM context windows.

react python codex context-map context-visualization ai-agent context-management context-compression claudecode ai-context context-engineering context-editor

Updated Jun 14, 2026
Python

HoangP8 / tokless

A unified CLI to install and update token-saving plugins — RTK, Caveman, CodeGraph, and Context-Mode — for Claude Code, OpenCode, Codex, and Antigravity. Minimal setup. Any OS.

cli ai mcp opencode tokens developer-tools codex ai-agents context-window context-compression token-compression claude-code token-optimization ai-coding-agents context-window-optimization llm-cost-reduction

Updated Jun 19, 2026
Go

Adityapal67 / context-graph-compressor

Convert long AI conversations into portable conversation state graphs for LLM handoffs.

agent json compression ai context problem-solving age compression-algorithm llm prompt-engineering chatgpt claude-ai context-management context-compression context-engineering claude-skills claude-skill claude-skills-creator

Updated May 30, 2026

shouvik12 / trooper

LLM reliability layer: keeps agents alive with smart routing, context compaction, and local fallback.

go golang proxy fallback llm local-llm ollama llm-proxy ai-gateway context-compression

Updated Jun 18, 2026
Go

DJLougen / hive

Unified agent memory and context compression stack for 2026 NVIDIA + edge (Vera CPU, Grace, Jetson Thor, 3090). Glues busyBee-cpu, honey-comb, and rust-brain. Better effective reasoning per token.

agent memory nvidia grace jetson honeycomb edge-ai cpu-offload busybee llm context-compression

Updated Jun 17, 2026
Python

SonicBotMan / lobster-press

🦞 LobsterPress (龙虾饼) - Cognitive Memory System for AI Agents. A permanent memory engine for LLMs, grounded in cognitive science.

bash ai shell-script claude chatgpt context-compression token-optimization openclaw

Updated Jun 18, 2026
Python

Madhan230205 / token-reducer

⚡ Cut Claude token usage by 90%+ — free, open-source, local-first context compression for Claude Code. Hybrid RAG (BM25 + ONNX vectors), AST chunking, reranking. No API needed.

Updated May 2, 2026
Python

MarceloCaporale / codex-agent-mem

Local-first Model Context Protocol (MCP) memory layer for Codex CLI/Desktop, Claude Code, Gemini CLI, Qwen/DeepSeek/Ollama and agent workflows. SQLite + FTS5 compact context packs, token savings, read-only mode, no external memory server.

Updated May 7, 2026
Python

NodeNestor / claude-rolling-context

Rolling context compression for Claude Code — never hit the context wall. Auto-compresses old messages while keeping recent context verbatim. Zero config, zero latency. Works as a Claude Code plugin.

claude ai-agent anthropic context-window context-management prompt-compression context-compression llm-context ai-coding claude-code claude-code-plugin claude-code-extension rolling-context

Updated Jun 2, 2026
Python
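
The rolling approach described for claude-rolling-context (compress old messages, keep recent ones verbatim) can be illustrated with a minimal sketch. This is not that project's code: `Message`, `summarize`, and `roll_context` are hypothetical names, and the truncating summarizer is a stand-in assumption for whatever compression a real tool applies.

```python
from dataclasses import dataclass


@dataclass
class Message:
    role: str
    content: str


def summarize(messages: list[Message], max_chars: int = 80) -> str:
    """Hypothetical stand-in compressor: truncate each old message.

    A real tool would likely use a model or a smarter heuristic here.
    """
    parts = []
    for m in messages:
        text = " ".join(m.content.split())  # collapse whitespace
        if len(text) > max_chars:
            text = text[:max_chars].rstrip() + "..."
        parts.append(f"{m.role}: {text}")
    return "\n".join(parts)


def roll_context(messages: list[Message], keep_recent: int = 4) -> list[Message]:
    """Replace all but the last `keep_recent` messages with one summary."""
    if keep_recent <= 0:
        raise ValueError("keep_recent must be positive")
    if len(messages) <= keep_recent:
        return list(messages)  # nothing old enough to compress
    old, recent = messages[:-keep_recent], messages[-keep_recent:]
    summary = Message(
        role="system",
        content="Summary of earlier conversation:\n" + summarize(old),
    )
    # Recent messages are passed through unchanged (verbatim).
    return [summary] + recent
```

Calling `roll_context(history, keep_recent=4)` before each request would keep the prompt bounded at one summary plus four verbatim messages, at the cost of losing detail from older turns.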

pythondatascrape / engram

Local-first context compression for AI coding tools. One binary saves 85-93% of redundant tokens across every LLM call.

mcp developer-tools ai-tools llm context-compression claude-code token-optimization

Updated May 6, 2026
JavaScript
