The official repo for "LLoCo: Learning Long Contexts Offline" (Python, updated Jun 15, 2024)
Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)
Stop re-explaining your codebase to AI. Infinite speed memory + code graph for Claude Code & Codex CLI. 17 MCP tools, subagent protocol, hybrid search, TUI dashboard, crash recovery. Save 80-200K+ tokens/session.
Biological code organization system with 1,029+ production-ready snippets - 95% token reduction for Claude/GPT with AI-powered discovery & offline packs
Awesome list of papers on vision-based context compression
Rolling context compression for Claude Code — never hit the context wall. Auto-compresses old messages while keeping recent context verbatim. Zero config, zero latency. Works as a Claude Code plugin.
天将 | LLM agent runtime resource-control engine: token budget control, cost optimization, stability guard, multi-tool agent scheduling
OpenClaw low-memory optimization guide for resource-constrained servers (2GB RAM)
Exploring Context Compression techniques for token reduction. Fine-tuning LLMs for efficient text compression and reduced inference costs, analyzing the trade-offs with Q&A accuracy.
Exploring artificial compressed languages to improve efficiency, context usage, and cross-lingual unification in LLMs
LLM context compression proxy — 40-70% token savings, zero code changes
Detecting silent pivot substitution in LLMs under context compression
Retriever, Summarizer, and Reader for LLM open-domain question answering (ODQA) to increase information density
Contextomizer is an ultra-fast, deterministic library for transforming bloated tool outputs, raw APIs, documents, and messy logs into perfectly optimized context for AI Agents 🤖🚀
a technique for compressing verbose AI tool call outputs into concise summaries, reducing token consumption
Capture and compact long Google AI Studio sessions with multimodal image OCR and context handoff artifacts.
Infinite context for AI assistants using semantic compression and retrieval with Gemini
Agent memory runtime: short/long-term context, vector persistence, compression, and personalization primitives.
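Several of the projects above share one core idea: keep recent messages verbatim and compress older ones into a compact summary. A minimal, dependency-free sketch of that rolling scheme (all names and the naive truncation-based "summarizer" here are illustrative, not taken from any repo listed above):

```python
def compress_history(messages, keep_recent=4, max_summary_chars=80):
    """Rolling context compression: keep the last `keep_recent`
    messages verbatim and squash older ones into one summary entry.

    In a real system the summary would come from an LLM or a
    retrieval index; here a naive truncation stands in for it.
    """
    if len(messages) <= keep_recent:
        return messages
    old, recent = messages[:-keep_recent], messages[-keep_recent:]
    # Naive "summary": a short snippet of each old message, joined
    # and capped at max_summary_chars.
    summary = " | ".join(m[:20] for m in old)[:max_summary_chars]
    return [f"[summary of {len(old)} earlier messages: {summary}]"] + recent

history = [f"message {i}: some long content" for i in range(10)]
compressed = compress_history(history)
```

The recent window stays byte-for-byte intact, so the model still sees the latest turns verbatim, while the older turns collapse into a single bounded-size entry.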