14-stage Fusion Pipeline for LLM token compression — reversible compression, AST-aware code analysis, intelligent content routing. Zero LLM inference cost. MIT licensed.
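None of these projects' internals appear on this page, but as a rough, hypothetical illustration of what "reversible compression" with "AST-aware code analysis" can mean, the sketch below strips function docstrings from Python source while keeping a side table that lets the text be restored. The function names and the docstring-only strategy are assumptions for the example, not the Fusion Pipeline's actual stages.

```python
# Hypothetical sketch (not the Fusion Pipeline itself): remove docstrings to
# shrink the token count of code sent to an LLM, and keep enough information
# to reverse the transformation afterwards. Requires Python 3.9+ (ast.unparse).
import ast

def compress_source(source: str) -> tuple[str, dict[str, str]]:
    """Strip function docstrings, returning compact source plus a side table."""
    tree = ast.parse(source)
    saved: dict[str, str] = {}
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            doc = ast.get_docstring(node)
            # Only strip when other statements remain, so the body stays valid.
            if doc is not None and len(node.body) > 1:
                saved[node.name] = doc          # keyed by function name; simplified
                node.body = node.body[1:]
    return ast.unparse(tree), saved

def restore_source(compressed: str, saved: dict[str, str]) -> str:
    """Invert compress_source by re-inserting the saved docstrings."""
    tree = ast.parse(compressed)
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)) and node.name in saved:
            node.body.insert(0, ast.Expr(value=ast.Constant(value=saved[node.name])))
    return ast.unparse(tree)

example = 'def add(a, b):\n    """Return the sum of a and b."""\n    return a + b\n'
compact, table = compress_source(example)   # fewer tokens to send to the model
original = restore_source(compact, table)   # docstring text restored (formatting normalized)
```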
Biological code organization system with 1,029+ production-ready snippets; 95% token reduction for Claude/GPT, with AI-powered discovery and offline packs.
AI-powered tutoring system for Indian state-board students. Upload textbook PDFs and get curriculum-aligned answers instantly. Uses context pruning to score and filter chapters before querying the Gemini LLM, reducing API costs by ~80%. Built with Python, Flask, FAISS, sentence-transformers, and Gemini 2.5 Flash.
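The "score and filter chapters" step above is a standard embedding-retrieval pattern. Below is a minimal, assumed sketch of how such context pruning can look with sentence-transformers and FAISS: embed the chapters once, rank them against the question, and pass only the top matches to the LLM. The model name, `top_k`, and `min_score` values are illustrative, not taken from the project, and the Gemini call itself is omitted.

```python
# Assumed sketch of context pruning before an LLM call: rank textbook chapters
# against the question and keep only the most relevant ones in the prompt.
# Model name and thresholds are placeholders, not the project's real settings.
import faiss
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")   # assumed embedding model

def build_index(chapters: list[str]) -> faiss.IndexFlatIP:
    vecs = model.encode(chapters, normalize_embeddings=True)
    index = faiss.IndexFlatIP(vecs.shape[1])       # inner product = cosine on unit vectors
    index.add(np.asarray(vecs, dtype="float32"))
    return index

def prune_context(question: str, chapters: list[str], index: faiss.IndexFlatIP,
                  top_k: int = 3, min_score: float = 0.3) -> list[str]:
    q = model.encode([question], normalize_embeddings=True).astype("float32")
    scores, ids = index.search(q, top_k)
    # Drop padding ids (-1) and chapters scoring below the floor.
    return [chapters[i] for s, i in zip(scores[0], ids[0]) if i != -1 and s >= min_score]

# Usage (hypothetical): only the pruned chapters go into the LLM prompt,
# which is where the API-cost saving comes from.
# kept = prune_context("Explain photosynthesis", chapters, build_index(chapters))
# prompt = "Answer using only these excerpts:\n\n" + "\n---\n".join(kept)
```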