knowledge-rag
by lyonzin·★ 79·Score 48
Knowledge RAG is a local-first RAG system with 12 MCP tools for document search and retrieval across 20+ file formats.
Overview
Knowledge RAG is a comprehensive local-first RAG system that enables instant search across various document formats without requiring any external servers or API keys. It implements hybrid search combining BM25 keyword search, semantic vectors, and cross-encoder reranking for high precision results. The system is built with Python using FastMCP framework and provides 12 MCP tools for full CRUD operations, search, evaluation, and document management.
Try asking AI
After installing, here are 5 things you can ask your AI assistant:
When to choose this
Choose this for local-first document search when you need privacy, no internet connectivity, or want to avoid API costs.
When NOT to choose this
Avoid this if you need real-time collaborative features or require searching across documents stored in cloud services.
Tools this server exposes
11 tools extracted from the READMEsearchquery: strSearch through indexed documents using hybrid search with semantic and keyword matching
getdoc_id: strRetrieve a specific document by its ID
addpath: strAdd a new document to the index
updatepath: strUpdate an existing document in the index
removepath: strRemove a document from the index
reindexRebuild the entire index from scratch
listList all documents in the index
statsGet statistics about the index
urlurl: strIndex content from a web URL
similardoc_id: strFind documents similar to a reference document
evaluateEvaluate retrieval performance with MRR@5 and Recall@5 metrics
Comparable tools
Installation
Install knowledge-rag via pip:
pip install knowledge-ragFor Claude Desktop, add to your claude_desktop_config.json:
{
"mcpServers": {
"knowledge-rag": {
"command": "python",
"args": ["-m", "knowledge_rag.server"],
"env": {}
}
}
}Also available via NPM, Docker, and one-line installer.
FAQ
- What file formats are supported?
- Knowledge RAG supports 20+ formats including PDF, Markdown, Python code, Word documents, Excel files, Jupyter notebooks, and various programming languages (C++, JavaScript, TypeScript, etc.).
- Is it truly local with no cloud dependencies?
- Yes, all processing runs locally via ONNX. No servers, API keys, or data leaves your machine.
On Hacker News
Recent discussion from the developer community.
- Story by lyonzin · 2026-03-19
Compare knowledge-rag with
Last updated · Auto-generated from public README + GitHub signals.