haiku.rag
by ggozad·★ 524·Score 53
Agentic RAG with multimodal search, analysis agents, and MCP server capabilities.
Overview
Haiku RAG is a comprehensive retrieval-augmented generation system built on LanceDB, Pydantic AI, and Docling. It features hybrid search combining vector and full-text search with multimodal capabilities for text and images. The system includes research agents for complex information tasks, analysis agents with Python code execution, and conversational interfaces. Its MCP server implementation exposes tools for document management, search, QA, and research directly within AI assistants like Claude Desktop.
Try asking AI
After installing, here are 5 things you can ask your AI assistant:
When to choose this
Choose Haiku RAG when you need a production-ready multimodal RAG system with agent capabilities and MCP integration for AI assistants.
When NOT to choose this
Don't choose this if you need a simple vector search without agent capabilities or if you're looking for an open-source solution without external dependencies.
Tools this server exposes
6 tools extracted from the READMEadd_srcadd_src(source: str)Index a document source (file path or URL) for retrieval
searchsearch(query: str, limit: int = 10)Perform hybrid vector and full-text search over indexed documents
askask(question: str, cite: bool = false)Ask questions with citations from the indexed documents
researchresearch(topic: str)Perform iterative research planning, search, and synthesis
analyzeanalyze(task: str)Perform complex analytical tasks via sandboxed Python code execution
chatchat(message: str)Engage in a multi-turn conversational interaction with session memory
Note: Tool names inferred from CLI examples in the README; actual MCP tool signatures not documented
Comparable tools
Installation
pip install haiku.ragFor MCP server with Claude Desktop:
haiku-rag serve --mcp --stdioAdd to Claude Desktop config:
{
"mcpServers": {
"haiku-rag": {
"command": "haiku-rag",
"args": ["serve", "--mcp", "--stdio"]
}
}
}FAQ
- What document formats are supported?
- The system supports PDFs and other formats through Docling, with structure-aware context expansion.
- Can I run it locally?
- Yes, it's local-first with embedded LanceDB, but also supports S3, GCS, Azure, and LanceDB Cloud.
On Hacker News
Recent discussion from the developer community.
- Story by emzo · 2025-06-24
Compare haiku.rag with
Last updated · Auto-generated from public README + GitHub signals.