MCP Catalogs
Home

haiku.rag

by ggozad·524·Score 53

Agentic RAG with multimodal search, analysis agents, and MCP server capabilities.

ai-llmknowledge-graphdeveloper-tools
35
Forks
3
Open issues
this month
Last commit
2d ago
Indexed

Overview

Haiku RAG is a comprehensive retrieval-augmented generation system built on LanceDB, Pydantic AI, and Docling. It features hybrid search combining vector and full-text search with multimodal capabilities for text and images. The system includes research agents for complex information tasks, analysis agents with Python code execution, and conversational interfaces. Its MCP server implementation exposes tools for document management, search, QA, and research directly within AI assistants like Claude Desktop.

Try asking AI

After installing, here are 5 things you can ask your AI assistant:

you:Research document analysis with citation support
you:Multimodal search across documents and images
you:Complex analytical tasks via code execution agents
you:What document formats are supported?
you:Can I run it locally?

When to choose this

Choose Haiku RAG when you need a production-ready multimodal RAG system with agent capabilities and MCP integration for AI assistants.

When NOT to choose this

Don't choose this if you need a simple vector search without agent capabilities or if you're looking for an open-source solution without external dependencies.

Tools this server exposes

6 tools extracted from the README
  • add_srcadd_src(source: str)

    Index a document source (file path or URL) for retrieval

  • searchsearch(query: str, limit: int = 10)

    Perform hybrid vector and full-text search over indexed documents

  • askask(question: str, cite: bool = false)

    Ask questions with citations from the indexed documents

  • researchresearch(topic: str)

    Perform iterative research planning, search, and synthesis

  • analyzeanalyze(task: str)

    Perform complex analytical tasks via sandboxed Python code execution

  • chatchat(message: str)

    Engage in a multi-turn conversational interaction with session memory

Note: Tool names inferred from CLI examples in the README; actual MCP tool signatures not documented

Comparable tools

semantic-search-servermemgptprivate-gptvectara-mcp

Installation

pip install haiku.rag

For MCP server with Claude Desktop:

haiku-rag serve --mcp --stdio

Add to Claude Desktop config:

{
  "mcpServers": {
    "haiku-rag": {
      "command": "haiku-rag",
      "args": ["serve", "--mcp", "--stdio"]
    }
  }
}

FAQ

What document formats are supported?
The system supports PDFs and other formats through Docling, with structure-aware context expansion.
Can I run it locally?
Yes, it's local-first with embedded LanceDB, but also supports S3, GCS, Azure, and LanceDB Cloud.

On Hacker News

Recent discussion from the developer community.

Compare haiku.rag with

GitHub →

Last updated · Auto-generated from public README + GitHub signals.