MCP Catalogs
Home

rlm-claude

by EncrEor·35·Score 46

RLM is an MCP server providing persistent memory for Claude Code with auto-save, semantic search, and multi-project organization.

ai-llmdeveloper-toolsproductivity
8
Forks
1
Open issues
2 mo ago
Last commit
2d ago
Indexed

Overview

RLM solves Claude Code's context window limitation by providing an MCP server that automatically saves conversation history before compact operations. It implements two memory systems: Insights for key decisions and facts, and Chunks for full conversation segments. The server offers advanced features including semantic search with BM25 + cosine similarity, fuzzy search, multi-project support, smart retention policies, and sub-agent analysis capabilities. RLM hooks into Claude Code's events to provide zero-configuration persistent memory across sessions.

Try asking AI

After installing, here are 5 things you can ask your AI assistant:

you:Maintaining long-term project context across multiple Claude Code sessions
you:Preserving critical decisions and architectural choices throughout development
you:Searching through historical conversations using semantic and fuzzy matching
you:How does RLM persist memory between sessions?
you:What's the difference between Insights and Chunks?

When to choose this

Choose RLM if you work with Claude Code and need persistent memory across sessions, especially for long-term projects where context continuity is crucial.

When NOT to choose this

Don't choose RLM if you need memory for non-Claude environments, or if you prefer all-in-one solutions like MemGPT that come with their own runtime.

Tools this server exposes

12 tools extracted from the README
  • rlm_remember

    Save decisions, facts, preferences with categories and importance levels

  • rlm_recall

    Search insights by keyword, category, or importance

  • rlm_forget

    Remove an insight from memory

  • rlm_status

    Get system overview including insight count and chunk stats

  • rlm_chunk

    Save conversation segments with typed categorization

  • rlm_peek

    Read a chunk in full or partially by line range

  • rlm_grep

    Regex search across all chunks with optional fuzzy matching

  • rlm_search

    Hybrid search using BM25 + semantic similarity across chunks and insights

  • rlm_list_chunks

    List all chunks with metadata

  • rlm_sessions

    Browse sessions by project or domain

  • rlm_retention_preview

    Preview what would be archived in a dry-run

  • rlm_retention_run

    Archive old unused chunks and purge ancient ones

Comparable tools

memgptlettaclaude-desktop

Installation

Installation

Via PyPI (recommended)

pip install mcp-rlm-server[all]

Via uv

uv tool install mcp-rlm-server[all] --python 3.12

Configure Claude Code

Add to your Claude Code configuration:

{
  "mcpServers": {
    "rlm": {
      "command": "python",
      "args": ["-m", "mcp_server.server"]
    }
  }
}

Then restart Claude Code and start using the 14 available RLM tools.

FAQ

How does RLM persist memory between sessions?
RLM hooks into Claude Code's /compact event to automatically save snapshots before context loss. Memory is stored locally in ~/.claude/rlm/ with JSON for insights and chunk files for conversation history.
What's the difference between Insights and Chunks?
Insights store key decisions, facts and preferences with categories and importance levels. Chunks store full conversation segments typed as snapshot, session or debug. Insights are loaded by importance, chunks are searched and retrieved on demand.

Compare rlm-claude with

GitHub →

Last updated · Auto-generated from public README + GitHub signals.