rlm-claude

Name: rlm-claude
Rating: 2.3 (35 reviews)
Author: EncrEor

by EncrEor·★ 35·Score 46

RLM is an MCP server providing persistent memory for Claude Code with auto-save, semantic search, and multi-project organization.

ai-llmdeveloper-toolsproductivity

Forks

Open issues

4 mo ago

Last commit

56d ago

Indexed

Overview

RLM solves Claude Code's context window limitation by providing an MCP server that automatically saves conversation history before compact operations. It implements two memory systems: Insights for key decisions and facts, and Chunks for full conversation segments. The server offers advanced features including semantic search with BM25 + cosine similarity, fuzzy search, multi-project support, smart retention policies, and sub-agent analysis capabilities. RLM hooks into Claude Code's events to provide zero-configuration persistent memory across sessions.

Try asking AI

After installing, here are 5 things you can ask your AI assistant:

you:Maintaining long-term project context across multiple Claude Code sessions

you:Preserving critical decisions and architectural choices throughout development

you:Searching through historical conversations using semantic and fuzzy matching

you:How does RLM persist memory between sessions?

you:What's the difference between Insights and Chunks?

When to choose this

Choose RLM if you work with Claude Code and need persistent memory across sessions, especially for long-term projects where context continuity is crucial.

When NOT to choose this

Don't choose RLM if you need memory for non-Claude environments, or if you prefer all-in-one solutions like MemGPT that come with their own runtime.

Tools this server exposes

12 tools extracted from the README

rlm_remember
Save decisions, facts, preferences with categories and importance levels
rlm_recall
Search insights by keyword, category, or importance
rlm_forget
Remove an insight from memory
rlm_status
Get system overview including insight count and chunk stats
rlm_chunk
Save conversation segments with typed categorization
rlm_peek
Read a chunk in full or partially by line range
rlm_grep
Regex search across all chunks with optional fuzzy matching
rlm_search
Hybrid search using BM25 + semantic similarity across chunks and insights
rlm_list_chunks
List all chunks with metadata
rlm_sessions
Browse sessions by project or domain
rlm_retention_preview
Preview what would be archived in a dry-run
rlm_retention_run
Archive old unused chunks and purge ancient ones

Comparable tools

memgptlettaclaude-desktop

Installation

Via PyPI (recommended)

pip install mcp-rlm-server[all]

Via uv

uv tool install mcp-rlm-server[all] --python 3.12

Configure Claude Code

Add to your Claude Code configuration:

{
  "mcpServers": {
    "rlm": {
      "command": "python",
      "args": ["-m", "mcp_server.server"]
    }
  }
}

Then restart Claude Code and start using the 14 available RLM tools.

FAQ

How does RLM persist memory between sessions?: RLM hooks into Claude Code's /compact event to automatically save snapshots before context loss. Memory is stored locally in ~/.claude/rlm/ with JSON for insights and chunk files for conversation history.
What's the difference between Insights and Chunks?: Insights store key decisions, facts and preferences with categories and importance levels. Chunks store full conversation segments typed as snapshot, session or debug. Insights are loaded by importance, chunks are searched and retrieved on demand.

Compare rlm-claude with

rlm-claude vs ultimate_mcp_server rlm-claude vs mcp-server-chart rlm-claude vs everything rlm-claude vs filesystem rlm-claude vs time

GitHub →

Last updated 2026-05-17 · Auto-generated from public README + GitHub signals.