context-engineering
by timothywarner-org·★ 23·Score 45
Production-ready MCP server implementing semantic memory with hybrid RAG and CoALA memory tiers for AI assistants.
Overview
This is a comprehensive MCP server implementing the WARNERCO Schematica application, which demonstrates production-grade semantic memory systems. It utilizes FastMCP, FastAPI, and LangGraph to implement all four CoALA memory tiers (Working, Episodic, Semantic, Procedural) in a 9-node pipeline. The server provides 28 MCP tools, 11 resources, and 5 prompts for building AI systems with persistent memory. It features progressive tool loading capabilities and episodic recall using Park et al.'s scoring formula.
Try asking AI
After installing, here are 5 things you can ask your AI assistant:
When to choose this
Choose this when you need a comprehensive memory system for AI assistants, particularly if you're implementing production solutions with semantic memory, hybrid RAG, and need educational examples of MCP implementation.
When NOT to choose this
Don't choose this if you need a simple MCP server without the complexity of a full memory architecture, or if you're looking for a solution that works with AI models other than Claude.
Tools this server exposes
12 tools extracted from the READMEwarn_search_toolsKeyword discovery of MCP tools with detail levels
warn_describe_toolGet full schema for a specific MCP tool by name
parse_intentParse user intent for the LangGraph pipeline
query_graphQuery the knowledge graph for relationships and entities
inject_scratchpadInject working memory into the current session
recall_episodesRecall episodic memories based on relevance and importance
retrieveRetrieve information from semantic memory vector store
compress_contextCompress retrieved context for efficient processing
reasonReason over compressed context to generate insights
respondGenerate a response based on the processed context
log_episodeLog the current interaction as an episodic memory
warn_episodic_recallRecall episodic memories with detailed scoring breakdown
Note: Tool names inferred from the LangGraph pipeline description and CoALA memory tiers documentation. Descriptions are based on their roles in the pipeline and memory system. Signatures are not explicitly provided in the README.
Comparable tools
Installation
Installation
Prerequisites
- Python 3.13 (pinned in
.python-version) - Node.js 20+ (for Lab 01 and MCP Inspector)
- [uv](https://docs.astral.sh/uv/) package manager (recommended)
- Claude Desktop or Claude Code
Option 1: Hello MCP Lab (Beginner)
git clone https://github.com/timothywarner-org/context-engineering.git
cd context-engineering/labs/lab-01-hello-mcp/starter
npm install && npm startOption 2: WARNERCO Schematica (Full Application)
cd src/warnerco/backend
uv sync
uv run uvicorn app.main:app --reload # HTTP server
uv run warnerco-mcp # MCP stdio serverClaude Desktop Configuration
{
"mcpServers": {
"warnerco": {
"command": "uv",
"args": ["run", "warnerco-mcp"],
"cwd": "C:/github/context-engineering/src/warnerco/backend"
}
}
}FAQ
- What is the CoALA memory system?
- CoALA (Context-Aware Long-term Architecture) is a four-tier memory system consisting of Working, Episodic, Semantic, and Procedural memory tiers implemented in this MCP server.
- How do I test this MCP server?
- You can test the server using the MCP Inspector with the command: `npx @modelcontextprotocol/inspector uv run warnerco-mcp`, which opens http://localhost:5173.
Compare context-engineering with
Last updated · Auto-generated from public README + GitHub signals.