tokio-prompt-orchestrator
by Mattbusel·★ 55·Score 45
Multi-core Tokio-native orchestration for LLM pipelines with MCP server support.
Overview
Tokio-prompt-orchestrator is a production-grade LLM orchestration framework built in Rust. It provides advanced features including bounded-backpressure DAG processing, deduplication, circuit breakers, rate limiting, prompt injection detection, and provider arbitrage. The system exposes MCP (Model Context Protocol) integration for Claude Desktop, along with REST, WebSocket, and other APIs. It supports multiple LLM backends including Anthropic, OpenAI, llama.cpp, and vLLM.
Try asking AI
After installing, here are 5 things you can ask your AI assistant:
When to choose this
Choose this when you need a production-ready, multi-core orchestration system for LLM pipelines with advanced features like provider arbitrage and request deduplication.
When NOT to choose this
Avoid this if you need a simple, single-provider solution or if you're not using Rust/Tokio in your tech stack.
Tools this server exposes
12 tools extracted from the READMEcreate_ab_testCreate or replace an A/B testing experiment for prompt variants
get_ab_testRetrieve details of a specific A/B test experiment
create_sessionCreate a new conversation session with optional system prompt
append_messageAdd a message to an existing conversation session
get_sessionRetrieve metadata and messages from a session
delete_sessionRemove a conversation session entirely
render_templateRender a prompt template with variable substitution
register_templateStore a new named prompt template for later use
list_templatesShow all available prompt templates
query_audit_logSearch and filter the audit log of LLM requests
get_audit_statsGet aggregate statistics from the audit log
export_audit_logDownload all audit log entries as JSONL
Note: Inferred tool names from REST API endpoints and documentation. The README shows API endpoints but doesn't explicitly list them as MCP tools.
Comparable tools
Installation
Add to your Cargo.toml:
tokio-prompt-orchestrator = "1.9"For Claude Desktop MCP integration, add to claude_desktop_config.json:
{
"mcpServers": {
"tokio-prompt-orchestrator": {
"command": "cargo",
"args": ["run", "--release", "--package", "tokio-prompt-orchestrator", "--", "--mcp"]
}
}
}FAQ
- What LLM providers are supported?
- The system supports Anthropic, OpenAI, llama.cpp, vLLM, and any custom backend through its extensible architecture.
- How does the deduplication feature work?
- It uses SHA-256 hashing to identify identical in-flight requests, coalescing them to a single backend call with a 30-second TTL to prevent memory leaks.
On Hacker News
Recent discussion from the developer community.
- Story by Shmungus · 2026-02-24
- Story by Shmungus · 2026-02-21
Compare tokio-prompt-orchestrator with
Last updated · Auto-generated from public README + GitHub signals.