tokio-prompt-orchestrator

Name: tokio-prompt-orchestrator
Rating: 2.3 (55 reviews)
Author: Mattbusel

by Mattbusel·★ 55·Score 45

Multi-core Tokio-native orchestration for LLM pipelines with MCP server support.

ai-llmdeveloper-toolsops-infra

Forks

Open issues

2 mo ago

Last commit

2d ago

Indexed

Overview

Tokio-prompt-orchestrator is a production-grade LLM orchestration framework built in Rust. It provides advanced features including bounded-backpressure DAG processing, deduplication, circuit breakers, rate limiting, prompt injection detection, and provider arbitrage. The system exposes MCP (Model Context Protocol) integration for Claude Desktop, along with REST, WebSocket, and other APIs. It supports multiple LLM backends including Anthropic, OpenAI, llama.cpp, and vLLM.

Try asking AI

After installing, here are 5 things you can ask your AI assistant:

you:Enterprise LLM API orchestration with provider failover and cost optimization

you:Production LLM service with prompt injection protection and request deduplication

you:Multi-model deployments with load balancing and adaptive worker pool sizing

you:What LLM providers are supported?

you:How does the deduplication feature work?

When to choose this

Choose this when you need a production-ready, multi-core orchestration system for LLM pipelines with advanced features like provider arbitrage and request deduplication.

When NOT to choose this

Avoid this if you need a simple, single-provider solution or if you're not using Rust/Tokio in your tech stack.

Tools this server exposes

12 tools extracted from the README

create_ab_test
Create or replace an A/B testing experiment for prompt variants
get_ab_test
Retrieve details of a specific A/B test experiment
create_session
Create a new conversation session with optional system prompt
append_message
Add a message to an existing conversation session
get_session
Retrieve metadata and messages from a session
delete_session
Remove a conversation session entirely
render_template
Render a prompt template with variable substitution
register_template
Store a new named prompt template for later use
list_templates
Show all available prompt templates
query_audit_log
Search and filter the audit log of LLM requests
get_audit_stats
Get aggregate statistics from the audit log
export_audit_log
Download all audit log entries as JSONL

Note: Inferred tool names from REST API endpoints and documentation. The README shows API endpoints but doesn't explicitly list them as MCP tools.

Comparable tools

llama-cpp-mcpopenai-mcpanthropic-mcpvllm-mcp

Installation

Add to your Cargo.toml:

tokio-prompt-orchestrator = "1.9"

For Claude Desktop MCP integration, add to claude_desktop_config.json:

{
  "mcpServers": {
    "tokio-prompt-orchestrator": {
      "command": "cargo",
      "args": ["run", "--release", "--package", "tokio-prompt-orchestrator", "--", "--mcp"]
    }
  }
}

FAQ

What LLM providers are supported?: The system supports Anthropic, OpenAI, llama.cpp, vLLM, and any custom backend through its extensible architecture.
How does the deduplication feature work?: It uses SHA-256 hashing to identify identical in-flight requests, coalescing them to a single backend call with a 30-second TTL to prevent memory leaks.

On Hacker News

Recent discussion from the developer community.

Show HN: Tokio-prompt-orchestrator – LLM pipeline orchestration in Rust▲ 2
Story by Shmungus · 2026-02-24
24 Simultaneous Claude Code agents on local hardware▲ 2💬 1
Story by Shmungus · 2026-02-21

Compare tokio-prompt-orchestrator with

tokio-prompt-orchestrator vs ultimate_mcp_server tokio-prompt-orchestrator vs mcp-server-chart tokio-prompt-orchestrator vs everything tokio-prompt-orchestrator vs filesystem tokio-prompt-orchestrator vs time

GitHub →

Last updated 2026-05-17 · Auto-generated from public README + GitHub signals.