tokio-prompt-orchestrator

by Mattbusel · 55 · Score 45

Multi-core Tokio-native orchestration for LLM pipelines with MCP server support.

ai-llm · developer-tools · ops-infra
Forks: 5
Open issues: 2
Last commit: 2 mo ago
Indexed: 2d ago

Overview

tokio-prompt-orchestrator is a production-grade LLM orchestration framework written in Rust. Its features include bounded-backpressure DAG processing, request deduplication, circuit breakers, rate limiting, prompt injection detection, and provider arbitrage. It exposes MCP (Model Context Protocol) integration for Claude Desktop alongside REST, WebSocket, and other APIs, and it supports multiple LLM backends, including Anthropic, OpenAI, llama.cpp, and vLLM.

Try asking AI

After installing, here are 5 things you can ask your AI assistant:

you: Enterprise LLM API orchestration with provider failover and cost optimization
you: Production LLM service with prompt injection protection and request deduplication
you: Multi-model deployments with load balancing and adaptive worker pool sizing
you: What LLM providers are supported?
you: How does the deduplication feature work?

When to choose this

Choose this when you need a production-ready, multi-core orchestration system for LLM pipelines with advanced features like provider arbitrage and request deduplication.

When NOT to choose this

Avoid this if you need a simple, single-provider solution or if you're not using Rust/Tokio in your tech stack.

Tools this server exposes

12 tools extracted from the README
  • create_ab_test

    Create or replace an A/B testing experiment for prompt variants

  • get_ab_test

    Retrieve details of a specific A/B test experiment

  • create_session

    Create a new conversation session with optional system prompt

  • append_message

    Add a message to an existing conversation session

  • get_session

    Retrieve metadata and messages from a session

  • delete_session

    Remove a conversation session entirely

  • render_template

    Render a prompt template with variable substitution

  • register_template

    Store a new named prompt template for later use

  • list_templates

    Show all available prompt templates

  • query_audit_log

    Search and filter the audit log of LLM requests

  • get_audit_stats

    Get aggregate statistics from the audit log

  • export_audit_log

    Download all audit log entries as JSONL

Note: These tool names are inferred from the REST API endpoints and documentation. The README shows API endpoints but does not explicitly list them as MCP tools.

Comparable tools

llama-cpp-mcp · openai-mcp · anthropic-mcp · vllm-mcp

Installation

Add to your Cargo.toml:

[dependencies]
tokio-prompt-orchestrator = "1.9"

For Claude Desktop MCP integration, add to claude_desktop_config.json:

{
  "mcpServers": {
    "tokio-prompt-orchestrator": {
      "command": "cargo",
      "args": ["run", "--release", "--package", "tokio-prompt-orchestrator", "--", "--mcp"]
    }
  }
}

FAQ

What LLM providers are supported?
The system supports Anthropic, OpenAI, llama.cpp, and vLLM, plus custom backends through its extensible architecture.

How does the deduplication feature work?
It hashes each request with SHA-256 to identify identical in-flight requests and coalesces them into a single backend call. Entries expire after a 30-second TTL to prevent memory leaks.
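
The deduplication answer above can be illustrated with a minimal, standard-library-only sketch. It is not the crate's implementation: the names Deduplicator, Decision, key, check, complete, and in_flight_count are hypothetical. Because the Rust standard library has no SHA-256, DefaultHasher stands in for the digest. The real system is asynchronous, while this sketch is synchronous and takes the current time as an argument so its behaviour is easy to follow.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::HashMap;
use std::hash::{Hash, Hasher};
use std::time::{Duration, Instant};

/// What the caller should do with an incoming request.
#[derive(Debug, PartialEq, Eq)]
pub enum Decision {
    /// No identical request is in flight: send this one to the backend.
    Dispatch,
    /// An identical request is already in flight: wait for its result.
    Coalesce,
}

/// Hypothetical in-flight tracker (an illustration, not the crate's type).
pub struct Deduplicator {
    ttl: Duration,
    /// Request key -> time the first identical request was dispatched.
    in_flight: HashMap<u64, Instant>,
}

impl Deduplicator {
    pub fn new(ttl: Duration) -> Self {
        Self { ttl, in_flight: HashMap::new() }
    }

    /// Assumption: DefaultHasher replaces the SHA-256 digest named in the FAQ.
    pub fn key(model: &str, prompt: &str) -> u64 {
        let mut hasher = DefaultHasher::new();
        model.hash(&mut hasher);
        prompt.hash(&mut hasher);
        hasher.finish()
    }

    /// Decide whether a request should be dispatched or coalesced.
    pub fn check(&mut self, key: u64, now: Instant) -> Decision {
        // Evict entries at or past the TTL so abandoned requests cannot
        // accumulate in memory.
        let ttl = self.ttl;
        self.in_flight
            .retain(|_, started| now.duration_since(*started) < ttl);

        if self.in_flight.contains_key(&key) {
            Decision::Coalesce
        } else {
            self.in_flight.insert(key, now);
            Decision::Dispatch
        }
    }

    /// Call when the backend responds, so later identical requests
    /// are dispatched again.
    pub fn complete(&mut self, key: u64) {
        self.in_flight.remove(&key);
    }

    /// Number of requests currently tracked as in flight.
    pub fn in_flight_count(&self) -> usize {
        self.in_flight.len()
    }
}
```

With a 30-second TTL, a first call to check for a given key returns Decision::Dispatch, a repeat inside the window returns Decision::Coalesce, and a repeat after the window dispatches again because the stale entry has been evicted. A production version would also need a way to hand the single backend response to every coalesced caller, which this sketch leaves out.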


Last updated · Auto-generated from public README + GitHub signals.