houtini-lm

Name: houtini-lm
Rating: 2.4 (88 reviews)
Author: houtini-ai

by houtini-ai·★ 88·综合分 48

MCP 服务器，将 Claude Code 中的有界任务卸载到本地或云 LLM，减少代币消耗。

ai-llmdeveloper-toolsproductivity

Forks

活跃 Issue

3 个月前

概述

Houtini LM 是一个专门的 MCP 服务器，旨在通过将有界任务从 Claude Code 卸载到本地或云 LLM 来节省代币。它可与多种端点协同工作，包括 LM Studio、Ollama、vLLM、DeepSeek、Groq、Cerebras 和 OpenRouter。该服务器能够智能地将任务路由到合适的模型，跟踪性能指标，并提供模型发现功能。它专注于卸载明确定义的任务，如代码审查、测试生成和文档编写，同时将复杂的推理任务保留给 Claude。

试试问 AI

装完之后，这里有 5 个你可以让 AI 做的事：

你:将代码审查和测试生成卸载到本地 LLM，同时 Claude 处理架构规划

你:使用 DeepSeek 等 API 生成样板代码，节省 Claude 的昂贵代币

你:使用更便宜的本地方言根据差异生成提交消息和文档

你:哪些类型的任务会被卸载到本地模型？

你:模型路由如何工作？

什么时候选它

已经使用 Claude Code 的开发者，希望通过将有限任务委托给本地或云端大语言模型来降低 API 成本，同时不损害复杂推理能力。

什么时候不要选它

需要超低延迟响应的用户，因为本地大语言模型的推理速度通常比前沿模型慢 3-30 倍。

此 server 暴露的工具

从 README 抽取出 4 个工具

chatmessage: string, system?: string, temperature?: number, max_tokens?: number, json_schema?: object, model?: string
Send a task to an LLM and get an answer. Offloads bounded tasks like generating boilerplate or explanations.
custom_promptinstruction: string, system?: string, context?: string, temperature?: number, max_tokens?: number, json_schema?: object, model?: string
Send a three-part prompt (system, context, instruction) to an LLM for consistent high-quality output.
code_taskcode: string, task: string, language?: string, max_tokens?: number, model?: string
Built-in code analysis with pre-configured system prompts for finding bugs, explaining code, or writing tests.
code_task_filestask: string, files: string[], max_tokens?: number, model?: string
Review multiple related files or large files directly from disk without context window limits.

可对比工具

claude-llms-mcpmcp-llmlocal-mcp-serverollama-mcp

安装

# 用于 Claude Code
claude mcp add houtini-lm -- npx -y @houtini/lm

# 用于 Claude Desktop
添加到 claude_desktop_config.json:
{
  "mcpServers": {
    "houtini-lm": {
      "command": "npx",
      "args": ["-y", "@houtini/lm"],
      "env": {
        "HOUTINI_LM_ENDPOINT_URL": "http://localhost:1234"
      }
    }
  }
}

FAQ

哪些类型的任务会被卸载到本地模型？: 有界且定义明确的任务，如生成测试存根、代码审查、格式转换、生成模拟数据和编写类型定义。复杂的推理、工具访问和多文件更改仍保留给 Claude。
模型路由如何工作？: Houtini-LM 查询您的 LLM 服务器以获取可用模型，在 HuggingFace 上查找它们的元数据，并在本地数据库中维护模型功能。然后它会根据任务类型对加载的模型进行评分，自动为每个任务选择最佳模型。

houtini-lm 对比

houtini-lm vs ultimate_mcp_server houtini-lm vs mcp-server-chart houtini-lm vs everything houtini-lm vs filesystem houtini-lm vs time

GitHub →

最后更新于 2026-05-17 · 由 README + GitHub 公开数据自动生成。