
ingero
by ingero-io·★ 81·Score 48
Ingero is an eBPF-based GPU observability tool that exposes diagnostic capabilities through MCP for AI assistants.
Overview
Ingero traces the full chain from Linux kernel events through CUDA API calls to Python source lines with less than 2% overhead. It correlates CPU scheduler events, CUDA Runtime API calls, and CUDA Driver API calls by timestamp and PID, then generates causal chain explanations. The MCP server component allows AI assistants like Claude Code and Cursor to ask natural language questions about GPU performance issues and receive resolved causal chains with Python source lines, no SQL or hex addresses required.
Try asking AI
After installing, here are 5 things you can ask your AI assistant:
When to choose this
Choose Ingero when you need deep causal analysis of GPU performance issues in ML/AI workloads and want to query insights through AI assistants without SQL knowledge.
When NOT to choose this
Don't choose Ingero if you need cross-platform support (Linux-only), require very low overhead (<0.5%), or need to analyze non-GPU performance issues.
Tools this server exposes
6 tools extracted from the READMEmcpExpose Ingero as a Model Context Protocol server for AI-assisted GPU observability
traceTrace GPU workloads and capture CUDA events in real-time
explainGenerate an automated diagnosis of GPU performance issues
checkVerify system readiness for GPU tracing
queryQuery the GPU trace data for specific time periods
demoRun demonstration scenarios with synthetic data
Comparable tools
Installation
Binary Install (recommended)
VERSION=0.16.0
curl -fsSL "https://github.com/ingero-io/ingero/releases/download/v${VERSION}/ingero_${VERSION}_linux_amd64.tar.gz" | tar xz
sudo mv ingero /usr/local/bin/Start MCP server
ingero mcpClaude Desktop Integration
Add to claude_desktop_config.json:
{
"mcpServers": {
"ingero": {
"command": "ingero",
"args": ["mcp"]
}
}
}FAQ
- What does the MCP server expose?
- The MCP server exposes tools for AI assistants to query GPU performance data, causal chains, and diagnostics through natural language questions.
- What is the overhead of using Ingero?
- Less than 2% overhead as measured on GPU hardware (0.4-1.7% across RTX 3090 through H100 with stack tracing).
Compare ingero with
Last updated · Auto-generated from public README + GitHub signals.