MCP Catalogs
Home

extract-llms-docs

by nirholas·30·Score 44

An MCP server that extracts llms.txt documentation from websites for AI agents.

ai-llmdeveloper-toolsweb-scraping
4
Forks
1
Open issues
this month
Last commit
2d ago
Indexed

Overview

llm.energy is a comprehensive web application and MCP server that fetches, parses, and organizes documentation from websites implementing the llms.txt and install.md standards. It transforms raw documentation into structured, agent-ready formats optimized for large language models, AI assistants, and developer tooling. The project provides both a web interface and MCP server capabilities.

Try asking AI

After installing, here are 5 things you can ask your AI assistant:

you:Feed documentation to AI coding assistants like Cursor and Windsurf
you:Build context-aware AI agents with up-to-date documentation
you:Create documentation pipelines for RAG systems
you:What is llms.txt?
you:What file formats can be exported?

When to choose this

Choose this MCP server when you need to extract and organize documentation from websites supporting llms.txt standard for AI assistants and RAG systems.

When NOT to choose this

Don't choose this if you need to extract documentation from websites that don't support the llms.txt standard, as it relies on this specific format for proper parsing.

Tools this server exposes

6 tools extracted from the README
  • extract_docs

    Extract documentation from a URL with llms.txt support

  • validate_url

    Check if a URL has llms.txt available

  • verify_llms_txt

    Verify llms.txt exists and get file info

  • discover_documentation_urls

    Find documentation URLs for a domain

  • list_sites

    Get directory of known llms.txt-enabled sites

  • search_sites

    Search the site directory by category or keyword

Comparable tools

mcp-web-scrapermcp-documentation-extractorllms-txt-cliai-documentation-tools

Installation

Installation

Web Application

Visit [llm.energy](https://llm.energy) to use the hosted version.

MCP Server

Add to your MCP client configuration (Claude Desktop, Cursor, etc.):

{
  "mcpServers": {
    "llm-energy": {
      "command": "npx",
      "args": ["-y", "@llm-energy/mcp-server"]
    }
  }
}

Local Development

# Clone the repository
git clone https://github.com/nirholas/extract-llms-docs.git
cd extract-llms-docs

# Install dependencies
pnpm install

# Start development server
pnpm dev

FAQ

What is llms.txt?
llms.txt is a machine-readable documentation format designed specifically for AI systems. It allows AI agents to easily understand and consume documentation from websites.
What file formats can be exported?
The application supports exporting documentation in Markdown, JSON, YAML, or as a ZIP archive containing all sections.

Compare extract-llms-docs with

GitHub →

Last updated · Auto-generated from public README + GitHub signals.