MCP Catalogs
Home

crawl4ai-mcp-server

by sadiuysal·85·Score 46

A lightweight MCP server exposing Crawl4AI web scraping tools for AI agents with Docker support.

web-scrapingdeveloper-toolsai-llm
11
Forks
1
Open issues
3 mo ago
Last commit
2d ago
Indexed

Overview

Crawl4AI MCP Server is a purpose-built implementation that exposes web scraping and crawling capabilities through the Model Context Protocol. It offers four main tools: scrape for single-page content extraction, crawl for multi-page breadth-first crawling with adaptive stopping, crawl_site for comprehensive site crawling, and crawl_sitemap for sitemap-based crawling. The server is self-hosted and free, providing an alternative to Firecrawl's API while offering similar functionality. It supports various AI development environments including OpenAI Agents SDK, Cursor, and Claude Code.

Try asking AI

After installing, here are 5 things you can ask your AI assistant:

you:Research assistants gathering information from multiple web pages
you:Content analysis tools extracting structured data from websites
you:AI agents that need to browse and understand web content as part of their workflow
you:Is this server similar to Firecrawl?
you:What safety features does the server implement?

When to choose this

Choose this when you need a self-hosted, free alternative to commercial web scraping APIs like Firecrawl, especially when working with sensitive data or requiring offline capabilities.

When NOT to choose this

Avoid this if you need advanced OCR capabilities for image-heavy sites or require official enterprise support with service-level agreements.

Tools this server exposes

4 tools extracted from the README
  • scrapescrape(url, output_dir, crawler, browser, script, timeout_sec)

    Fetch a single URL and return markdown content.

  • crawlcrawl(seed_url, max_depth, max_pages, same_domain_only, include_patterns, exclude_patterns, adaptive, output_dir, crawler, browser, script, timeout_sec)

    Multi-page breadth-first crawling with filtering and adaptive stopping.

  • crawl_sitecrawl_site(entry_url, output_dir, max_depth, max_pages, ...)

    Comprehensive site crawling with persistence.

  • crawl_sitemapcrawl_sitemap(sitemap_url, output_dir, max_entries, ...)

    Sitemap-based crawling with persistence.

Comparable tools

firecrawl-mcpbrowserless-mcpscrapingbee-api

Installation

Quick Start with Docker (Recommended)

# Pull pre-built image
docker pull uysalsadi/crawl4ai-mcp-server:latest

# Test the installation
python test-config.py

Manual Installation

git clone https://github.com/uysalsadi/crawl4ai-mcp-server.git
cd crawl4ai-mcp-server
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate
pip install -r requirements.txt
python -m playwright install chromium

Claude Desktop Configuration

Add to ~/.claude/claude_desktop_config.json:

{
  "mcpServers": {
    "crawl4ai-mcp": {
      "command": "docker",
      "args": [
        "run", "--rm", "-i",
        "--volume", "/tmp/crawl4ai-crawls:/app/crawls",
        "uysalsadi/crawl4ai-mcp-server:latest"
      ],
      "env": {
        "CRAWL4AI_MCP_LOG": "INFO"
      }
    }
  }
}

FAQ

Is this server similar to Firecrawl?
Yes, it provides similar functionality to Firecrawl's API but is self-hosted and free, with no API key requirements.
What safety features does the server implement?
The server blocks internal networks, localhost, and private IPs by default to prevent scraping of sensitive or private content.

Compare crawl4ai-mcp-server with

GitHub →

Last updated · Auto-generated from public README + GitHub signals.