MCP Catalogs
Homemcp-wayback-machine screenshot

mcp-wayback-machine

by Mearman·24·Score 43

MCP server for Internet Archive's Wayback Machine with CDX search, snapshot retrieval, comparison and optional authentication.

web-scrapingai-llmdeveloper-tools
6
Forks
0
Open issues
this month
Last commit
2d ago
Indexed

Overview

This MCP server provides comprehensive access to the Internet Archive's Wayback Machine through the Model Context Protocol. It features full CDX search functionality, snapshot content retrieval, screenshot listing, snapshot comparison capabilities, and optional authentication for higher SPN2 rate limits. The server implements proper caching strategies with different TTLs for various endpoints, and includes rate limiting with automatic retry handling for 429 responses. It's built with TypeScript and Zod schemas for validation, requiring Node.js 22+.

Try asking AI

After installing, here are 5 things you can ask your AI assistant:

you:Archiving web pages directly through AI assistants
you:Comparing how a website has changed over time
you:Searching for historical snapshots of specific URLs
you:What authentication is required for this server?
you:How do I obtain credentials for the Wayback Machine?

When to choose this

Choose this MCP server when you need to archive, retrieve, or compare web pages through AI agents, especially when working with historical web content or tracking changes to websites over time.

When NOT to choose this

Avoid if you need to upload files to the Internet Archive (this only works with the Wayback Machine) or if you require non-commercial use only due to the CC BY-NC-SA license.

Tools this server exposes

7 tools extracted from the README
  • save_urlurl, captureScreenshot, captureOutlinks, ifNotArchivedWithin, jsBehaviorTimeout, forceGet, delayWbAvailability

    Archive a URL to the Wayback Machine using the SPN2 API.

  • get_archived_urlurl, timestamp, modifier

    Retrieve an archived snapshot's content and metadata.

  • search_archivesurl, matchType, from, to, limit, offset, collapse, filter, resolveRevisits, showDupeCount, page, pageSize

    Search the CDX API for archived versions of a URL.

  • check_archive_statusurl

    Check archival statistics for a URL — capture counts, yearly breakdowns, and first/last capture dates.

  • list_screenshotsurl, limit

    List available screenshots for a URL.

  • compare_snapshotsurl, timestampA, timestampB

    Compare two archived snapshots of a URL. Fetches the raw content of both and provides a visual diff URL.

  • clear_cache

    Clear all cached API responses. Use when fresh data is needed or after saving a new URL.

Comparable tools

internet-archive-skillsarchivebox-mcpwebpage-mcpwayback-cli

Installation

Installation

For Claude Desktop

Add to your claude_desktop_config.json:

{
  "mcpServers": {
    "wayback-machine": {
      "command": "npx",
      "args": ["-y", "mcp-wayback-machine"],
      "env": {
        "WAYBACK_ACCESS_KEY": "your-access-key",
        "WAYBACK_SECRET_KEY": "your-secret-key"
      }
    }
  }
}

CLI Shorthand

claude mcp add wayback-machine -- npx -y mcp-wayback-machine

Manual Configuration

For other harnesses, add to appropriate config file:

{
  "wayback-machine": {
    "command": "npx",
    "args": ["-y", "mcp-wayback-machine"],
    "env": {
      "WAYBACK_ACCESS_KEY": "your-access-key",
      "WAYBACK_SECRET_KEY": "your-secret-key"
    }
  }
}

FAQ

What authentication is required for this server?
The server works anonymously by default, but setting Internet Archive S3 credentials provides higher rate limits on save operations.
How do I obtain credentials for the Wayback Machine?
Log in to archive.org and visit your S3 API keys page at https://archive.org/account/s3.php to obtain your access and secret keys.

Compare mcp-wayback-machine with

GitHub →

Last updated · Auto-generated from public README + GitHub signals.