mcp-wayback-machine

Name: mcp-wayback-machine
Rating: 2.1 (24 reviews)
Author: Mearman

by Mearman·★ 24·Score 43

MCP server for Internet Archive's Wayback Machine with CDX search, snapshot retrieval, comparison and optional authentication.

web-scrapingai-llmdeveloper-tools

Forks

Open issues

2 mo ago

Last commit

56d ago

Indexed

Overview

This MCP server provides comprehensive access to the Internet Archive's Wayback Machine through the Model Context Protocol. It features full CDX search functionality, snapshot content retrieval, screenshot listing, snapshot comparison capabilities, and optional authentication for higher SPN2 rate limits. The server implements proper caching strategies with different TTLs for various endpoints, and includes rate limiting with automatic retry handling for 429 responses. It's built with TypeScript and Zod schemas for validation, requiring Node.js 22+.

Try asking AI

After installing, here are 5 things you can ask your AI assistant:

you:Archiving web pages directly through AI assistants

you:Comparing how a website has changed over time

you:Searching for historical snapshots of specific URLs

you:What authentication is required for this server?

you:How do I obtain credentials for the Wayback Machine?

When to choose this

Choose this MCP server when you need to archive, retrieve, or compare web pages through AI agents, especially when working with historical web content or tracking changes to websites over time.

When NOT to choose this

Avoid if you need to upload files to the Internet Archive (this only works with the Wayback Machine) or if you require non-commercial use only due to the CC BY-NC-SA license.

Tools this server exposes

7 tools extracted from the README

save_urlurl, captureScreenshot, captureOutlinks, ifNotArchivedWithin, jsBehaviorTimeout, forceGet, delayWbAvailability
Archive a URL to the Wayback Machine using the SPN2 API.
get_archived_urlurl, timestamp, modifier
Retrieve an archived snapshot's content and metadata.
search_archivesurl, matchType, from, to, limit, offset, collapse, filter, resolveRevisits, showDupeCount, page, pageSize
Search the CDX API for archived versions of a URL.
check_archive_statusurl
Check archival statistics for a URL — capture counts, yearly breakdowns, and first/last capture dates.
list_screenshotsurl, limit
List available screenshots for a URL.
compare_snapshotsurl, timestampA, timestampB
Compare two archived snapshots of a URL. Fetches the raw content of both and provides a visual diff URL.
clear_cache
Clear all cached API responses. Use when fresh data is needed or after saving a new URL.

Comparable tools

internet-archive-skillsarchivebox-mcpwebpage-mcpwayback-cli

Installation

For Claude Desktop

Add to your claude_desktop_config.json:

{
  "mcpServers": {
    "wayback-machine": {
      "command": "npx",
      "args": ["-y", "mcp-wayback-machine"],
      "env": {
        "WAYBACK_ACCESS_KEY": "your-access-key",
        "WAYBACK_SECRET_KEY": "your-secret-key"
      }
    }
  }
}

CLI Shorthand

claude mcp add wayback-machine -- npx -y mcp-wayback-machine

Manual Configuration

For other harnesses, add to appropriate config file:

{
  "wayback-machine": {
    "command": "npx",
    "args": ["-y", "mcp-wayback-machine"],
    "env": {
      "WAYBACK_ACCESS_KEY": "your-access-key",
      "WAYBACK_SECRET_KEY": "your-secret-key"
    }
  }
}

FAQ

What authentication is required for this server?: The server works anonymously by default, but setting Internet Archive S3 credentials provides higher rate limits on save operations.
How do I obtain credentials for the Wayback Machine?: Log in to archive.org and visit your S3 API keys page at https://archive.org/account/s3.php to obtain your access and secret keys.

Compare mcp-wayback-machine with

mcp-wayback-machine vs ultimate_mcp_server mcp-wayback-machine vs mcp-server-chart mcp-wayback-machine vs everything mcp-wayback-machine vs filesystem mcp-wayback-machine vs time

GitHub →

Last updated 2026-05-17 · Auto-generated from public README + GitHub signals.