
mcp-wayback-machine
by Mearman·★ 24·Score 43
MCP server for Internet Archive's Wayback Machine with CDX search, snapshot retrieval, comparison and optional authentication.
Overview
This MCP server provides comprehensive access to the Internet Archive's Wayback Machine through the Model Context Protocol. It features full CDX search functionality, snapshot content retrieval, screenshot listing, snapshot comparison capabilities, and optional authentication for higher SPN2 rate limits. The server implements proper caching strategies with different TTLs for various endpoints, and includes rate limiting with automatic retry handling for 429 responses. It's built with TypeScript and Zod schemas for validation, requiring Node.js 22+.
Try asking AI
After installing, here are 5 things you can ask your AI assistant:
When to choose this
Choose this MCP server when you need to archive, retrieve, or compare web pages through AI agents, especially when working with historical web content or tracking changes to websites over time.
When NOT to choose this
Avoid if you need to upload files to the Internet Archive (this only works with the Wayback Machine) or if you require non-commercial use only due to the CC BY-NC-SA license.
Tools this server exposes
7 tools extracted from the READMEsave_urlurl, captureScreenshot, captureOutlinks, ifNotArchivedWithin, jsBehaviorTimeout, forceGet, delayWbAvailabilityArchive a URL to the Wayback Machine using the SPN2 API.
get_archived_urlurl, timestamp, modifierRetrieve an archived snapshot's content and metadata.
search_archivesurl, matchType, from, to, limit, offset, collapse, filter, resolveRevisits, showDupeCount, page, pageSizeSearch the CDX API for archived versions of a URL.
check_archive_statusurlCheck archival statistics for a URL — capture counts, yearly breakdowns, and first/last capture dates.
list_screenshotsurl, limitList available screenshots for a URL.
compare_snapshotsurl, timestampA, timestampBCompare two archived snapshots of a URL. Fetches the raw content of both and provides a visual diff URL.
clear_cacheClear all cached API responses. Use when fresh data is needed or after saving a new URL.
Comparable tools
Installation
Installation
For Claude Desktop
Add to your claude_desktop_config.json:
{
"mcpServers": {
"wayback-machine": {
"command": "npx",
"args": ["-y", "mcp-wayback-machine"],
"env": {
"WAYBACK_ACCESS_KEY": "your-access-key",
"WAYBACK_SECRET_KEY": "your-secret-key"
}
}
}
}CLI Shorthand
claude mcp add wayback-machine -- npx -y mcp-wayback-machineManual Configuration
For other harnesses, add to appropriate config file:
{
"wayback-machine": {
"command": "npx",
"args": ["-y", "mcp-wayback-machine"],
"env": {
"WAYBACK_ACCESS_KEY": "your-access-key",
"WAYBACK_SECRET_KEY": "your-secret-key"
}
}
}FAQ
- What authentication is required for this server?
- The server works anonymously by default, but setting Internet Archive S3 credentials provides higher rate limits on save operations.
- How do I obtain credentials for the Wayback Machine?
- Log in to archive.org and visit your S3 API keys page at https://archive.org/account/s3.php to obtain your access and secret keys.
Compare mcp-wayback-machine with
Last updated · Auto-generated from public README + GitHub signals.