MCP Catalogs
Home

pdf-reader-mcp

by SylphxAI·711·Score 55

Production-ready MCP server for high-performance PDF processing with 5-10x faster parallel processing and comprehensive test coverage.

file-systemdeveloper-toolsai-llm
66
Forks
7
Open issues
this month
Last commit
2d ago
Indexed

Overview

PDF Reader MCP is a robust Model Context Protocol server designed to empower AI agents with enterprise-grade PDF processing capabilities. It offers significant performance advantages over traditional sequential processing through automatic parallelization, allowing users to process large PDF documents (50-100+ pages) in seconds rather than minutes. The server maintains a simple, elegant API while providing advanced features like Y-coordinate based content ordering that preserves document layout natural reading flow, making it particularly valuable for AI comprehension tasks.

Try asking AI

After installing, here are 5 things you can ask your AI assistant:

you:AI agents analyzing research papers and extracting key information
you:Document automation systems processing contracts and agreements
you:Content management systems extracting text from archived PDF documents
you:How is this different from traditional PDF processing?
you:What types of PDF files are supported?

When to choose this

Choose this when you need high-performance PDF processing for AI applications, especially with large documents where parallel processing makes a significant difference.

When NOT to choose this

Not ideal if you need to modify PDF content (this is read-only only) or require support for document formats other than PDF.

Tools this server exposes

1 tool extracted from the README
  • read_pdfsources: Array<{path?: string, url?: string, pages?: string | number[]}>, include_full_text?: boolean, include_metadata?: boolean, include_page_count?: boolean, include_images?: boolean

    Extract text, images, and metadata from PDF files with parallel processing

Comparable tools

pdf-parsepdf2picpdf-lib

Installation

Installation

Claude Desktop

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "pdf-reader": {
      "command": "npx",
      "args": ["@sylphx/pdf-reader-mcp"]
    }
  }
}

VS Code

code --add-mcp '{"name":"pdf-reader","command":"npx","args":["@sylphx/pdf-reader-mcp"]}'

npm

npm install -g @sylphx/pdf-reader-mcp

FAQ

How is this different from traditional PDF processing?
PDF Reader MCP uses parallel processing to achieve 5-10x faster performance compared to sequential processing. It also provides Y-coordinate based content ordering that preserves natural document layout.
What types of PDF files are supported?
The server supports local files (both absolute and relative paths on Windows/Unix) and HTTP/HTTPS URLs. It can process documents with text, images, and metadata extraction capabilities.

Compare pdf-reader-mcp with

GitHub →

Last updated · Auto-generated from public README + GitHub signals.