mcp-server-webscan
by bsmi021·★ 12·Score 37
MCP server for web scanning with page fetching, link extraction, crawling, and sitemap generation.
Overview
The MCP Webscan Server is a TypeScript-based implementation that provides tools for fetching, analyzing, and extracting information from web pages. It features page fetching with Markdown conversion, link extraction with filtering options, recursive site crawling with depth control, broken link checking, pattern matching for URLs, and XML sitemap generation. The server runs on stdio transport, making it compatible with MCP clients like Claude Desktop. The codebase is well-structured with clear separation between services, tools, and types.
Try asking AI
After installing, here are 5 things you can ask your AI assistant:
When to choose this
Choose this MCP server when you need to analyze web content, perform site audits, or generate sitemaps through AI agents.
When NOT to choose this
Don't choose this if you need real-time monitoring, have strict performance requirements, or require authentication for accessing protected content.
Tools this server exposes
6 tools extracted from the READMEfetch-pageurl: string, selector?: stringFetches a web page and converts it to Markdown
extract-linksurl: string, baseUrl?: string, limit?: numberExtracts all links from a web page with their text
crawl-siteurl: string, maxDepth?: numberRecursively crawls a website up to a specified depth
check-linksurl: stringChecks for broken links on a page
find-patternsurl: string, pattern: stringFinds URLs matching a specific pattern
generate-site-mapurl: string, maxDepth?: number, limit?: numberGenerates a simple XML sitemap by crawling
Comparable tools
Installation
Installing via Smithery
To install Webscan for Claude Desktop automatically via [Smithery](https://smithery.ai/server/mcp-server-webscan):
npx -y @smithery/cli install mcp-server-webscan --client claudeManual Installation
# Clone the repository
git clone <repository-url>
cd mcp-server-webscan
# Install dependencies
npm install
# Build the project
npm run buildClaude Desktop Configuration
{
"mcpServers": {
"webscan": {
"command": "node",
"args": ["path/to/mcp-server-webscan/build/index.js"],
"env": {
"NODE_ENV": "development",
"LOG_LEVEL": "info"
}
}
}
}FAQ
- What is the maximum depth for site crawling?
- The default max depth is 2, but it can be configured between 0-5 for the crawl-site and generate-site-map tools.
- Can I filter links by base URL?
- Yes, the extract-links tool accepts an optional baseUrl parameter to filter links.
Compare mcp-server-webscan with
Last updated · Auto-generated from public README + GitHub signals.