MCP Catalogs
Home

gemini-mcp

by RLabs-Inc·196·Score 51

MCP Server enabling Claude to interact with Google's Gemini models, offering text processing, image generation, video creation, and document analysis.

ai-llmdeveloper-toolsmedia
39
Forks
6
Open issues
2 mo ago
Last commit
2d ago
Indexed

Overview

This is a robust MCP server that facilitates integration between Claude and Google's Gemini AI systems. It offers an extensive collection of 37 tools spanning text processing, image generation with multi-turn editing capabilities, video creation using Veo, document analysis, and web search functionality. The server provides well-documented installation options for both Claude integration and standalone CLI usage, with extensive configuration options including tool presets to reduce context usage.

Try asking AI

After installing, here are 5 things you can ask your AI assistant:

you:Content creation with image generation and editing workflows
you:Research and analysis using web search, document processing, and summarization tools
you:Development assistance through code analysis and brainstorming between Claude and Gemini
you:Which Gemini models are supported?
you:Can I limit which tools are loaded?

When to choose this

Choose gemini-mcp when you need to integrate Gemini's capabilities (especially image/video generation) into Claude workflows, or when you want a versatile AI assistant with both text and media generation tools.

When NOT to choose this

Avoid this if you need commercial support, as it's a community project without formal backing, or if you're concerned about vendor lock-in to Google's AI ecosystem.

Tools this server exposes

12 tools extracted from the README
  • gemini-queryprompt, model, thinkingLevel?

    Direct queries to Gemini with thinking level control

  • gemini-generate-imageprompt, style?, aspectRatio?, imageSize?, useGoogleSearch?, thinkingLevel?, personGeneration?, seed?

    Generate images with Nano Banana Pro

  • gemini-start-image-editprompt, aspectRatio?, imageSize?, useGoogleSearch?, thinkingLevel?, personGeneration?, seed?

    Start a multi-turn image editing session

  • gemini-continue-image-editsessionId, prompt

    Continue refining an image from an editing session

  • gemini-analyze-codecode, language?, focus?

    Analyze code for quality, security, or issues

  • gemini-analyze-texttext, type?

    Analyze text for sentiment, entities, or summaries

  • gemini-brainstormprompt, claudeThoughts?, maxRounds?

    Collaborative brainstorming with Gemini

  • gemini-summarizecontent, length?, format?

    Summarize content with different detail levels

  • gemini-run-codeprompt, data?

    Let Gemini write and execute Python code

  • gemini-searchquery, returnCitations?

    Real-time web search with citations

  • gemini-youtubeurl, question, startTime?, endTime?

    Analyze YouTube videos with timestamp clipping

  • gemini-analyze-documentfilePath, question, mediaResolution?

    Analyze PDFs and documents with table extraction

Comparable tools

openai-mcpanthropic-mcppalm-mcpvertex-ai-mcp

Installation

# Using npm (Recommended)
claude mcp add gemini -s user -- env GEMINI_API_KEY=YOUR_KEY npx -y @rlabs-inc/gemini-mcp

# Using bun
claude mcp add gemini -s user -- env GEMINI_API_KEY=YOUR_KEY bunx @rlabs-inc/gemini-mcp

Or install globally:

npm install -g @rlabs-inc/gemini-mngcli config set api-key YOUR_KEY

FAQ

Which Gemini models are supported?
The server supports Gemini 3 Pro and Flash models for text queries, Nano Banana Pro for image generation, and Veo 2.0 for video creation.
Can I limit which tools are loaded?
Yes, you can use preset configurations (minimal, text, image, research, media, full) or specify individual tools via the GEMINI_ENABLED_TOOLS environment variable.

Compare gemini-mcp with

GitHub →

Last updated · Auto-generated from public README + GitHub signals.