io.github.rsmdt/multimodal

io.github.rsmdt/multimodalv1.3.1Image, Video & Audio

Multi-provider media generation — images, video, audio, and transcription via a unified interface

In-path gate · all MCP tools

Using io.github.rsmdt/multimodal in Claude, Cursor, Gemini CLI, Cline, or Zed?

MCP tool contracts can change remotely with no version bump. The mcpindex gate pins each contract and HOLDs the call when it drifts-before your agent acts. Zero credentials. This is not the package install for this server itself (use Install this server for that).

Install the mcpindex gate (one command)

Rewrites your MCP host config so each server launches behind the gate. Inspect first: curl -fsSL https://mcpindex.ai/install.sh | less

uv tool install mcpindex-gate && mcpindex-config-wire

Auditable install path →Watch it hold a drift →

Trust verdict · v1 advisory · method

REVIEWstatus: PARTIALfresh until 2026-08-01 19:02 UTC

screened 2026-07-02tier: scannedgranularity: description-levelsource: registry

Semantic screen found no manipulation pattern in the description. Conformance probe not yet run.

mcpindex.integrity.descriptionpassINFO

evidence“No malicious instructions found”via static_description

Limits of this verdict

- Semantic screen only - the deterministic conformance probe has not run on this server
- Confidence is reported but not yet calibrated (v1)
- Screen reads the tool description, not the live behavior
- advisory
- registry description only no input schema
- screen model 8b

Semantic screen: an LLM judge reads the tool description for hidden instructions (status PARTIAL). A pass means the description is not lying, not that the tool is safe: a high-capability tool with an honest description still warrants caution. The deterministic conformance probe has not been run on this server yet, so the screen here is semantic-only. Posture: advisory. Confidences are reported but not yet calibrated (calibrated=false at v1). Full verdict history is not shown on this page.

Own this server? Screen its description →

That verdict was true at screening time (snapshot 2026-07-31).

Contracts can change after screening, with no version bump. The gate pins io.github.rsmdt/multimodal’s tool contracts on first sight and holds any silent change before your agent acts - the check that keeps being true on Tuesday.

See your first HOLD in 2 minutes →

Embed this badge

A live verdict badge for your README or listing. It reflects the current screen, links back here, and updates when the verdict does.

Markdown

[![mcpindex](https://mcpindex.ai/api/v1/badge/io-github-rsmdt-multimodal)](https://mcpindex.ai/server/io-github-rsmdt-multimodal)

HTML

<a href="https://mcpindex.ai/server/io-github-rsmdt-multimodal"><img src="https://mcpindex.ai/api/v1/badge/io-github-rsmdt-multimodal" alt="mcpindex verdict" height="20" /></a>

Environment variables

OPENAI_API_KEY

secret

OpenAI API key for image, video, audio generation and transcription

XAI_API_KEY

secret

xAI API key for image and video generation

GEMINI_API_KEY

secret

Google Gemini API key for image, video, and audio generation

ELEVENLABS_API_KEY

secret

ElevenLabs API key for audio generation and transcription

BFL_API_KEY

secret

BFL API key for FLUX image generation and editing

MEDIA_OUTPUT_DIR

Directory for saved media files (defaults to cwd)

MCP quality score · maturity, not trust · methodology

freshness

completeness

installability

documentation

stability

Alternatives in Image, Video & Audio

ModelRunner

ai.modelrunner/mcp

Run 100+ AI models — image, video, audio, 3D — through one API with pay-per-use billing.

Glif

app.glif/glif

Generate images, video, and audio with Glif's media-generation agent

io.github.runapi-builder/imagen-4-mcp

RunAPI MCP server for Imagen 4: create tasks, poll status, check pricing.

Install this server

Claude Desktop (claude_desktop_config.json)

{
  "mcpServers": {
    "multimodal": {
      "command": "npx",
      "args": [
        "-y",
        "--",
        "@r16t/multimodal-mcp"
      ],
      "env": {
        "OPENAI_API_KEY": "<your-openai_api_key>",
        "XAI_API_KEY": "<your-xai_api_key>",
        "GEMINI_API_KEY": "<your-gemini_api_key>",
        "ELEVENLABS_API_KEY": "<your-elevenlabs_api_key>",
        "BFL_API_KEY": "<your-bfl_api_key>",
        "MEDIA_OUTPUT_DIR": "<media_output_dir>"
      }
    }
  }
}

Cursor (.cursor/mcp.json)

{
  "mcpServers": {
    "multimodal": {
      "command": "npx",
      "args": [
        "-y",
        "--",
        "@r16t/multimodal-mcp"
      ],
      "env": {
        "OPENAI_API_KEY": "<your-openai_api_key>",
        "XAI_API_KEY": "<your-xai_api_key>",
        "GEMINI_API_KEY": "<your-gemini_api_key>",
        "ELEVENLABS_API_KEY": "<your-elevenlabs_api_key>",
        "BFL_API_KEY": "<your-bfl_api_key>",
        "MEDIA_OUTPUT_DIR": "<media_output_dir>"
      }
    }
  }
}

Cline (cline_mcp_settings.json)

npx -y -- @r16t/multimodal-mcp

Claude Code (claude mcp add)

claude mcp add --scope user multimodal -- npx -y -- @r16t/multimodal-mcp

Gemini CLI (gemini mcp add)

gemini mcp add -s user multimodal npx -y -- @r16t/multimodal-mcp

Verdict API

curl -s mcpindex.ai/api/v1/trust/server/io-github-rsmdt-multimodal

Free-tier verdict as JSON: decision + dimensions + severity. Call it from your agent before it invokes a tool it just discovered.

Details

version: v1.3.1
category: Image, Video & Audio
verdict expires: 2026-08-01
quality: 76 / 100
operator: rsmdt

Listing

Listed from the official MCP registry. Unclaimed by its maintainer. Maintainer? Email us →

Provenance

Each verdict is bound to a hash of the exact description it judged, so a re-crawl that changes the text produces a new record rather than silently inheriting this one. Verdict history is hash-chained and anchored to Bitcoin via OpenTimestamps (latest confirmed at block 960,209). Verify it yourself.

Links

Repository →