← Index

io.github.rsmdt/multimodal

io.github.rsmdt/multimodal·v1.3.1·Image, Video & Audio
Quality Score
83
/100

Multi-provider media generation — images, video, audio, and transcription via a unified interface

§01  Install
Claude Desktop (claude_desktop_config.json)
{
  "mcpServers": {
    "multimodal": {
      "command": "npx",
      "args": [
        "-y",
        "@r16t/multimodal-mcp"
      ],
      "env": {
        "OPENAI_API_KEY": "<your-openai_api_key>",
        "XAI_API_KEY": "<your-xai_api_key>",
        "GEMINI_API_KEY": "<your-gemini_api_key>",
        "ELEVENLABS_API_KEY": "<your-elevenlabs_api_key>",
        "BFL_API_KEY": "<your-bfl_api_key>",
        "MEDIA_OUTPUT_DIR": "<media_output_dir>"
      }
    }
  }
}
Cursor (.cursor/mcp.json)
{
  "mcpServers": {
    "multimodal": {
      "command": "npx",
      "args": [
        "-y",
        "@r16t/multimodal-mcp"
      ],
      "env": {
        "OPENAI_API_KEY": "<your-openai_api_key>",
        "XAI_API_KEY": "<your-xai_api_key>",
        "GEMINI_API_KEY": "<your-gemini_api_key>",
        "ELEVENLABS_API_KEY": "<your-elevenlabs_api_key>",
        "BFL_API_KEY": "<your-bfl_api_key>",
        "MEDIA_OUTPUT_DIR": "<media_output_dir>"
      }
    }
  }
}
Cline (cline_mcp_settings.json)
npx -y @r16t/multimodal-mcp
§02  Environment variables
OPENAI_API_KEY
secret

OpenAI API key for image, video, audio generation and transcription

XAI_API_KEY
secret

xAI API key for image and video generation

GEMINI_API_KEY
secret

Google Gemini API key for image, video, and audio generation

ELEVENLABS_API_KEY
secret

ElevenLabs API key for audio generation and transcription

BFL_API_KEY
secret

BFL API key for FLUX image generation and editing

MEDIA_OUTPUT_DIR

Directory for saved media files (defaults to cwd)

§03  MCP Quality Score  ·  methodology
freshness
23
completeness
10
installability
25
documentation
15
stability
10
§04  Alternatives in Image, Video & Audio