io.github.rsmdt/multimodal
Quality Score
83
/100
Multi-provider media generation — images, video, audio, and transcription via a unified interface
§01 Install
Claude Desktop (claude_desktop_config.json)
{
"mcpServers": {
"multimodal": {
"command": "npx",
"args": [
"-y",
"@r16t/multimodal-mcp"
],
"env": {
"OPENAI_API_KEY": "<your-openai_api_key>",
"XAI_API_KEY": "<your-xai_api_key>",
"GEMINI_API_KEY": "<your-gemini_api_key>",
"ELEVENLABS_API_KEY": "<your-elevenlabs_api_key>",
"BFL_API_KEY": "<your-bfl_api_key>",
"MEDIA_OUTPUT_DIR": "<media_output_dir>"
}
}
}
}Cursor (.cursor/mcp.json)
{
"mcpServers": {
"multimodal": {
"command": "npx",
"args": [
"-y",
"@r16t/multimodal-mcp"
],
"env": {
"OPENAI_API_KEY": "<your-openai_api_key>",
"XAI_API_KEY": "<your-xai_api_key>",
"GEMINI_API_KEY": "<your-gemini_api_key>",
"ELEVENLABS_API_KEY": "<your-elevenlabs_api_key>",
"BFL_API_KEY": "<your-bfl_api_key>",
"MEDIA_OUTPUT_DIR": "<media_output_dir>"
}
}
}
}Cline (cline_mcp_settings.json)
npx -y @r16t/multimodal-mcp§02 Environment variables
OPENAI_API_KEYsecret
OpenAI API key for image, video, audio generation and transcription
XAI_API_KEYsecret
xAI API key for image and video generation
GEMINI_API_KEYsecret
Google Gemini API key for image, video, and audio generation
ELEVENLABS_API_KEYsecret
ElevenLabs API key for audio generation and transcription
BFL_API_KEYsecret
BFL API key for FLUX image generation and editing
MEDIA_OUTPUT_DIRDirectory for saved media files (defaults to cwd)
§03 MCP Quality Score · methodology
freshness
23
completeness
10
installability
25
documentation
15
stability
10
§04 Alternatives in Image, Video & Audio
inference.sh
ac.inference.sh/mcp
Run 150+ AI apps — image, video, audio, LLMs, 3D and more. Browse, execute, stream results.
Contabo (VPS) MCP Server
ai.com.mcp/contabo
Contabo API (v1.0.0) as MCP tools for cloud provisioning, and management. Powered by HAPI MCP server
Filtrix AI MCP
ai.filtrix.mcp/filtrix-ai
Filtrix MCP for image/video generation. Portal: https://agent.filtrix.ai/