← Index

io.github.rsmdt/multimodal

io.github.rsmdt/multimodal·v1.3.1·Image, Video & Audio

Multi-provider media generation — images, video, audio, and transcription via a unified interface

Trust verdict · v1 advisory · method
NOT YET SCREENEDno verdict on file

Verdict not yet evaluated for this tool. The semantic screen takes adversarial cases first; coverage rolls out as the corpus expands (15/150 labels to graduation). The deterministic conformance probe is built but has not yet run on the public corpus, so a recorded verdict here is REVIEW or UNVERIFIED, never a clearing ALLOW. Until a verdict is recorded, an agent should treat this tool as not-yet-cleared and fall back to its own checks. Method: the eval, four-state verdict, honest limits.

Own this server? Screen its description →

Environment variables
OPENAI_API_KEY
secret

OpenAI API key for image, video, audio generation and transcription

XAI_API_KEY
secret

xAI API key for image and video generation

GEMINI_API_KEY
secret

Google Gemini API key for image, video, and audio generation

ELEVENLABS_API_KEY
secret

ElevenLabs API key for audio generation and transcription

BFL_API_KEY
secret

BFL API key for FLUX image generation and editing

MEDIA_OUTPUT_DIR

Directory for saved media files (defaults to cwd)

MCP quality score · maturity, not trust · methodology
freshness
19
completeness
10
installability
25
documentation
15
stability
10
Alternatives in Image, Video & Audio