dev.safeprompt/mcp
Detect prompt injection, jailbreaks, and code injection in untrusted text before it reaches an LLM.
Verdict not yet evaluated for this tool. The semantic screen takes adversarial cases first; coverage rolls out as the corpus expands (15/150 labels to graduation). The deterministic conformance probe is built but has not yet run on the public corpus, so a recorded verdict here is REVIEW or UNVERIFIED, never a clearing ALLOW. Until a verdict is recorded, an agent should treat this tool as not-yet-cleared and fall back to its own checks. Method: the eval, four-state verdict, honest limits.
Own this server? Screen its description →
SAFEPROMPT_API_KEYSafePrompt API key from https://dashboard.safeprompt.dev
SAFEPROMPT_PROVIDERAPI base URL (default https://api.safeprompt.dev)
SAFEPROMPT_USER_IPValue sent as X-User-IP for threat-intel tracking (default 203.0.113.1)
Focused MCP server for OpenAI image/audio generation (v2.0.0). Wraps endpoints via HAPI CLI.
Public MCP server for the LLM Search Engine
Audit your brand's visibility across ChatGPT, Gemini, Claude, Perplexity + 6 more engines.