io.github.hidai25/evalview-mcp
Quality Score
80
/100
Regression testing for AI agents. Golden baselines, CI/CD, LangGraph, CrewAI, OpenAI, Claude.
§01 Install
Claude Desktop (uvx)
{
"mcpServers": {
"evalview-mcp": {
"command": "uvx",
"args": [
"evalview"
],
"env": {
"OPENAI_API_KEY": "<your-openai_api_key>"
}
}
}
}§02 Environment variables
OPENAI_API_KEYsecret
OpenAI API key for LLM-as-judge output quality scoring. Optional — deterministic tool/sequence evaluation works without it.
§03 MCP Quality Score · methodology
freshness
25
completeness
10
installability
25
documentation
15
stability
5
§04 Alternatives in AI & LLMs
OpenAI Tools MCP Server
ai.com.mcp/openai-tools
Focused MCP server for OpenAI image/audio generation (v2.0.0). Wraps endpoints via HAPI CLI.
ai.llmse/mcp
ai.llmse/mcp
Public MCP server for the LLM Search Engine
Perplexity API Platform
ai.perplexity/mcp-server
Real-time web search, reasoning, and research through Perplexity's API