io.github.hidai25/evalview-mcp

io.github.hidai25/evalview-mcp·v0.6.0·AI & LLMs

Quality Score

/100

Regression testing for AI agents. Golden baselines, CI/CD, LangGraph, CrewAI, OpenAI, Claude.

Repository →

§01 Install

Claude Desktop (uvx)

{
  "mcpServers": {
    "evalview-mcp": {
      "command": "uvx",
      "args": [
        "evalview"
      ],
      "env": {
        "OPENAI_API_KEY": "<your-openai_api_key>"
      }
    }
  }
}

§02 Environment variables

OPENAI_API_KEY

secret

OpenAI API key for LLM-as-judge output quality scoring. Optional — deterministic tool/sequence evaluation works without it.

§03 MCP Quality Score · methodology

freshness

completeness

installability

documentation

stability

§04 Alternatives in AI & LLMs

OpenAI Tools MCP Server

ai.com.mcp/openai-tools

Focused MCP server for OpenAI image/audio generation (v2.0.0). Wraps endpoints via HAPI CLI.

ai.llmse/mcp

Public MCP server for the LLM Search Engine

Perplexity API Platform

ai.perplexity/mcp-server

Real-time web search, reasoning, and research through Perplexity's API