← Index

RAGScore

io.github.HZYAI/ragscore·v0.8.6·AI & LLMs

Generate QA datasets & evaluate RAG systems with failure diagnosis. Any LLM.

Trust verdict · v1 advisory · method
NOT YET SCREENEDno verdict on file

Verdict not yet evaluated for this tool. The semantic screen takes adversarial cases first; coverage rolls out as the corpus expands (15/150 labels to graduation). The deterministic conformance probe is built but has not yet run on the public corpus, so a recorded verdict here is REVIEW or UNVERIFIED, never a clearing ALLOW. Until a verdict is recorded, an agent should treat this tool as not-yet-cleared and fall back to its own checks. Method: the eval, four-state verdict, honest limits.

Own this server? Screen its description →

Environment variables
OPENAI_API_KEY
secret

OpenAI API key (if using OpenAI provider)

ANTHROPIC_API_KEY
secret

Anthropic API key (if using Anthropic provider)

MCP quality score · maturity, not trust · methodology
freshness
25
completeness
15
installability
25
documentation
15
stability
5
Alternatives in AI & LLMs