io.github.shinpr/mcp-local-rag

v0.17.2 · Memory & RAG

Easy-to-setup local RAG server with minimal configuration

In-path gate · all MCP tools

Using io.github.shinpr/mcp-local-rag in Claude, Cursor, Gemini CLI, Cline, or Zed?

MCP tool contracts can change remotely with no version bump. The mcpindex gate pins each contract and HOLDs the call when the contract drifts, before your agent acts. It requires no credentials. This is not the package install for the server itself (use Install this server for that).

Install the mcpindex gate (one command)

Rewrites your MCP host config so each server launches behind the gate. Inspect first: curl -fsSL https://mcpindex.ai/install.sh | less

uv tool install mcpindex-gate && mcpindex-config-wire

Trust verdict · v1 advisory · method

REVIEW · status: PARTIAL · fresh until 2026-08-08 10:02 UTC

screened: 2026-07-09 · tier: scanned · granularity: description-level · source: registry

Semantic screen found no manipulation pattern in the description. Conformance probe not yet run.

mcpindex.integrity.description · pass · INFO

evidence: “No malicious instructions found” via static_description

Limits of this verdict

- Semantic screen only - the deterministic conformance probe has not run on this server
- Confidence is reported but not yet calibrated (v1)
- Screen reads the tool description, not the live behavior
- Posture: advisory
- Registry description only; no input schema
- Screen model: 8b

Semantic screen: an LLM judge reads the tool description for hidden instructions (status PARTIAL). A pass means the description is not lying, not that the tool is safe: a high-capability tool with an honest description still warrants caution. The deterministic conformance probe has not been run on this server yet, so the screen here is semantic-only. Posture: advisory. Confidences are reported but not yet calibrated (calibrated=false at v1). Full verdict history is not shown on this page.

That verdict was true at screening time (snapshot 2026-07-31).

Contracts can change after screening, with no version bump. The gate pins io.github.shinpr/mcp-local-rag’s tool contracts on first sight and holds any silent change before your agent acts - the check that keeps being true on Tuesday.
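
The pin-and-hold behavior described above can be sketched as follows. This is an illustrative model only, not the gate's actual code: the SHA-256 digest, the contract layout, the `search` tool name, and the `PASS` label are all assumptions made for the example. Only the HOLD outcome comes from this page.

```python
import hashlib
import json

HOLD = "HOLD"
PASS = "PASS"  # hypothetical label for "contract unchanged"


def contract_digest(contract):
    """Hash a tool contract in a key-order-independent way (assumed scheme)."""
    canonical = json.dumps(contract, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def check_call(pins, tool_name, contract):
    """Pin the contract on first sight; hold the call if it later differs."""
    digest = contract_digest(contract)
    pinned = pins.setdefault(tool_name, digest)  # the first sighting becomes the pin
    return PASS if pinned == digest else HOLD


pins = {}
original = {"name": "search", "description": "Search ingested documents"}
print(check_call(pins, "search", original))
print(check_call(pins, "search", {**original, "description": "Search and upload"}))
```

A held call leaves the pin untouched, so the original contract still passes afterwards.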

Embed this badge

A live verdict badge for your README or listing. It reflects the current screen, links back here, and updates when the verdict does.

Markdown

[![mcpindex](https://mcpindex.ai/api/v1/badge/io-github-shinpr-mcp-local-rag)](https://mcpindex.ai/server/io-github-shinpr-mcp-local-rag)

HTML

<a href="https://mcpindex.ai/server/io-github-shinpr-mcp-local-rag"><img src="https://mcpindex.ai/api/v1/badge/io-github-shinpr-mcp-local-rag" alt="mcpindex verdict" height="20" /></a>

Environment variables

BASE_DIR

Base directory for document storage (defaults to current working directory). Ignored when BASE_DIRS is set.

BASE_DIRS

JSON array of base directories (e.g. '["/a","/b"]'). Takes precedence over BASE_DIR.
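
The precedence between BASE_DIR and BASE_DIRS can be modeled with a short sketch. The function name `resolve_base_dirs` and the validation step are hypothetical; the server's own parsing may differ.

```python
import json
import os


def resolve_base_dirs(env):
    """Return the base directories implied by an environment mapping."""
    raw = env.get("BASE_DIRS")
    if raw:
        dirs = json.loads(raw)  # BASE_DIRS is a JSON array encoded as a string
        if not isinstance(dirs, list) or not all(isinstance(d, str) for d in dirs):
            raise ValueError("BASE_DIRS must be a JSON array of strings")
        return dirs
    # BASE_DIR is consulted only when BASE_DIRS is unset; default is the cwd.
    return [env.get("BASE_DIR") or os.getcwd()]


print(resolve_base_dirs({"BASE_DIRS": '["/a","/b"]', "BASE_DIR": "/c"}))
```

In a JSON host config the array has to be written as an escaped string value, for example `"BASE_DIRS": "[\"/a\",\"/b\"]"`.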

DB_PATH

Path to LanceDB database directory (defaults to ./lancedb/)

CACHE_DIR

Directory where Transformers.js models are cached (defaults to ./models/)

MODEL_NAME

Embedding model name (defaults to Xenova/all-MiniLM-L6-v2)

MAX_FILE_SIZE

Maximum file size in bytes (defaults to 104857600 / 100MB)

RAG_MAX_DISTANCE

Maximum distance threshold for filtering search results. Results with distance greater than this value will be excluded. Lower values mean stricter filtering (e.g., 0.5 for high relevance only)
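
As a rough illustration of the threshold, the filter keeps every result whose distance does not exceed the limit. The result shape (a dict with `text` and `distance` keys) is an assumption for this sketch, not the server's documented output.

```python
def filter_by_max_distance(results, max_distance=None):
    """Drop results whose distance is greater than max_distance."""
    if max_distance is None:  # unset: no distance filtering
        return list(results)
    return [r for r in results if r["distance"] <= max_distance]


hits = [
    {"text": "a", "distance": 0.2},
    {"text": "b", "distance": 0.5},
    {"text": "c", "distance": 0.9},
]
print([r["text"] for r in filter_by_max_distance(hits, 0.5)])
```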

RAG_GROUPING

Grouping mode for quality filtering. 'similar' returns only the most similar group (stops at first distance jump). 'related' includes related groups (stops at second distance jump). Unset means no grouping filter
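
The two grouping modes can be pictured with the sketch below. How the server detects a "distance jump" is not stated on this page, so the fixed `jump_threshold` parameter is purely hypothetical; only the stop-at-first versus stop-at-second behavior comes from the description.

```python
def group_filter(distances, mode=None, jump_threshold=0.1):
    """Keep the leading group(s) of sorted distances, cut at a distance jump."""
    ordered = sorted(distances)
    if mode is None:  # unset: no grouping filter
        return ordered
    allowed_jumps = {"similar": 0, "related": 1}[mode]
    kept, jumps_seen = [], 0
    for i, d in enumerate(ordered):
        # Assumed jump rule: the gap to the previous result exceeds the threshold.
        if i > 0 and d - ordered[i - 1] > jump_threshold:
            jumps_seen += 1
            if jumps_seen > allowed_jumps:
                break
        kept.append(d)
    return kept


scores = [0.10, 0.12, 0.40, 0.42, 0.80]
print(group_filter(scores, "similar"))
print(group_filter(scores, "related"))
```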

RAG_MAX_FILES

Maximum number of files to keep in search results. Results are filtered to include only chunks from the top N best-scoring files. For example, 1 returns only the single best-matching file's chunks. Unset means no file filtering.
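
A minimal sketch of the file filter follows. It assumes a file is ranked by its best (lowest-distance) chunk and that results are dicts with `file` and `distance` keys; the page does not specify either detail.

```python
def keep_top_files(results, max_files=None):
    """Keep only chunks that belong to the max_files best-scoring files."""
    if max_files is None:  # unset: no file filtering
        return list(results)
    best = {}
    for r in results:
        # Assumed ranking: a file's score is its lowest chunk distance.
        best[r["file"]] = min(best.get(r["file"], float("inf")), r["distance"])
    top = set(sorted(best, key=best.get)[:max_files])
    return [r for r in results if r["file"] in top]


hits = [
    {"file": "a.md", "distance": 0.3},
    {"file": "b.md", "distance": 0.1},
    {"file": "a.md", "distance": 0.2},
    {"file": "c.md", "distance": 0.5},
]
print([r["file"] for r in keep_top_files(hits, 1)])
```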

CHUNK_MIN_LENGTH

Minimum chunk length in characters (1-10000, defaults to 50). Chunks shorter than this threshold are filtered out during ingestion.
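
The setting can be illustrated with a small sketch. Rejecting out-of-range values with an error is an assumption here; the page gives the range and the default but not how invalid values are handled.

```python
def parse_chunk_min_length(raw=None):
    """Parse CHUNK_MIN_LENGTH, falling back to the documented default of 50."""
    if raw is None or raw == "":
        return 50
    value = int(raw)
    if not 1 <= value <= 10000:
        # Assumed behavior: reject values outside the documented range.
        raise ValueError("CHUNK_MIN_LENGTH must be between 1 and 10000")
    return value


def drop_short_chunks(chunks, min_length=50):
    """Filter out chunks shorter than min_length characters."""
    return [c for c in chunks if len(c) >= min_length]


print(drop_short_chunks(["short", "x" * 60], parse_chunk_min_length("50")))
```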

RAG_DEVICE

Execution device for the embedder (defaults to cpu). Passed straight to ONNX Runtime; see the Transformers.js device source for the supported backend names. If the requested device fails to initialize, the server throws an error.

RAG_DTYPE

Embedding quantization dtype for the embedder (defaults to fp32). Opt-in and pass-through; accepts any dtype the chosen model provides (fp32, fp16, q8, int8, ...). If the model has no variant for the requested dtype, the server throws an error. Changing this changes the embedding space — re-ingest existing data.

RAG_HYBRID_WEIGHT

Keyword boost factor for hybrid search (0.0-1.0, defaults to 0.6). 0 means semantic similarity only; higher values increase the keyword-match contribution to the final score.

MCP quality score · maturity, not trust · methodology

Dimensions: freshness, completeness, installability, documentation, stability

Alternatives in Memory & RAG

Fodda Knowledge Graphs

ai.fodda/mcp-server

Expert-curated knowledge, brand, research & earnings intelligence — 31 tools, 220+ graphs.

Achriom

com.achriom/achriom

The media memory layer for AI agents and their humans. Books, movies, music, shows, and anime.

SMRITI Memory

io.github.shivamtyagi18/smriti-memory

Neuro-inspired long-term memory for AI agents with semantic graph and consolidation.

Install this server

Claude Desktop (claude_desktop_config.json)

{
  "mcpServers": {
    "mcp-local-rag": {
      "command": "npx",
      "args": [
        "-y",
        "--",
        "mcp-local-rag"
      ],
      "env": {
        "BASE_DIR": "<base_dir>",
        "BASE_DIRS": "<base_dirs>",
        "DB_PATH": "<db_path>",
        "CACHE_DIR": "<cache_dir>",
        "MODEL_NAME": "<model_name>",
        "MAX_FILE_SIZE": "<max_file_size>",
        "RAG_MAX_DISTANCE": "<rag_max_distance>",
        "RAG_GROUPING": "<rag_grouping>",
        "RAG_MAX_FILES": "<rag_max_files>",
        "CHUNK_MIN_LENGTH": "<chunk_min_length>",
        "RAG_DEVICE": "<rag_device>",
        "RAG_DTYPE": "<rag_dtype>",
        "RAG_HYBRID_WEIGHT": "<rag_hybrid_weight>"
      }
    }
  }
}

Cursor (.cursor/mcp.json)

{
  "mcpServers": {
    "mcp-local-rag": {
      "command": "npx",
      "args": [
        "-y",
        "--",
        "mcp-local-rag"
      ],
      "env": {
        "BASE_DIR": "<base_dir>",
        "BASE_DIRS": "<base_dirs>",
        "DB_PATH": "<db_path>",
        "CACHE_DIR": "<cache_dir>",
        "MODEL_NAME": "<model_name>",
        "MAX_FILE_SIZE": "<max_file_size>",
        "RAG_MAX_DISTANCE": "<rag_max_distance>",
        "RAG_GROUPING": "<rag_grouping>",
        "RAG_MAX_FILES": "<rag_max_files>",
        "CHUNK_MIN_LENGTH": "<chunk_min_length>",
        "RAG_DEVICE": "<rag_device>",
        "RAG_DTYPE": "<rag_dtype>",
        "RAG_HYBRID_WEIGHT": "<rag_hybrid_weight>"
      }
    }
  }
}

Cline (cline_mcp_settings.json)

npx -y -- mcp-local-rag

Claude Code (claude mcp add)

claude mcp add --scope user mcp-local-rag -- npx -y -- mcp-local-rag

Gemini CLI (gemini mcp add)

gemini mcp add -s user mcp-local-rag npx -y -- mcp-local-rag

Verdict API

curl -s https://mcpindex.ai/api/v1/trust/server/io-github-shinpr-mcp-local-rag

Free-tier verdict as JSON: decision + dimensions + severity. Call it from your agent before it invokes a tool it just discovered.
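
An agent-side call could look roughly like the sketch below, using only the Python standard library. The `https://` scheme and the `decision` key are assumptions drawn from the description above; the actual response schema is not shown on this page, so the code reads the field defensively.

```python
import json
import urllib.request

VERDICT_URL = "https://mcpindex.ai/api/v1/trust/server/{slug}"


def fetch_verdict(slug, opener=urllib.request.urlopen, timeout=10):
    """Fetch the verdict JSON for a server slug and return it as a dict."""
    with opener(VERDICT_URL.format(slug=slug), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    verdict = fetch_verdict("io-github-shinpr-mcp-local-rag")
    # "decision" is an assumed key name; fall back gracefully if it is absent.
    print(verdict.get("decision", "<no decision field>"))
```

The `opener` parameter exists only so the function can be exercised without network access.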

Details

version: v0.17.2
category: Memory & RAG
verdict expires: 2026-08-08
quality: 80 / 100
operator: shinpr

Listing

Listed from the official MCP registry. Unclaimed by its maintainer.

Provenance

Each verdict is bound to a hash of the exact description it judged, so a re-crawl that changes the text produces a new record rather than silently inheriting this one. Verdict history is hash-chained and anchored to Bitcoin via OpenTimestamps (latest confirmed at block 960,209). Verify it yourself.
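
The binding and chaining idea can be sketched as below. The record layout, field names, and the use of SHA-256 are hypothetical, and the OpenTimestamps anchoring step is left out entirely; this only shows why a changed description yields a new record and why tampering with history is detectable.

```python
import hashlib
import json

GENESIS = "0" * 64  # hypothetical placeholder for "no previous record"


def _sha256(text):
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def _record_hash(body):
    return _sha256(json.dumps(body, sort_keys=True))


def append_verdict(history, description, decision):
    """Return a new history with a record bound to this exact description."""
    body = {
        "description_hash": _sha256(description),
        "decision": decision,
        "prev": history[-1]["record_hash"] if history else GENESIS,
    }
    return history + [{**body, "record_hash": _record_hash(body)}]


def verify_chain(history):
    """Check that every record hashes correctly and links to its predecessor."""
    prev = GENESIS
    for rec in history:
        body = {k: rec[k] for k in ("description_hash", "decision", "prev")}
        if rec["prev"] != prev or rec["record_hash"] != _record_hash(body):
            return False
        prev = rec["record_hash"]
    return True


chain = append_verdict([], "description v1", "pass")
chain = append_verdict(chain, "description v2", "pass")
print(verify_chain(chain))
```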
