ensemble-mcp docs
Presentation GitHub

Ensemble mcp User Documentation

Ensemble mcp is a Python MCP server providing vector memory, drift detection, model routing, skills discovery, session management, codebase indexing, context compression, and a local web dashboard for AI-assisted development pipelines.

All processing is 100% local — ONNX Runtime embeddings (~5ms), numpy cosine similarity, SQLite storage. Zero LLM or cloud API calls.


For Users

Get started with installing and configuring Ensemble mcp for your AI coding tool.

GuideDescription
Getting Started5-minute quick start — install, register, verify
InstallationDetailed install: pip, source, Docker, system requirements
AI Tool CompatibilityWhich AI tools work, what gets installed, which MCP tools each uses
CLI ReferenceAll commands: serve, web, install, uninstall, add-agents, add-skills
ConfigurationConfig files, layering, all settings with defaults
MCP Client SetupPer-tool registration: OpenCode, Claude Code, Copilot, Cursor, Windsurf, Devin CLI
Web DashboardDashboard usage, features, and JSON API endpoints
TroubleshootingCommon issues, error codes, and fixes

For Developers

Integrate Ensemble mcp into AI agent pipelines or contribute to the project.

GuideDescription
Tool ReferenceAll 19 MCP tools: parameters, types, response schemas, examples
Integration GuidePipeline patterns: pre/mid/post pipeline tool usage
Architecture OverviewSystem design, subpackages, data flow, extension points

19 MCP Tools at a Glance

CategoryToolsPurpose
Patternspatterns_search, patterns_store, patterns_pruneSemantic memory of past solutions
Driftdrift_checkDetect scope drift during implementation
Routingmodel_recommendChoose model tier per agent and task
Skillsskills_discover, skills_suggest, skills_generateFind, suggest, and create reusable skills
Sessionsession_save, session_load, session_searchPipeline checkpoints with resume support
Indexerproject_index, project_query, project_dependencies, project_snapshotCodebase intelligence
Compresscontext_compress, context_prepareToken-efficient context optimization
Utilityhealth, resetServer status and data management