Quick Start: Claude Code
Connect the EvalHub MCP server to Claude Code in under 5 steps.
Option A: stdio transport (recommended)
Section titled “Option A: stdio transport (recommended)”Stdio is the simplest setup — Claude Code launches the MCP server as a child process.
Using the standalone binary
Section titled “Using the standalone binary”-
Install the MCP server
Download the binary, see Install the MCP server.
-
Create a config file
Terminal window mkdir -p ~/.evalhubcat > ~/.evalhub/config.yaml <<EOFbase_url: "https://evalhub.apps.my-cluster.example.com"token: "YOUR_TOKEN"tenant: "my-team"EOFReplace
base_urlandtokenwith your EvalHub instance details. For local mode, usehttp://localhost:8080and omit the token. -
Register the MCP server with Claude Code
Terminal window claude mcp add evalhub -- evalhub-mcp --config ~/.evalhub/config.yaml -
Verify the connection
Start Claude Code and ask:
List the available evaluation providersClaude should query the
evalhub://providersresource and return the list of registered providers.
Using the Python SDK
Section titled “Using the Python SDK”-
Install the SDK with MCP support
Terminal window pip install "eval-hub-sdk[mcp]" -
Configure the connection
Terminal window evalhub config set base_url https://evalhub.apps.my-cluster.example.comevalhub config set token YOUR_TOKENevalhub config set tenant my-team -
Register the MCP server with Claude Code
Terminal window claude mcp add evalhub -- evalhub mcpTo use a named profile:
Terminal window claude mcp add evalhub -- evalhub --profile agent mcp -
Verify the connection
Start Claude Code and ask:
List the available evaluation providers
Scope: project vs global
Section titled “Scope: project vs global”By default, claude mcp add registers the server for the current project only. To make it available globally:
claude mcp add -s user evalhub -- evalhub-mcp --config ~/.evalhub/config.yamlOption B: HTTP transport
Section titled “Option B: HTTP transport”Use HTTP when the MCP server runs remotely or you want to share a single server across multiple clients.
-
Start the MCP server
Terminal window evalhub-mcp --transport http --port 3001 --config ~/.evalhub/config.yaml -
Register with Claude Code
Terminal window claude mcp add evalhub --transport http http://localhost:3001/For a remote server, replace
localhostwith the server’s hostname. -
Verify the connection
Start Claude Code and ask:
What benchmarks are available?
Example interactions
Section titled “Example interactions”Once connected, try these with Claude Code:
Evaluate the model at https://my-model.example.com/v1 using the leaderboard-v2 collectionWhat's the status of job abc123?Cancel job abc123Walk me through an Evaluation-Driven Development workflow for a RAG application