Eval Hub (0.3.0)

License: Apache 2.0

REST API server for evaluation backend orchestration.

Evaluations

Evaluation job management endpoints

Create Evaluation

Create and execute an evaluation request using the simplified benchmark schema.

Request Body schema: application/json
required

One of

name required	string The evaluation job name.
description	string The evaluation job description.
tags	Array of strings The evaluation job tags.
model required	object (ModelRef) The model to evaluate.
benchmarks required	Array of objects (EvaluationBenchmarkConfig) The evaluation benchmarks to run.
pass_criteria	object (PassCriteria) The overall pass criteria for the evaluation job.
experiment	object (ExperimentConfig) The MLflow experiment configuration. When provided, the evaluation job is tracked in MLflow.
exports	object (EvaluationExports) Optional exports configuration for the evaluation job. When provided, the evaluation job results are exported to the specified location.
custom	object Custom request data. This can be used for user-specific job data.
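
As an illustration of the request body described above, the following Python sketch assembles a payload from the required fields (`name`, `model`, `benchmarks`) and a few optional ones. The helper name `build_evaluation_request`, the example values, and the choice to reject an empty benchmark list are assumptions made for this example; they are not part of the Eval Hub API.

```python
import json
from typing import Any, Optional


def build_evaluation_request(
    name: str,
    model: dict[str, Any],
    benchmarks: list[dict[str, Any]],
    description: Optional[str] = None,
    tags: Optional[list[str]] = None,
    pass_threshold: Optional[float] = None,
) -> dict[str, Any]:
    """Assemble a Create Evaluation body from the documented fields."""
    if not name:
        raise ValueError("name is required")
    if not model:
        raise ValueError("model is required")
    # Local assumption: an empty list is treated as a missing required field.
    if not benchmarks:
        raise ValueError("benchmarks is required")

    body: dict[str, Any] = {"name": name, "model": model, "benchmarks": benchmarks}
    # Optional fields are only included when the caller supplies them.
    if description is not None:
        body["description"] = description
    if tags:
        body["tags"] = tags
    if pass_threshold is not None:
        body["pass_criteria"] = {"threshold": pass_threshold}
    return body


# All values below are hypothetical placeholders.
request_body = build_evaluation_request(
    name="nightly-eval",
    model={"url": "http://models.example.internal/v1", "name": "example-model"},
    benchmarks=[{"id": "example-benchmark", "provider_id": "example-provider"}],
    tags=["nightly"],
    pass_threshold=0.7,
)
print(json.dumps(request_body, indent=2))
```

The resulting dictionary can be serialized and sent as the `application/json` request body with any HTTP client.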

Responses

Request samples

Payload

Content type

application/json

Example

EvaluationJobConfigBenchmarks

{"name": "string",
"description": "string",
"tags": ["string"
],
"model": {"url": "string",
"name": "string",
"parameters": { },
"auth": {"secret_ref": "string"
}
},
"benchmarks": [{"id": "string",
"provider_id": "string",
"weight": 1,
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
},
"parameters": { },
"test_data_ref": {"s3": {"bucket": "my-eval-bucket",
"key": "datasets/benchmark-a/v1",
"secret_ref": "my-s3-connection-secret"
}
}
}
],
"pass_criteria": {"threshold": 0
},
"experiment": {"name": "string",
"tags": [{"key": "string",
"value": "string"
}
],
"artifact_location": "string"
},
"exports": {"oci": {"coordinates": {"oci_host": "string",
"oci_repository": "string",
"oci_tag": "string",
"oci_subject": "string",
"annotations": {"property1": "string",
"property2": "string"
}
},
"k8s": {"connection": "string"
}
}
},
"custom": { }
}

Response samples

202
400
401
403
404

Content type

application/json

Example

EvaluationJobConfigBenchmarks

{"name": "string",
"description": "string",
"tags": ["string"
],
"model": {"url": "string",
"name": "string",
"parameters": { },
"auth": {"secret_ref": "string"
}
},
"benchmarks": [{"id": "string",
"provider_id": "string",
"weight": 1,
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
},
"parameters": { },
"test_data_ref": {"s3": {"bucket": "my-eval-bucket",
"key": "datasets/benchmark-a/v1",
"secret_ref": "my-s3-connection-secret"
}
}
}
],
"pass_criteria": {"threshold": 0
},
"experiment": {"name": "string",
"tags": [{"key": "string",
"value": "string"
}
],
"artifact_location": "string"
},
"exports": {"oci": {"coordinates": {"oci_host": "string",
"oci_repository": "string",
"oci_tag": "string",
"oci_subject": "string",
"annotations": {"property1": "string",
"property2": "string"
}
},
"k8s": {"connection": "string"
}
}
},
"custom": { },
"resource": {"id": "string",
"tenant": "string",
"created_at": "2019-08-24T14:15:22Z",
"updated_at": "2019-08-24T14:15:22Z",
"owner": "string",
"mlflow_experiment_id": "string"
},
"status": {"state": "pending",
"message": {"message": "string",
"message_code": "string"
},
"benchmarks": [{"provider_id": "string",
"id": "string",
"benchmark_index": 0,
"status": "pending",
"error_message": {"message": "string",
"message_code": "string"
},
"started_at": "2019-08-24T14:15:22Z",
"completed_at": "2019-08-24T14:15:22Z"
}
]
},
"results": {"benchmarks": [{"id": "string",
"provider_id": "string",
"benchmark_index": 0,
"metrics": { },
"artifacts": { },
"mlflow_run_id": "string",
"logs_path": "string",
"test": {"primary_score": 0.1,
"threshold": 0.1,
"pass": true
}
}
],
"mlflow_experiment_url": "string",
"test": {"score": 0.1,
"threshold": 0.1,
"pass": true
}
}
}

List Evaluations

List all evaluation requests.

query Parameters

limit	integer (Limit) [ 1 .. 100 ] Default: 50 Maximum number of evaluations to return
offset	integer (Offset) >= 0 Default: 0 Offset for pagination
status	string (Status Filter) Filter by status
name	string (Name) Name to search for
tags	string (Tags) Tags to search for
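
The documented bounds on `limit` and `offset` can be checked on the client before a request is sent. The sketch below builds the query string with those checks. The list path is an assumption inferred from the `GET /api/v1/evaluations/jobs/{id}` reference under Cancel Evaluation, and `list_evaluations_url` is a hypothetical helper name.

```python
from typing import Optional
from urllib.parse import urlencode

# Assumed list path, inferred from GET /api/v1/evaluations/jobs/{id}.
EVALUATIONS_PATH = "/api/v1/evaluations/jobs"


def list_evaluations_url(
    limit: int = 50,
    offset: int = 0,
    status: Optional[str] = None,
    name: Optional[str] = None,
    tags: Optional[str] = None,
) -> str:
    """Return the list URL, enforcing the documented parameter ranges."""
    if not 1 <= limit <= 100:
        raise ValueError("limit must be between 1 and 100")
    if offset < 0:
        raise ValueError("offset must be >= 0")
    params: dict[str, object] = {"limit": limit, "offset": offset}
    # Filters are only added when set, so the server defaults apply otherwise.
    for key, value in (("status", status), ("name", name), ("tags", tags)):
        if value is not None:
            params[key] = value
    return f"{EVALUATIONS_PATH}?{urlencode(params)}"


print(list_evaluations_url(limit=20, status="pending"))
```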

Responses

Response samples

200
400
401
403
404

Content type

application/json

{"first": {"href": "string"
},
"next": {"href": "string"
},
"limit": 0,
"total_count": 0,
"items": [{"resource": {"id": "string",
"tenant": "string",
"created_at": "2019-08-24T14:15:22Z",
"updated_at": "2019-08-24T14:15:22Z",
"owner": "string",
"mlflow_experiment_id": "string"
},
"status": {"state": "pending",
"message": {"message": "string",
"message_code": "string"
},
"benchmarks": [{"provider_id": "string",
"id": "string",
"benchmark_index": 0,
"status": "pending",
"error_message": {"message": "string",
"message_code": "string"
},
"started_at": "2019-08-24T14:15:22Z",
"completed_at": "2019-08-24T14:15:22Z"
}
]
},
"results": {"benchmarks": [{"id": "string",
"provider_id": "string",
"benchmark_index": 0,
"metrics": { },
"artifacts": { },
"mlflow_run_id": "string",
"logs_path": "string",
"test": {"primary_score": 0.1,
"threshold": 0.1,
"pass": true
}
}
],
"mlflow_experiment_url": "string",
"test": {"score": 0.1,
"threshold": 0.1,
"pass": true
}
},
"name": "string",
"description": "string",
"tags": ["string"
],
"model": {"url": "string",
"name": "string",
"parameters": { },
"auth": {"secret_ref": "string"
}
},
"benchmarks": [{"id": "string",
"provider_id": "string",
"weight": 1,
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
},
"parameters": { },
"test_data_ref": {"s3": {"bucket": "my-eval-bucket",
"key": "datasets/benchmark-a/v1",
"secret_ref": "my-s3-connection-secret"
}
}
}
],
"pass_criteria": {"threshold": 0
},
"experiment": {"name": "string",
"tags": [{"key": "string",
"value": "string"
}
],
"artifact_location": "string"
},
"exports": {"oci": {"coordinates": {"oci_host": "string",
"oci_repository": "string",
"oci_tag": "string",
"oci_subject": "string",
"annotations": {"property1": "string",
"property2": "string"
}
},
"k8s": {"connection": "string"
}
}
},
"custom": { }
}
],
"errors": ["string"
]
}

Get Evaluation

Returns the evaluation job resource with the current status and results.

path Parameters

id required	string (Id)

Responses

Response samples

200
400
401
403
404

Content type

application/json

Example

EvaluationJobConfigBenchmarks

{"name": "string",
"description": "string",
"tags": ["string"
],
"model": {"url": "string",
"name": "string",
"parameters": { },
"auth": {"secret_ref": "string"
}
},
"benchmarks": [{"id": "string",
"provider_id": "string",
"weight": 1,
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
},
"parameters": { },
"test_data_ref": {"s3": {"bucket": "my-eval-bucket",
"key": "datasets/benchmark-a/v1",
"secret_ref": "my-s3-connection-secret"
}
}
}
],
"pass_criteria": {"threshold": 0
},
"experiment": {"name": "string",
"tags": [{"key": "string",
"value": "string"
}
],
"artifact_location": "string"
},
"exports": {"oci": {"coordinates": {"oci_host": "string",
"oci_repository": "string",
"oci_tag": "string",
"oci_subject": "string",
"annotations": {"property1": "string",
"property2": "string"
}
},
"k8s": {"connection": "string"
}
}
},
"custom": { },
"resource": {"id": "string",
"tenant": "string",
"created_at": "2019-08-24T14:15:22Z",
"updated_at": "2019-08-24T14:15:22Z",
"owner": "string",
"mlflow_experiment_id": "string"
},
"status": {"state": "pending",
"message": {"message": "string",
"message_code": "string"
},
"benchmarks": [{"provider_id": "string",
"id": "string",
"benchmark_index": 0,
"status": "pending",
"error_message": {"message": "string",
"message_code": "string"
},
"started_at": "2019-08-24T14:15:22Z",
"completed_at": "2019-08-24T14:15:22Z"
}
]
},
"results": {"benchmarks": [{"id": "string",
"provider_id": "string",
"benchmark_index": 0,
"metrics": { },
"artifacts": { },
"mlflow_run_id": "string",
"logs_path": "string",
"test": {"primary_score": 0.1,
"threshold": 0.1,
"pass": true
}
}
],
"mlflow_experiment_url": "string",
"test": {"score": 0.1,
"threshold": 0.1,
"pass": true
}
}
}
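
A caller typically needs only the job state, the overall result, and the benchmarks that failed. The sketch below extracts those from a response shaped like the sample above. `summarize_results` is a hypothetical helper, and the `"completed"` state in the example data is an assumption; only `"pending"` appears in the samples on this page.

```python
from typing import Any


def summarize_results(job: dict[str, Any]) -> dict[str, Any]:
    """Reduce an evaluation job resource to its state and pass/fail outcome."""
    results = job.get("results") or {}
    overall = results.get("test") or {}
    # Collect the ids of benchmarks whose own test explicitly did not pass.
    failed = [
        b["id"]
        for b in results.get("benchmarks", [])
        if (b.get("test") or {}).get("pass") is False
    ]
    return {
        "state": (job.get("status") or {}).get("state"),
        "score": overall.get("score"),
        "passed": overall.get("pass"),
        "failed_benchmarks": failed,
    }


job = {
    "status": {"state": "completed"},  # assumed state name
    "results": {
        "benchmarks": [
            {"id": "bench-a", "test": {"primary_score": 0.9, "threshold": 0.5, "pass": True}},
            {"id": "bench-b", "test": {"primary_score": 0.2, "threshold": 0.5, "pass": False}},
        ],
        "test": {"score": 0.55, "threshold": 0.5, "pass": True},
    },
}
summary = summarize_results(job)
print(summary)
```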

Cancel Evaluation

Cancel a running evaluation.

path Parameters

id required	string (Id)

query Parameters

hard_delete	boolean (Hard Delete) Default: false If true, delete the evaluation job permanently so that GET /api/v1/evaluations/jobs/{id} returns a 404.
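
To show how `hard_delete` fits into a request, the sketch below constructs (but does not send) a cancellation request with the standard library. The use of the `DELETE` verb is an assumption, since this page does not state the HTTP method; the base URL and job id are placeholders.

```python
from urllib.parse import quote, urlencode
from urllib.request import Request


def cancel_evaluation_request(base_url: str, job_id: str, hard_delete: bool = False) -> Request:
    """Build a cancel request for /api/v1/evaluations/jobs/{id}."""
    url = f"{base_url.rstrip('/')}/api/v1/evaluations/jobs/{quote(job_id, safe='')}"
    # The parameter defaults to false, so it is only sent when enabled.
    if hard_delete:
        url += "?" + urlencode({"hard_delete": "true"})
    # Assumption: cancellation is exposed as DELETE on the job resource.
    return Request(url, method="DELETE")


req = cancel_evaluation_request("http://localhost:8080", "job-123", hard_delete=True)
print(req.get_method(), req.full_url)
```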

Responses

Response samples

400
401
403
404
409

Content type

application/json

{
  "message": "The field 'state' is not valid.",
  "message_code": "invalid_value",
  "trace": "b12692e1-8582-4628-88ca-7a13fefb73e2"
}
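
Error bodies share the three fields shown above, so a client can map them to a small typed object. This is a sketch only: the `ApiError` class is hypothetical, and treating `trace` as optional is an assumption.

```python
import json
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ApiError:
    message: str
    message_code: str
    trace: Optional[str] = None  # assumed optional


def parse_error(body: str) -> ApiError:
    """Parse a JSON error body into an ApiError."""
    data = json.loads(body)
    return ApiError(
        message=data["message"],
        message_code=data["message_code"],
        trace=data.get("trace"),
    )


error = parse_error(
    '{"message": "The field \'state\' is not valid.", '
    '"message_code": "invalid_value", '
    '"trace": "b12692e1-8582-4628-88ca-7a13fefb73e2"}'
)
print(error.message_code, error.trace)
```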

Collections

Benchmark collection management endpoints

List Collections

List all benchmark collections.

query Parameters

limit	integer (Limit) [ 1 .. 100 ] Default: 50 Maximum number of collections to return
offset	integer (Offset) >= 0 Default: 0 Offset for pagination
name	string (Name) Name to search for
category	string (Category) Category to search for
tags	string (Tags) Tags to search for
scope	string (Scope of collections) Enum: "system" "tenant" Set to `system` to get only system defined collections, or `tenant` to get only user defined collections. If `scope` is not provided, both system and user defined collections will be returned.

Responses

Response samples

200
400
401
403
404

Content type

application/json

{"first": {"href": "string"
},
"next": {"href": "string"
},
"limit": 0,
"total_count": 0,
"items": [{"resource": {"id": "string",
"tenant": "string",
"created_at": "2019-08-24T14:15:22Z",
"updated_at": "2019-08-24T14:15:22Z",
"owner": "string"
},
"name": "string",
"category": "string",
"description": "string",
"tags": ["string"
],
"custom": { },
"pass_criteria": {"threshold": 0
},
"benchmarks": [{"id": "string",
"provider_id": "string",
"url": "string",
"weight": 1,
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
},
"parameters": { },
"test_data_ref": {"s3": {"bucket": "my-eval-bucket",
"key": "datasets/benchmark-a/v1",
"secret_ref": "my-s3-connection-secret"
}
}
}
]
}
]
}

Create Collection

Create a new collection.

Request Body schema: application/json
required

name required	string Collection name.
category required	string Collection category.
description	string Optional description.
tags	Array of strings Tags.
custom	object Custom key-value data.
pass_criteria	object (PassCriteria) Pass criteria for the collection.
benchmarks required	Array of objects (CollectionBenchmarkConfig) Benchmarks in the collection.

Responses

Request samples

Payload

Content type

application/json

{"name": "string",
"category": "string",
"description": "string",
"tags": ["string"
],
"custom": { },
"pass_criteria": {"threshold": 0
},
"benchmarks": [{"id": "string",
"provider_id": "string",
"url": "string",
"weight": 1,
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
},
"parameters": { },
"test_data_ref": {"s3": {"bucket": "my-eval-bucket",
"key": "datasets/benchmark-a/v1",
"secret_ref": "my-s3-connection-secret"
}
}
}
]
}

Response samples

201
400
401
403
404

Content type

application/json

{"resource": {"id": "string",
"tenant": "string",
"created_at": "2019-08-24T14:15:22Z",
"updated_at": "2019-08-24T14:15:22Z",
"owner": "string"
},
"name": "string",
"category": "string",
"description": "string",
"tags": ["string"
],
"custom": { },
"pass_criteria": {"threshold": 0
},
"benchmarks": [{"id": "string",
"provider_id": "string",
"url": "string",
"weight": 1,
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
},
"parameters": { },
"test_data_ref": {"s3": {"bucket": "my-eval-bucket",
"key": "datasets/benchmark-a/v1",
"secret_ref": "my-s3-connection-secret"
}
}
}
]
}

Get Collection

Get details of a specific collection.

path Parameters

id required	string (Collection Id)

Responses

Response samples

200
400
401
403
404

Content type

application/json

{"resource": {"id": "string",
"tenant": "string",
"created_at": "2019-08-24T14:15:22Z",
"updated_at": "2019-08-24T14:15:22Z",
"owner": "string"
},
"name": "string",
"category": "string",
"description": "string",
"tags": ["string"
],
"custom": { },
"pass_criteria": {"threshold": 0
},
"benchmarks": [{"id": "string",
"provider_id": "string",
"url": "string",
"weight": 1,
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
},
"parameters": { },
"test_data_ref": {"s3": {"bucket": "my-eval-bucket",
"key": "datasets/benchmark-a/v1",
"secret_ref": "my-s3-connection-secret"
}
}
}
]
}

Update Collection

Update an existing collection.

path Parameters

id required	string (Collection Id)

Request Body schema: application/json
required

name required	string Collection name.
category required	string Collection category.
description	string Optional description.
tags	Array of strings Tags.
custom	object Custom key-value data.
pass_criteria	object (PassCriteria) Pass criteria for the collection.
benchmarks required	Array of objects (CollectionBenchmarkConfig) Benchmarks in the collection.

Responses

Request samples

Payload

Content type

application/json

{"name": "string",
"category": "string",
"description": "string",
"tags": ["string"
],
"custom": { },
"pass_criteria": {"threshold": 0
},
"benchmarks": [{"id": "string",
"provider_id": "string",
"url": "string",
"weight": 1,
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
},
"parameters": { },
"test_data_ref": {"s3": {"bucket": "my-eval-bucket",
"key": "datasets/benchmark-a/v1",
"secret_ref": "my-s3-connection-secret"
}
}
}
]
}

Response samples

200
400
401
403
404

Content type

application/json

{"resource": {"id": "string",
"tenant": "string",
"created_at": "2019-08-24T14:15:22Z",
"updated_at": "2019-08-24T14:15:22Z",
"owner": "string"
},
"name": "string",
"category": "string",
"description": "string",
"tags": ["string"
],
"custom": { },
"pass_criteria": {"threshold": 0
},
"benchmarks": [{"id": "string",
"provider_id": "string",
"url": "string",
"weight": 1,
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
},
"parameters": { },
"test_data_ref": {"s3": {"bucket": "my-eval-bucket",
"key": "datasets/benchmark-a/v1",
"secret_ref": "my-s3-connection-secret"
}
}
}
]
}

Patch Collection

Partially update an existing collection.

path Parameters

id required	string (Collection Id)

Request Body schema: application/json
required

Array

op required	string (PatchOp) Enum: "replace" "add" "remove" Patch operation type
path required	string JSON Pointer path
value	any Value for add/replace (omit for remove)
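
The patch body is an array of operations in the style of JSON Patch. The sketch below builds such operations and enforces the rules listed above: `op` must be one of the three enum values, and `value` is omitted for `remove`. Requiring `value` for `add` and `replace`, and rejecting paths that do not start with `/`, are assumptions of this example. The field names in the paths come from the collection schema.

```python
import json
from typing import Any

_MISSING = object()  # sentinel so that None can be sent as a real value
PATCH_OPS = {"replace", "add", "remove"}


def patch_op(op: str, path: str, value: Any = _MISSING) -> dict[str, Any]:
    """Build one patch operation, validating op, path, and value."""
    if op not in PATCH_OPS:
        raise ValueError(f"unsupported op: {op!r}")
    # Simplification: only non-empty JSON Pointers are accepted here.
    if not path.startswith("/"):
        raise ValueError("path must be a JSON Pointer starting with '/'")
    if op == "remove":
        if value is not _MISSING:
            raise ValueError("remove takes no value")
        return {"op": op, "path": path}
    if value is _MISSING:
        raise ValueError(f"{op} requires a value")
    return {"op": op, "path": path, "value": value}


patch_body = [
    patch_op("replace", "/description", "Updated description"),
    patch_op("add", "/tags/0", "safety"),
    patch_op("remove", "/custom"),
]
print(json.dumps(patch_body, indent=2))
```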

Responses

Request samples

Payload

Content type

application/json

[
  {
    "op": "replace",
    "path": "string",
    "value": null
  }
]

Response samples

200
400
401
403
404

Content type

application/json

{"resource": {"id": "string",
"tenant": "string",
"created_at": "2019-08-24T14:15:22Z",
"updated_at": "2019-08-24T14:15:22Z",
"owner": "string"
},
"name": "string",
"category": "string",
"description": "string",
"tags": ["string"
],
"custom": { },
"pass_criteria": {"threshold": 0
},
"benchmarks": [{"id": "string",
"provider_id": "string",
"url": "string",
"weight": 1,
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
},
"parameters": { },
"test_data_ref": {"s3": {"bucket": "my-eval-bucket",
"key": "datasets/benchmark-a/v1",
"secret_ref": "my-s3-connection-secret"
}
}
}
]
}

Delete Collection

Delete a collection.

path Parameters

id required	string (Collection Id)

Responses

Response samples

400
401
403
404

Content type

application/json

{
  "message": "The field 'state' is not valid.",
  "message_code": "invalid_value",
  "trace": "b12692e1-8582-4628-88ca-7a13fefb73e2"
}

Providers

Evaluation provider endpoints

List Providers

List all registered evaluation providers.

query Parameters

limit	integer (Limit) [ 1 .. 100 ] Default: 50 Maximum number of providers to return
offset	integer (Offset) >= 0 Default: 0 Offset for pagination
benchmarks	boolean (Benchmarks) Default: true Include or exclude benchmarks supported by this provider in the response
name	string (Name) Name to search for
tags	string (Tags) Tags to search for
scope	string (Scope of providers) Enum: "system" "tenant" Set to `system` to get only system defined providers, or `tenant` to get only user defined providers. If `scope` is not provided, both system and user defined providers will be returned.

Responses

Response samples

200
400
401
403
404

Content type

application/json

{"first": {"href": "string"
},
"next": {"href": "string"
},
"limit": 0,
"total_count": 0,
"items": [{"resource": {"id": "string",
"tenant": "string",
"created_at": "2019-08-24T14:15:22Z",
"updated_at": "2019-08-24T14:15:22Z",
"owner": "string"
},
"name": "string",
"title": "string",
"description": "string",
"tags": ["string"
],
"runtime": {"k8s": {"image": "string",
"entrypoint": ["string"
],
"cpu_request": "string",
"memory_request": "string",
"cpu_limit": "string",
"memory_limit": "string",
"env": [{"name": "string",
"value": "string"
}
]
},
"local": {"command": "string",
"env": [{"name": "string",
"value": "string"
}
]
}
},
"benchmarks": [{"id": "string",
"url": "string",
"name": "string",
"description": "string",
"category": "string",
"metrics": ["string"
],
"num_few_shot": 0,
"dataset_size": 0,
"tags": ["string"
],
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
}
}
]
}
],
"errors": ["string"
]
}

Create a new provider scoped to the current tenant (Bring Your Own Provider)

Request Body schema: application/json
required

name required	string Provider name
title	string Provider display title
description	string Provider description
tags	Array of strings Provider tags
runtime required	object (Runtime) Provider runtime configuration
benchmarks required	Array of objects (BenchmarkResource) Benchmarks offered by this provider
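
The provider body nests its runtime settings under `runtime.k8s` or `runtime.local`. The sketch below assembles a Kubernetes-backed provider and drops unset optional fields before serialization. It assumes that supplying only the `k8s` runtime is acceptable and that resource strings use Kubernetes quantity notation; neither is stated on this page. `K8sRuntime`, `build_provider`, and all example values are hypothetical.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import Any, Optional


@dataclass
class K8sRuntime:
    image: str
    entrypoint: list[str] = field(default_factory=list)
    cpu_request: Optional[str] = None
    memory_request: Optional[str] = None
    cpu_limit: Optional[str] = None
    memory_limit: Optional[str] = None
    env: list[dict[str, str]] = field(default_factory=list)


def _prune(value: Any) -> Any:
    """Recursively drop None values and empty containers from dicts."""
    if isinstance(value, dict):
        pruned = {k: _prune(v) for k, v in value.items()}
        return {k: v for k, v in pruned.items() if v not in (None, [], {})}
    if isinstance(value, list):
        return [_prune(v) for v in value]
    return value


def build_provider(
    name: str,
    runtime: K8sRuntime,
    benchmarks: list[dict[str, Any]],
    title: Optional[str] = None,
) -> dict[str, Any]:
    """Assemble a provider body with a k8s runtime."""
    body = {
        "name": name,
        "title": title,
        "runtime": {"k8s": asdict(runtime)},
        "benchmarks": benchmarks,
    }
    return _prune(body)


provider = build_provider(
    name="my-provider",
    runtime=K8sRuntime(
        image="registry.example.com/evals/runner:latest",
        cpu_request="500m",  # assumed Kubernetes quantity notation
        memory_request="1Gi",
    ),
    benchmarks=[{"id": "bench-1", "name": "Bench 1"}],
)
print(json.dumps(provider, indent=2))
```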

Responses

Request samples

Payload

Content type

application/json

{"name": "string",
"title": "string",
"description": "string",
"tags": ["string"
],
"runtime": {"k8s": {"image": "string",
"entrypoint": ["string"
],
"cpu_request": "string",
"memory_request": "string",
"cpu_limit": "string",
"memory_limit": "string",
"env": [{"name": "string",
"value": "string"
}
]
},
"local": {"command": "string",
"env": [{"name": "string",
"value": "string"
}
]
}
},
"benchmarks": [{"id": "string",
"url": "string",
"name": "string",
"description": "string",
"category": "string",
"metrics": ["string"
],
"num_few_shot": 0,
"dataset_size": 0,
"tags": ["string"
],
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
}
}
]
}

Response samples

201
400
401
403
404

Content type

application/json

{"resource": {"id": "string",
"tenant": "string",
"created_at": "2019-08-24T14:15:22Z",
"updated_at": "2019-08-24T14:15:22Z",
"owner": "string"
},
"name": "string",
"title": "string",
"description": "string",
"tags": ["string"
],
"runtime": {"k8s": {"image": "string",
"entrypoint": ["string"
],
"cpu_request": "string",
"memory_request": "string",
"cpu_limit": "string",
"memory_limit": "string",
"env": [{"name": "string",
"value": "string"
}
]
},
"local": {"command": "string",
"env": [{"name": "string",
"value": "string"
}
]
}
},
"benchmarks": [{"id": "string",
"url": "string",
"name": "string",
"description": "string",
"category": "string",
"metrics": ["string"
],
"num_few_shot": 0,
"dataset_size": 0,
"tags": ["string"
],
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
}
}
]
}

Get Provider

Get a provider by ID.

path Parameters

id required	string (Provider Id) Provider ID

Responses

Response samples

200
400
401
403
404

Content type

application/json

{"resource": {"id": "string",
"tenant": "string",
"created_at": "2019-08-24T14:15:22Z",
"updated_at": "2019-08-24T14:15:22Z",
"owner": "string"
},
"name": "string",
"title": "string",
"description": "string",
"tags": ["string"
],
"runtime": {"k8s": {"image": "string",
"entrypoint": ["string"
],
"cpu_request": "string",
"memory_request": "string",
"cpu_limit": "string",
"memory_limit": "string",
"env": [{"name": "string",
"value": "string"
}
]
},
"local": {"command": "string",
"env": [{"name": "string",
"value": "string"
}
]
}
},
"benchmarks": [{"id": "string",
"url": "string",
"name": "string",
"description": "string",
"category": "string",
"metrics": ["string"
],
"num_few_shot": 0,
"dataset_size": 0,
"tags": ["string"
],
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
}
}
]
}

Update Provider

Update an existing provider.

path Parameters

id required	string (Provider Id) Provider ID

Request Body schema: application/json
required

name required	string Provider name
title	string Provider display title
description	string Provider description
tags	Array of strings Provider tags
runtime required	object (Runtime) Provider runtime configuration
benchmarks required	Array of objects (BenchmarkResource) Benchmarks offered by this provider

Responses

Request samples

Payload

Content type

application/json

{"name": "string",
"title": "string",
"description": "string",
"tags": ["string"
],
"runtime": {"k8s": {"image": "string",
"entrypoint": ["string"
],
"cpu_request": "string",
"memory_request": "string",
"cpu_limit": "string",
"memory_limit": "string",
"env": [{"name": "string",
"value": "string"
}
]
},
"local": {"command": "string",
"env": [{"name": "string",
"value": "string"
}
]
}
},
"benchmarks": [{"id": "string",
"url": "string",
"name": "string",
"description": "string",
"category": "string",
"metrics": ["string"
],
"num_few_shot": 0,
"dataset_size": 0,
"tags": ["string"
],
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
}
}
]
}

Response samples

200
400
401
403
404

Content type

application/json

{"resource": {"id": "string",
"tenant": "string",
"created_at": "2019-08-24T14:15:22Z",
"updated_at": "2019-08-24T14:15:22Z",
"owner": "string"
},
"name": "string",
"title": "string",
"description": "string",
"tags": ["string"
],
"runtime": {"k8s": {"image": "string",
"entrypoint": ["string"
],
"cpu_request": "string",
"memory_request": "string",
"cpu_limit": "string",
"memory_limit": "string",
"env": [{"name": "string",
"value": "string"
}
]
},
"local": {"command": "string",
"env": [{"name": "string",
"value": "string"
}
]
}
},
"benchmarks": [{"id": "string",
"url": "string",
"name": "string",
"description": "string",
"category": "string",
"metrics": ["string"
],
"num_few_shot": 0,
"dataset_size": 0,
"tags": ["string"
],
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
}
}
]
}

Patch Provider

Partially update an existing provider.

path Parameters

id required	string (Provider Id)

Request Body schema: application/json
required

Array

op required	string (PatchOp) Enum: "replace" "add" "remove" Patch operation type
path required	string JSON Pointer path
value	any Value for add/replace (omit for remove)

Responses

Request samples

Payload

Content type

application/json

[
  {
    "op": "replace",
    "path": "string",
    "value": null
  }
]

Response samples

200
400
401
403
404

Content type

application/json

{"resource": {"id": "string",
"tenant": "string",
"created_at": "2019-08-24T14:15:22Z",
"updated_at": "2019-08-24T14:15:22Z",
"owner": "string"
},
"name": "string",
"title": "string",
"description": "string",
"tags": ["string"
],
"runtime": {"k8s": {"image": "string",
"entrypoint": ["string"
],
"cpu_request": "string",
"memory_request": "string",
"cpu_limit": "string",
"memory_limit": "string",
"env": [{"name": "string",
"value": "string"
}
]
},
"local": {"command": "string",
"env": [{"name": "string",
"value": "string"
}
]
}
},
"benchmarks": [{"id": "string",
"url": "string",
"name": "string",
"description": "string",
"category": "string",
"metrics": ["string"
],
"num_few_shot": 0,
"dataset_size": 0,
"tags": ["string"
],
"primary_score": {"metric": "string",
"lower_is_better": false
},
"pass_criteria": {"threshold": 0
}
}
]
}

Delete Provider

Delete provider by ID.

path Parameters

id required	string (Provider Id) Provider ID

Responses

Response samples

400
401
403
404

Content type

application/json

{
  "message": "The field 'state' is not valid.",
  "message_code": "invalid_value",
  "trace": "b12692e1-8582-4628-88ca-7a13fefb73e2"
}

Health

Health check endpoints

Health Check

Health check endpoint.

Responses

Response samples

200
400
401
403
404

Content type

application/json

{
  "status": "string",
  "version": "string",
  "timestamp": "2019-08-24T14:15:22Z",
  "components": {
    "property1": { },
    "property2": { }
  },
  "uptime": 0,
  "active_evaluations": 0
}
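
A monitoring script might read the health response along these lines. The sketch parses the timestamp into a timezone-aware `datetime` and lists the component names. The `"healthy"` status value and the component names in the example are assumptions, since the sample above only shows placeholder strings.

```python
from datetime import datetime, timezone
from typing import Any


def parse_health(payload: dict[str, Any]) -> dict[str, Any]:
    """Extract the main fields of a health check response."""
    # Normalize the trailing "Z" so fromisoformat also works before Python 3.11.
    ts = datetime.fromisoformat(payload["timestamp"].replace("Z", "+00:00"))
    return {
        "status": payload["status"],
        "version": payload["version"],
        "timestamp": ts,
        "active_evaluations": payload.get("active_evaluations", 0),
        "components": sorted(payload.get("components", {})),
    }


health = parse_health(
    {
        "status": "healthy",  # assumed value
        "version": "0.3.0",
        "timestamp": "2019-08-24T14:15:22Z",
        "components": {"storage": {}, "mlflow": {}},  # hypothetical names
        "uptime": 120,
        "active_evaluations": 2,
    }
)
print(health["timestamp"].isoformat(), health["components"])
```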
