Authentication
All endpoints require API key authentication:Path Parameters
Parameter | Type | Description |
---|---|---|
evaluator_id | string | The unique ID of the evaluator |
Examples
Response
Status: 200 OKLLM Evaluator Response
Human Categorical Evaluator Response
Code Evaluator Response
Response Fields
Field | Type | Description |
---|---|---|
id | string | Unique evaluator identifier |
name | string | Display name of the evaluator |
evaluator_slug | string | URL-friendly identifier |
type | string | Evaluator type: llm , human , or code |
score_value_type | string | Score format: numerical , boolean , categorical , or comment |
eval_class | string | Pre-built template class (if used) |
description | string | Description of the evaluator |
configurations | object | Type-specific configuration settings |
categorical_choices | array | Choices for categorical evaluators (null for non-categorical) |
created_by | object | User who created the evaluator |
updated_by | object | User who last updated the evaluator |
created_at | string | ISO timestamp of creation |
updated_at | string | ISO timestamp of last update |
custom_required_fields | array | Additional required fields |
starred | boolean | Whether the evaluator is starred |
organization | integer | Organization ID |
tags | array | Tags associated with the evaluator |
Configuration Fields by Type
LLM Evaluators (type: "llm"
)
Field | Type | Description |
---|---|---|
evaluator_definition | string | The evaluation prompt/instruction with template variables |
scoring_rubric | string | Description of the scoring criteria |
llm_engine | string | LLM model to use (e.g., “gpt-4o-mini”, “gpt-4o”) |
model_options | object | LLM parameters like temperature, max_tokens |
min_score | number | Minimum possible score |
max_score | number | Maximum possible score |
passing_score | number | Score threshold for passing |
Code Evaluators (type: "code"
)
Field | Type | Description |
---|---|---|
eval_code_snippet | string | Python code with evaluate() function |
Human Evaluators (type: "human"
)
- No specific configuration fields
- Use
categorical_choices
field whenscore_value_type
is"categorical"