Create Experiments in the UI

1

Step 1: Click New experiment

Go to Experiments and click New experiment.

Experiments page with New experiment button

2

Step 2: Select a dataset

Choose the dataset you want to run on.

Dataset selector in the New experiment flow

3

Step 3: Select task = Prompt

Pick Prompt as the task type.

4

Step 4: Select a prompt

Choose the prompt you want to test (and the version if applicable).

5

Step 5: Select evaluators

Select one or more evaluators to score outputs.

6

Step 6: Create and wait for the run to finish

Click Create. The run will process in the background. Wait until the status is complete, then inspect outputs and evaluator scores.

Experiment outputs and evaluator results

1

Step 1: Click New experiment

Go to Experiments and click New experiment.

2

Step 2: Select a dataset

Choose the dataset you want to run on.

3

Step 3: Select task = LLM generation (chat completion)

Pick LLM generation (chat completion) as the task type.

4

Step 4: Configure the model and parameters

Choose the model and set parameters like temperature and max tokens.

Model configuration (model and parameters)

5

Step 5: Select evaluators

Select one or more evaluators to score outputs.

6

Step 6: Create and wait for the run to finish

Click Create. The run will process in the background. Wait until the status is complete, then inspect outputs and evaluator scores.

1

Step 1: Click New experiment

Go to Experiments and click New experiment.

2

Step 2: Select a dataset

Choose the dataset you want to run on.

3

Step 3: Select task = Custom

Pick Custom as the task type.

4

Step 4: Select evaluators

Select one or more evaluators to score outputs.

5

Step 5: Create and wait for placeholders

Click Create. The system will create placeholder rows for each dataset entry.You can then fill outputs using the API flow (recommended) so evaluators run on your submitted results.

For custom tasks, the UI is usually used to monitor progress and review results, while outputs are submitted via API.

Get started

Features

Admin

Security

Resources

Help & Community

Create Experiments in the UI

What is Experiments?

Resources

Steps to use

Get started

Features

Admin

Security

Resources

Help & Community

​What is Experiments?

​Resources

​Steps to use

What is Experiments?

Resources

Steps to use