A spreadsheet-style editor for running prompts and models across multiple test cases. Import testsets to easily test, evaluate, and optimize your LLM outputs.

Prerequisites

  • You have already created at least one prompt in Keywords AI. Learn how to create a prompt here.
  • You have added variables to your prompt. Learn how to add variables to your prompt here.

Steps

1

Create a testset (optional)

Go to Testsets and create a new testset, you can either import a CSV file or create a blank testset. You can check Testsets to know more about testsets.

2

Create an experiment

Go to Experiments and create a new experiment.

Then you should select the prompt and the versions you want to test.

3

Add test cases

Then you should add test cases for your experiment. You can either add test cases manually or import a testset from Testsets.

4

Run the experiment

Now you can run the experiment. You can run a single cell by clicking the Run button in the each cell, or run all the cells by clicking the Run all button.

5

Run evaluations for outputs

After the experiment is finished, you can run evaluations for the outputs. You can check out Run evaluations in the UI to learn how to run evaluations.