Prompt version
You can create a prompt version by sending a POST request to the prompt versions endpoint. The request should include the list of messages for the prompt version, the model you want to use, and other optional parameters. After you create a prompt version, you can deploy it as the live version by setting the deploy parameter to true.
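For illustration, here is a minimal sketch of such a request in Python. The base URL, endpoint path, authentication header, and most field names are assumptions inferred from the parameter descriptions below; substitute the values from your own account and the API reference.

```python
# Minimal sketch of creating and deploying a prompt version.
# The endpoint URL, auth header, and field names other than "deploy" and
# "prompt_id" are assumptions -- check them against the API reference.
import requests

API_KEY = "YOUR_API_KEY"        # placeholder credential
PROMPT_ID = "YOUR_PROMPT_ID"    # obtained from the prompts endpoint

payload = {
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Summarize the following text: {{text}}"},
    ],
    "model": "gpt-4o",          # example model name
    "description": "Summarization prompt, first draft",
    "temperature": 0.7,
    "max_tokens": 512,
    "deploy": True,             # deploy this version as the live version
}

response = requests.post(
    f"https://api.example.com/v1/prompts/{PROMPT_ID}/versions",  # assumed URL shape
    headers={"Authorization": f"Bearer {API_KEY}"},
    json=payload,
)
response.raise_for_status()
print(response.json())
```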
Get the prompt_id from the prompts endpoint.
The list of messages for the prompt version. If you want to add a variable, you can use the following format: {{variable_name}} (see the sketch at the end of this section).
Specify the model you want to use in this version.
Description of the prompt version.
Whether responses for this prompt version should be streamed. Default is false.
The temperature of the model.
The maximum number of tokens to generate.
The nucleus sampling probability.
Specify how much to penalize new tokens based on their existing frequency in the text so far. Decreases the model's likelihood of repeating the same line verbatim.
Specify how much to penalize new tokens based on whether they appear in the text so far. Increases the model's likelihood of talking about new topics.
The list of variables for the prompt version. You can use these variables in the messages.
The list of fallback models for the prompt version. Check out fallback models for more information.
The list of models to load balance the prompt version across. Check out load balancing for more information.
The list of tools to use for the prompt version. Check out tools for more information.
Whether to deploy this version as the live version. Default is false.
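As a sketch of how variables tie the messages and the variables parameter together, the snippet below templates a message with {{variable_name}} placeholders. The field names and the shape of the variables list are assumptions; consult the API reference for the exact schema.

```python
# Sketch of templated messages plus the variables list.
# Field names and the exact shape of "variables" are assumptions.
payload = {
    "messages": [
        {"role": "user", "content": "Translate {{text}} into {{language}}."},
    ],
    "model": "gpt-4o",
    # Variables referenced as {{...}} in the messages above; shown here as a
    # plain list of names, which is an assumption about the schema.
    "variables": ["text", "language"],
}
```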