Prompt version
You can update a prompt version by sending a PATCH request to the prompt version endpoint. The request can include the name of the prompt version; the lists of fallback models, load balance models, and tools; and whether to deploy this version as the live version.
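A minimal sketch of such a request in Python follows. The endpoint URL, the bearer-token header, and the field names in the payload are illustrative assumptions, not the documented contract; substitute the real values from your API.

```python
import requests

# Hypothetical endpoint and credentials -- substitute your own values.
url = "https://api.example.com/api/prompts/prompt_123/versions/2"
headers = {"Authorization": "Bearer YOUR_API_KEY"}

# Field names here are assumptions based on the parameters described below.
payload = {
    "description": "Tightened system prompt",  # description of the prompt version
    "stream": False,                           # default is false
    "deploy": True,                            # deploy this version as the live version
}

response = requests.patch(url, headers=headers, json=payload)
response.raise_for_status()
print(response.json())
```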
The list of messages for the prompt version. If you want to add a variable, you can use the following format: {{variable_name}}.
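For example, a messages list that references a variable placeholder might look like this; the role/content shape is assumed to follow the common chat-message format, and {{customer_name}} is a hypothetical variable:

```python
# Assumed chat-message shape; {{customer_name}} is a hypothetical variable.
messages = [
    {"role": "system", "content": "You are a helpful support agent."},
    {"role": "user", "content": "Write a greeting for {{customer_name}}."},
]
```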
Specify the model you want to use in this version.
Description of the prompt version.
Whether the prompt version should be streamed. Default is false.
The temperature of the model.
The maximum number of tokens to generate.
The nucleus sampling probability.
Specify how much to penalize new tokens based on their existing frequency in the text so far. Decreases the model's likelihood of repeating the same line verbatim.
Specify how much to penalize new tokens based on whether they appear in the text so far. Increases the model's likelihood of talking about new topics.
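As an illustration, these sampling fields might be combined in the request body like so; the field names are assumed to mirror common completion-API parameters rather than taken from this reference:

```python
# Hypothetical field names mirroring common completion-API parameters.
sampling = {
    "temperature": 0.7,        # higher values make output more random
    "max_tokens": 512,         # cap on the number of generated tokens
    "top_p": 0.9,              # nucleus sampling probability
    "frequency_penalty": 0.5,  # discourage verbatim repetition
    "presence_penalty": 0.3,   # encourage new topics
}
```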
The list of variables for the prompt version. You can use these variables in the messages.
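Continuing the earlier example, the variables field would then supply a value for each placeholder used in the messages; the name-to-value mapping shape is an assumption:

```python
# Hypothetical mapping; keys must match the {{...}} placeholders in messages.
variables = {"customer_name": "Ada"}
```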
The list of fallback models for the prompt version. Check out fallback models for more information.
The list of models to load balance the prompt version. Check out load balancing for more information.
The list of tools to use for the prompt version. Check out tools for more information.
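A sketch of how these three fields might appear together in the payload. The exact field names, the weight format for load balancing, and the tool schema are all assumptions here, so check the linked fallback models, load balancing, and tools pages for the real shapes:

```python
# All field names and shapes below are illustrative assumptions.
routing = {
    "fallback_models": ["gpt-4o-mini", "claude-3-haiku"],  # tried in order on failure
    "load_balance_models": [
        {"model": "gpt-4o", "weight": 0.7},
        {"model": "gpt-4o-mini", "weight": 0.3},
    ],
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "get_weather",
                "description": "Look up current weather for a city.",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            },
        }
    ],
}
```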
Whether to deploy this version as the live version. Default is false.