Async logging (recommended)
The Async logging endpoint allows you to directly log an LLM inference to Keywords AI, instead of using Keywords AI as a proxy with the chat completion endpoint.
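For orientation, here is a minimal sketch of an async log call in Python. The endpoint path, header names, and exact payload keys are assumptions for illustration only; check the API reference for the authoritative schema.

```python
# Minimal sketch of an async log request. The endpoint path and field
# names are assumptions for illustration, not the authoritative schema.
import requests

url = "https://api.keywordsai.co/api/request-logs/create/"  # assumed endpoint path
headers = {
    "Authorization": "Bearer YOUR_KEYWORDSAI_API_KEY",  # placeholder key
    "Content-Type": "application/json",
}
payload = {
    "model": "gpt-4o-mini",
    "prompt_messages": [{"role": "user", "content": "What is async logging?"}],
    "completion_message": {
        "role": "assistant",
        "content": "Async logging records an LLM call after it has completed.",
    },
    "prompt_tokens": 12,
    "completion_tokens": 18,
    "cost": 0.000045,        # US dollars
    "generation_time": 1.2,  # seconds
    "ttft": 0.3,             # seconds
}

response = requests.post(url, headers=headers, json=payload)
print(response.status_code)
```

The parameters below describe the fields such a payload can carry.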
Model used for the LLM inference. Default is an empty string. See the list of models here.
An array of prompt messages. Default is an empty list.
Completion message in JSON format. Default is an empty dictionary.
Cost of the inference in US dollars.
Number of tokens in the completion.
Pass this parameter in if you want to log your self-hosted / fine-tuned model.
Parameters related to the customer. Default is an empty dictionary.
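As a rough sketch, a customer params object might look like the following; the key names are assumptions for illustration, not the authoritative schema.

```python
# Hypothetical customer params object; key names are assumptions.
customer_params = {
    "customer_identifier": "user_123",  # your own ID for the end user
    "name": "Jane Doe",
    "email": "jane@example.com",
}
```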
Error message if the LLM inference failed. Default is an empty string.
The full request object. Default is an empty dictionary. This is optional and is helpful for logging configurations such as temperature, presence_penalty, etc. completion_messages and tool_calls will be automatically extracted from full_request (see the sketch below).
Specify how much to penalize new tokens based on their existing frequency in the text so far. Decreases the model's likelihood of repeating the same line verbatim.
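A sketch of the full_request field described above, with field names assumed for illustration:

```python
# Hypothetical full_request object: the raw request that produced the
# completion, so settings like temperature and the penalty parameters are
# preserved. completion_messages and tool_calls would be extracted from it.
full_request = {
    "model": "gpt-4o-mini",
    "messages": [{"role": "user", "content": "What is async logging?"}],
    "temperature": 0.7,
    "presence_penalty": 0.2,
    "frequency_penalty": 0.1,
}
payload = {"full_request": full_request}  # merged with the other log fields
```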
Total generation time. Generation time = TTFT (Time To First Token) + TPOT (Time Per Output Token) * #tokens. Do not confuse this with ttft.
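A worked example of the formula, with illustrative timings:

```python
# Generation time = TTFT + TPOT * number of output tokens (illustrative values).
ttft = 0.25              # seconds until the first token
tpot = 0.02              # seconds per output token
completion_tokens = 50
generation_time = ttft + tpot * completion_tokens  # 0.25 + 1.0 = 1.25 seconds
```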
Use this parameter to control the behavior of the Keywords AI API. Default is an empty dictionary.
You can add any key-value pair to this metadata field for your reference.
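For example, a metadata object could carry arbitrary keys for later filtering; the keys below are purely illustrative.

```python
# Arbitrary key-value metadata attached to the log; keys are illustrative.
metadata = {
    "session_id": "abc-123",
    "environment": "staging",
    "feature": "summarization",
}
```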
Specify how much to penalize new tokens based on whether they appear in the text so far. Increases the model’s likelihood of talking about new topics.
Number of tokens in the prompt.
Pass this parameter in if you want to log your self-hosted / fine-tuned model.
The format of the response.
Whether the LLM inference was streamed. Default is false.
The status code of the LLM inference. Default is 200 (OK). See supported status codes here.
Stop sequence.
Controls randomness in the output in the range of 0-2; a higher temperature produces a more random response.
A list of tools the model may call. Currently, only functions are supported as a tool.
Controls which (if any) tool is called by the model. none means the model will not call any tool and instead generates a message.
An alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered.
We generally recommend altering this or temperature but not both.
Time to first token. The time it takes for the model to generate the first token after receiving a request.
Any warnings that occurred during the LLM inference. You could pass a warning message here. Default is an empty string.