Overview
The list_evaluation_reports method retrieves a list of evaluation reports, with optional filtering and pagination. It is useful for monitoring evaluation history and tracking performance over time.
Method Signature
Synchronous
Asynchronous
Parameters
Parameter | Type | Required | Description |
---|---|---|---|
dataset_id | str | No | Filter by specific dataset ID |
status | str | No | Filter by evaluation status (running, completed, failed) |
limit | int | No | Maximum number of reports to return (default: 50) |
offset | int | No | Number of reports to skip for pagination (default: 0) |
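The way these four parameters interact can be sketched with a small in-memory model. This is not the SDK's implementation: the function below is a hypothetical local stand-in that takes a plain list of report dictionaries as an extra first argument, and the behavior of next_offset on the last page (None here) is an assumption. Only the parameter names, their defaults, and the response keys come from this page.

```python
from typing import Any, Optional


def list_evaluation_reports(
    all_reports: list[dict[str, Any]],  # hypothetical: stands in for server-side storage
    dataset_id: Optional[str] = None,
    status: Optional[str] = None,
    limit: int = 50,
    offset: int = 0,
) -> dict[str, Any]:
    """Local model of the documented filters and pagination (illustrative only)."""
    # Filters are optional; when both are given, a report must match both.
    matching = [
        r for r in all_reports
        if (dataset_id is None or r["dataset_id"] == dataset_id)
        and (status is None or r["status"] == status)
    ]
    # offset skips reports, limit caps the page size.
    page = matching[offset:offset + limit]
    end = offset + len(page)
    has_more = end < len(matching)
    return {
        "reports": page,
        "total_count": len(matching),  # counts all matches, not just this page
        "has_more": has_more,
        "next_offset": end if has_more else None,  # assumption: None on the last page
    }


# Illustrative sample data, not real evaluation results.
sample = [
    {"evaluation_id": "e1", "dataset_id": "ds-a", "status": "completed"},
    {"evaluation_id": "e2", "dataset_id": "ds-a", "status": "failed"},
    {"evaluation_id": "e3", "dataset_id": "ds-b", "status": "completed"},
]
result = list_evaluation_reports(sample, dataset_id="ds-a", limit=1)
```

In this model, total_count reflects every report that matches the filters, while reports holds only the requested page.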
Returns
Returns a dictionary containing the list of evaluation reports and pagination information.
Examples
Basic Usage
Filter by Dataset
Filter by Status
With Pagination
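A minimal sketch of walking through every page by following has_more and next_offset. FakeClient is a hypothetical stand-in so the loop can run locally, and iter_all_reports is an illustrative helper, not part of the SDK. Only the method name, its limit and offset parameters, and the response keys are taken from this page.

```python
from typing import Any, Iterator


class FakeClient:
    """Hypothetical stand-in that serves five reports from memory."""

    def __init__(self) -> None:
        self._reports = [
            {"evaluation_id": f"eval-{i}", "status": "completed"} for i in range(5)
        ]

    def list_evaluation_reports(self, limit: int = 50, offset: int = 0) -> dict[str, Any]:
        page = self._reports[offset:offset + limit]
        end = offset + len(page)
        has_more = end < len(self._reports)
        return {
            "reports": page,
            "total_count": len(self._reports),
            "has_more": has_more,
            "next_offset": end if has_more else None,  # assumption: None on the last page
        }


def iter_all_reports(client: Any, page_size: int = 50) -> Iterator[dict[str, Any]]:
    """Yield every report, requesting pages until has_more is false."""
    offset = 0
    while True:
        response = client.list_evaluation_reports(limit=page_size, offset=offset)
        yield from response["reports"]
        if not response["has_more"]:
            break
        offset = response["next_offset"]


all_ids = [r["evaluation_id"] for r in iter_all_reports(FakeClient(), page_size=2)]
```

Using next_offset from the response, rather than adding the page size to the previous offset, keeps the loop correct even if a page comes back shorter than requested.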
Asynchronous Usage
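One reason to prefer the asynchronous variant is that several filtered queries can run concurrently. The sketch below assumes an async client exposing a coroutine with the same name and parameters; FakeAsyncClient and count_by_status are hypothetical, and the real async client or method name may differ.

```python
import asyncio
from typing import Any, Optional


class FakeAsyncClient:
    """Hypothetical async stand-in backed by an in-memory list."""

    _REPORTS = [
        {"evaluation_id": "eval-1", "status": "completed"},
        {"evaluation_id": "eval-2", "status": "failed"},
        {"evaluation_id": "eval-3", "status": "running"},
        {"evaluation_id": "eval-4", "status": "failed"},
    ]

    async def list_evaluation_reports(
        self, status: Optional[str] = None, limit: int = 50, offset: int = 0
    ) -> dict[str, Any]:
        await asyncio.sleep(0)  # yield to the event loop, as a network call would
        matching = [r for r in self._REPORTS if status is None or r["status"] == status]
        page = matching[offset:offset + limit]
        return {
            "reports": page,
            "total_count": len(matching),
            "has_more": offset + len(page) < len(matching),
        }


async def count_by_status(client: Any, statuses: list[str]) -> dict[str, int]:
    """Issue one query per status concurrently and collect total_count for each."""
    responses = await asyncio.gather(
        *(client.list_evaluation_reports(status=s, limit=1) for s in statuses)
    )
    return {s: resp["total_count"] for s, resp in zip(statuses, responses)}


counts = asyncio.run(count_by_status(FakeAsyncClient(), ["running", "completed", "failed"]))
```

Requesting limit=1 keeps each response small, since only total_count is read here.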
Combined Filtering
Error Handling
Response Structure
The response includes:
- reports: List of evaluation report summaries
- total_count: Total number of reports matching the filters
- has_more: Whether more reports are available
- next_offset: Offset for the next page

Each report summary includes:
- evaluation_id: Unique identifier
- dataset_id: Associated dataset ID
- status: Current status
- overall_score: Aggregate score (if completed)
- created_at: Creation timestamp
- completed_at: Completion timestamp (if applicable)
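For type-checked code, the fields listed above could be modeled as typed dictionaries. The sketch below is an assumption-laden illustration: the class names are hypothetical, and the concrete types (timestamps as strings, overall_score as a float, None for values that are not yet available) are guesses that this page does not confirm.

```python
from typing import Optional, TypedDict


class ReportSummary(TypedDict):
    evaluation_id: str
    dataset_id: str
    status: str                     # running, completed, or failed
    overall_score: Optional[float]  # assumption: None until the evaluation completes
    created_at: str                 # assumption: timestamp serialized as a string
    completed_at: Optional[str]     # assumption: None while still running


class ListReportsResponse(TypedDict):
    reports: list[ReportSummary]
    total_count: int
    has_more: bool
    next_offset: Optional[int]      # assumption: None when there is no next page


def failed_ids(response: ListReportsResponse) -> list[str]:
    """Return the IDs of failed evaluations on this page."""
    return [r["evaluation_id"] for r in response["reports"] if r["status"] == "failed"]


# Illustrative values only, not real API output.
example: ListReportsResponse = {
    "reports": [
        {
            "evaluation_id": "eval-1",
            "dataset_id": "ds-a",
            "status": "completed",
            "overall_score": 0.9,
            "created_at": "2024-01-01T00:00:00Z",
            "completed_at": "2024-01-01T00:05:00Z",
        },
        {
            "evaluation_id": "eval-2",
            "dataset_id": "ds-a",
            "status": "failed",
            "overall_score": None,
            "created_at": "2024-01-02T00:00:00Z",
            "completed_at": None,
        },
    ],
    "total_count": 2,
    "has_more": False,
    "next_offset": None,
}
```

Calling failed_ids(example) picks out the entries whose status is failed, which is the kind of lookup the debugging use case relies on.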
Common Use Cases
- Monitoring evaluation history across datasets
- Tracking model performance trends
- Finding failed evaluations for debugging
- Generating performance dashboards
- Audit an