Use Projects to organize your Testsets, Runs, and Results

Projects help you organize and navigate your Scorecard assets

Sometimes your Scorecard assets such as Testsets or Runs can multiply to the point that it’s hard to find what you’re looking for. Projects help you organize these assets into buckets of related work so that it’s easier to get your work done. Projects group Testsets together, as well as the Runs and Results that are generated from them.

Now, when you load the Scorecard app the homepage is the Projects page, where you will see all of your Organization’s projects. You can click into a Project to see all of the Testsets, Runs, and Results of that Project.

The New Projects Page
The New Projects Page

Schemas define how your Testsets are structured

A new feature of Projects are Schemas. Schemas are created at the Project level, and define the Input, Label, and Output Variables of your evaluation pipeline.

Defining a Schema

You define the Schema of a Project when you create a new Project. Once created, it is expected that all testsets within a Project adhere to the same Schema. If a testset, or certain testcase does not require an Input or Label Variable, it can simply be left blank.

What’s a Variable?

Variables are the individual components of testsets and runs. Testsets have Input and Label Variables. Runs have Output Variables. Variables are one of three types: text, file_url, or json_object.

Input Variables

Input Variables are the inputs to your systems. There are two default, required Input Variables in Scorecard testsets: user_query and context. These are the standard inputs to an LLM system. context is provided in the Context window, and the user_query is added to the Prompt template. You can define additional, custom Input Variables for your Testsets which allows you to define separate inputs in separate columns. For example, you might add an image_url Input Variable that allows your Prompt template to reference both the user_query and image_url in different places.

Label Variables

Label Variables are typically used to define ideal values for model outputs, which are useful in evaluation. Every testset has one required Label Variable in the Schema called ideal. You can optionally define additional Label Variables in your testset. For example, if you find that for a certain test case your system often provides a common, incorrect response, you could add an incorrect label which you can reference in your Metric.

Defining Custom Input and Label Variables During Project Creation
Defining Custom Input and Label Variables During Project Creation

Output Variables

Output Variables are the various oututs you expect from your system. Typically this is just model_response which is the single required Output Variable in a Schema. However, if your system returns multiple outputs, you can write each of them to a respective Output Variable. For example your system might generate an image, and a description of it. In this case, the description would be provided as the model_response Output variable, and the location of the image might be written to a custom image_response Output Variable you define.

Defining Custom Output Variables During Project Creation
Defining Custom Output Variables During Project Creation

Creating and Editing a New Project

You can create as many Projects as you like from the Projects page. When you create a Project, you will define its name, a description, and the Schema of the Project. The Scorecard Default Schema of user_query, context, ideal, and model_response is always included in a Project Schema, however you can add additional variables if you like. You can edit a Schema along with all other Project details any time in the Project Settings tab.

Built with