Rubric Development

Create a rubric to evaluate your best prompts

Farsight utilizes the Prometheusarrow-up-right prompting methodology to auto-evaluate system prompts.

To begin creating your rubric, we suggest

1) Synthesizing a description of your use case. For example:

To develop a secure and efficient internal HR chatbot for a financial
institution that assists employees with HR-related queries while ensuring the 
protection of private personal information.

2) Prompting by using chatGPTarrow-up-right to generate your evaluation rubric with your use case in the Prometheusarrow-up-right prompt. To do so, prompt the chat as follows:

Given this use case: To develop a secure and efficient internal HR chatbot for a financial
institution that assists employees with HR-related queries while ensuring the 
protection of private personal information.

I would like to create an evaluation rubric to effectively evaluate chat bot 
responses. Can you provide an example {instruction}, example {reference_answer}, 
can fill in the {criteria_description}, and the five {score_descriptions} for my 
use case? Please keep the rest of the format exactly the same. Please create one 
evaluation rubric from 1 to 5 with no subcategories.

### Reference Answer (Score 5):
{reference_answer}

### Score Rubric:
[{criteria_description}]
Score 1: {score1_description}
Score 2: {score2_description}
Score 3: {score3_description}
Score 4: {score4_description}
Score 5: {score5_description}

Please provide a single, consolidated rubric for evaluating these criteria.

Example Response:

Here is an example response from chatGPT, simply input the reference answer and score rubric into our get_best_prompt function with a few different prompts to evaluate.

Last updated