Skip to content

Conversation

@jordanhunt22
Copy link
Collaborator

@jordanhunt22 jordanhunt22 commented Mar 12, 2025

Adds Convex code for persisting results for a test run in a deployment. This includes authentication so that only we can update the scores. I tested this locally to make sure it works correctly.

For this to work properly, I need to set CONVEX_AUTH_TOKEN and CONVEX_EVAL_ENDPOINT in CI.

This enables us to have external places (e.g. the website) be able to read the most recent eval scores.

Currently, this only stores the most recent scores, but maybe we want to store the scores for every run.

@jordanhunt22 jordanhunt22 requested a review from sujayakar March 12, 2025 22:39
@jordanhunt22 jordanhunt22 merged commit 2de26e0 into main Mar 13, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

3 participants