Skip to content

Add multi-model smoke tests #457

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 2 commits into from
Jun 19, 2024
Merged

Add multi-model smoke tests #457

merged 2 commits into from
Jun 19, 2024

Conversation

njhale
Copy link
Member

@njhale njhale commented Jun 7, 2024

Add a multi-model smoke test workflow to detect breaking changes to core GPTScript features.

The smoke test runner uses gpt-4o to perform a fuzzy equality check between existing golden files and event stream files generated at runtime.

To add a new test case, just create a new directory and the GPTScript you want to test; e.g. pkg/tests/smoke/testdata/<test-case>/<script>.gpt, set your GPTSCRIPT_DEFAULT_MODEL (and the respective auth environment variables), and run the smoke make target. This will generate the initial, model-specific, golden file in the pkg/tests/smoke/testdata/<test-case> directory. Successive runs will then reference this file for comparison.

In order for the workflow to run for PRs from external contributors -- that aren't members of the gptscript-ai org -- a member must first add the run-smoke label to the PR. This gate gives members a chance to review the PR first and ensure that it doesn't contain code that would compromise the org's GitHub secrets.

Note: The tests are sparse right now, but I'd like to get the framework in first to make sure it gels with folks and all the labeling safeguards work in the real repo since I've been testing against my fork. If that sounds good, I'll follow up with more test cases immediately.

@njhale njhale force-pushed the test/smoke branch 8 times, most recently from d7e1e68 to 5127250 Compare June 19, 2024 18:18
@njhale njhale requested review from drpebcak and g-linville June 19, 2024 19:01
@njhale njhale merged commit 57d5f00 into gptscript-ai:main Jun 19, 2024
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants