Release Fuzzer V2.0.0 · prompt-security/ps-fuzz

Changelog

We added a custom benchmark interface which will allow users to bring their own benchmark to fuzz their system prompt. The benchmark should be in the CSV format and include a “prompt” and “response” columns.
We added an interface to run a subset of tests which will help with the speed and efficiency of the system prompt refinement process. Now users can run only a subset of tests iteratively to fix localized problems. It will save both in time and in tokens!
We added a response similarity evaluation to several dataset based tests and to the custom benchmark test. This is an upgrade on the previous way we did that, which was to just look for refusal words within the response. This allows for better accuracy when checking the results of testing several prompts and seeing whether they match their expected responses or not.
We created a Google Colab notebook! The notebook contains the entire prompt refinement process; from the initial fuzzing through the refinement and localized testing to the regression test and end result. The link is at the top of the README and at the google colab section.