Warning
By downloading and using the datasets provided by GuardBench, you agree with the license of the specific datasets you intend to use. If you are unsure about what you can and can't do with specific datasets, please refer to their original repositories and owners.
To download some datasets you need to be logged into HuggingFace. If you don't have an account, create one. Then, create/copy an access token from your profile settings. Lastly, log into HuggingFace using the HuggingFace CLI:
huggingface-cli login
To download DecodingTrust, you must first fill this form.
To download HEx-PHI, you must first fill this form.
To download ToxiGen, you must first fill this form. We suggest to use the same email address of your HuggingFace account.
Datasets are downloaded and formatted on the first request and stored in ~/.guardbench/datasets
for later use.
You can also download and format all datasets in advance with the following command:
python -c "import guardbench; guardbench.download_all()"