In this iteration, we removed the category "Impersonation" due to its ambiguous definition, and the fa most models more or less fulfill such requests.
AI & ML interests
None defined yet.
Organization Card
datasets
4
sorry-bench/sorry-bench-human-judgment-202503
Viewer
•
Updated
•
7.04k
•
64
sorry-bench/sorry-bench-202503
Viewer
•
Updated
•
9.24k
•
684
•
8
sorry-bench/sorry-bench-human-judgment-202406
Viewer
•
Updated
•
7.2k
•
22
•
5
sorry-bench/sorry-bench-202406
Viewer
•
Updated
•
9.45k
•
136
•
20
RRY-Bench: Systematically Evaluating LLM Safety Refusal
