AI & ML interests

None defined yet.

SRRY-Bench: Systematically Evaluating LLM Safety Refusal

sorry-bench (SORRY-Bench)

AI & ML interests

None defined yet.

SRRY-Bench: Systematically Evaluating LLM Safety Refusal