A Case Study with the StrongREJECT Benchmark – The Berkeley Artificial Intelligence Research Blog

by Techaiapp
16 minutes read

A Case Study with the StrongREJECT Benchmark – The Berkeley Artificial Intelligence Research Blog

When we began studying jailbreak evaluations, we found a fascinating paper claiming that you could jailbreak frontier
Send this to a friend