Measuring LLMs' ability to develop exploits
Posted 2 hours ago by Kneenex
3 points
https://red.anthropic.com/2026/exploit-evals/
0 comments