Measuring LLMs' ability to develop exploits

  • Posted 2 hours ago by Kneenex
  • 3 points
https://red.anthropic.com/2026/exploit-evals/

0 comments